Preface


Theoretical industrial organization has made substantial 
progress since the early 1970s, and has become a central 
element of the culture of microeconomics. This book is an 
attempt to give a straightforward account of the recent 
developments and to blend them into the tradition of 
industrial organization. 

For advice, encouragement, and criticism I am partic- 
ularly indebted to Philippe Aghion, Roland Bénabou, 
Patrick Bolton, Bernard Caillaud, Franklin Fisher, Paul 
Joskow, Bruno Jullien, Eric Maskin, Patrick Rey, Garth 
Saloner, Richard Schmalensee, and Michael Whinston. 
Dilip Abreu, Kyle Bagwell, John Bonin, Joel Demski, Peter 
Diamond, Drew Fudenberg, Robert Gertner, Robert 
Gibbons, Roger Guesnerie, Oliver Hart, Bengt Holmstrom, 
Jean-Jacques Laffont, Ariel Rubinstein, Stephen Salant, 
Steve Salop, Carl Shapiro, Andrea Shepard, Marius 
Schwartz, and Oliver Williamson offered very useful com- 
ments on specific chapters. 

My debt to Paul Joskow and Richard Schmalensee— 
who encouraged me to undertake this endeavor, who 
read the entire manuscript, and who made pages and 
pages of critical comments—goes back to my days as a 
student at MIT. They are still teaching me about indus- 
trial organization. Former MIT students will recognize in 
the organization and in the choice of topics the influence 
of course 14.271, which Paul and Richard molded. My 
debt to Drew Fudenberg and Eric Maskin also goes back 
to my student days. While this debt extends well beyond
the area of industrial organization, I must acknowledge
that my vision of the game-theoretic aspects of industrial 
organization has been shaped by our ongoing collabora- 
tion. Eric taught me game theory and showed me how its 
tools could fruitfully be applied to various economic ques- 
tions. Drew's input into this book is almost too obvious 
to be acknowledged. Part II and the Game Theory User's
Manual borrow heavily from our joint research and surveys.
David Kreps, Paul Milgrom, John Roberts, and
Robert Wilson have also greatly influenced my views on
strategic behaviors in markets. My intellectual debt extends
to the many fine researchers who have built the
modern theory of industrial organization and whose work
I cite extensively.

The material in this book was taught in various forms
at the undergraduate level at the Ecole Nationale de la
Statistique et de l'Administration Economique and at the [...]

[...] lecture notes on industrial organization became Concurrence
imparfaite, published by Editions Economica in [...]

[...] economics editor Terry Vaughn, for his help and encouragement,
and to designer Rebecca Daw, for a nice treatment
of some naturally difficult material.

Very special thanks go to the skillful and multilingual
Emily Gallagher, who suffered through the typing of
many drafts, both in French and in English. With good
cheer, she spent countless evenings and weekends so that
the material could be ready for my students and then for 
publication. She did a beautiful job. The first French- 
language draft was typed by Patricia Maillebouis and 
Pierrette Vayssade. 

I gratefully acknowledge generous research grants from
the National Science Foundation, the Sloan Foundation, 
the Center for Energy Policy Research at MIT, the MIT 
Fund for the Arts, Humanities, and Social Sciences, and 
the Commissariat Général au Plan. 


Jean Tirole 
Cambridge, Massachusetts 




Introduction 


Why Should One Be Interested in Industrial 
Organization? 


This question sounds almost silly. To study industrial 
organization is to study the functioning of markets, a 
central concept in microeconomics. However, it took a 
long time and two waves of interest for industrial organization
to become one of the main fields of economics.¹

The first wave, associated with the names of Joe Bain 
and Edward Mason and sometimes called the “Harvard 
tradition,” was empirical in nature. It developed the fa- 
mous “structure-conduct-performance” paradigm, accord- 
ing to which market structure (the number of sellers in 
the market, their degree of product differentiation, the 
cost structure, the degree of vertical integration with sup- 
pliers, and so on) determines conduct (which consists of 
price, research and development, investment, advertising, 
and so forth), and conduct yields market performance 
(efficiency, ratio of price to marginal cost, product variety, 
innovation rate, profits, and distribution). This paradigm, 
although plausible, often rested on loose theories, and it 
emphasized empirical studies of industries. For instance, it 
was generally found that certain measures of conduct and 
performance were strongly linked with the market's structure;
a typical regression had the form $\Pi_i = f(CR_i, BE_i, \ldots)$,
where i denoted the industry, $\Pi_i$ denoted some
measure of firm or industry profitability, $CR_i$ was a concentration
ratio (a measure meant to summarize how noncompetitive
the industry is), and $BE_i$ (for barriers to
entry) referred to variables that measured the difficulty of 
entering the industry (approximated by the minimum effi- 
cient scale of entry, the ratio of advertising to sales, and 
so on). Other variables could be introduced in the regres- 
sion as well. The regression was run on cross-sectional 
data for a large sample of industries. 
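
To make the form of such a regression concrete, here is a minimal sketch in Python. It assumes a linear specification, profitability = b0 + b1 CR + b2 BE + noise, which the text does not commit to (f is left general), and it runs on simulated data rather than on any real industry sample. The coefficient values, the sample size, and the function names `simulate_industries` and `ols` are hypothetical and serve only as illustration.

```python
import random


def simulate_industries(n, beta, seed=0):
    """Simulated cross-section of (profitability, concentration ratio, entry barrier)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        cr = rng.uniform(0.1, 0.9)    # concentration ratio CR_i (made up)
        be = rng.uniform(0.0, 1.0)    # barrier-to-entry proxy BE_i (made up)
        noise = rng.gauss(0.0, 0.01)  # unobserved industry-specific factors
        profit = beta[0] + beta[1] * cr + beta[2] * be + noise
        rows.append((profit, cr, be))
    return rows


def ols(rows):
    """Least-squares fit of profit on a constant, CR, and BE via the normal equations."""
    X = [(1.0, cr, be) for _, cr, be in rows]
    y = [profit for profit, _, _ in rows]
    k = 3
    # Augmented system [X'X | X'y].
    A = [[sum(x[i] * x[j] for x in X) for j in range(k)]
         + [sum(x[i] * yi for x, yi in zip(X, y))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


true_beta = (0.05, 0.10, 0.04)  # hypothetical coefficients used to generate the data
estimates = ols(simulate_industries(500, true_beta))
print(estimates)
```

The fitted coefficients summarize a correlation in the cross-section; by themselves they say nothing about whether concentration causes profitability.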


1. This section draws on lectures given by Paul Joskow and Richard
Schmalensee.


2. See Schmalensee 1986 for an excellent critical survey of this approach. 


If one ignored the many issues having to do with
measurement, such regressions produced a useful array of
stylized facts. The links (or the absence of links) between
variables must, however, be interpreted as correlations [...]

[...] ratio, and the ratio of advertising to sales are jointly
endogenous. They are determined simultaneously by the
market's “basic conditions” (exogenous variables) and the
behavior of the firms.

The absence of a causal interpretation is troubling for
an analyst. What is to be made of a regression showing
that the rate of return in an industry grows with the
concentration in that industry?³ Well, it may suggest that
there is market power in fairly concentrated industries
and that the performance of such industries might not be
optimal. However, it says little about the causes of concentration
or market power, and it fails to guide our
analyst as to whether, and in what form, government
intervention can improve market performance.

The empirical tradition certainly tried to measure more
basic (exogenous) conditions: technology (returns to scale,
entry cost, proportion of capital sunk, existence of a learning
curve, durable versus nondurable good, etc.), preferences
and consumer behavior (structure of information
about product quality, reputation and brand loyalty, etc.),
“exogenous” technological change, and so on.⁴ Although
progress has been made in this direction, it is often
difficult to gather data that are accurate measures of basic
conditions and are comparable across industries.

The preceding comments are certainly too harsh a judgment
on the empirical tradition, which, after all, set an
agenda for industrial organization. I left unmentioned the
fact that many informal stories were built around the
regressions. (Actually, industry case studies preceded the
wave of regressions and were made possible by antitrust
cases. These case studies led to some informal stories.
Regressions offered complementary material on industry
behavior.) These stories, together with antitrust analysis

3. Actually, this statistical relationship is weak. Introducing a market-share
variable on the right-hand side of the regression tends to wipe out concentration-ratio
effects. One explanation may be that industries with cost heterogeneity
between firms tend to yield high concentration ratios (a few low-cost
firms produce most of the output) as well as high profits (the low-cost firms are
relatively free of competitive pressure from their rivals). The concentration
ratio variable might be picking up the effect of the missing market-share variable.
See chapter 5.


and case studies, supported the subsequent theoretical 
wave. Nor is it fair to say that more formal theory was 
completely left aside. In particular, the “Chicago tradi- 
tion,” starting with Aaron Director and George Stigler, 
emphasized the need for rigorous theoretical analysis 
and empirical identification of competing theories. The
Chicago tradition had an important methodological im- 
pact on the development of the field; it is also famous 
for its very permissive view of market behavior—for 
instance with respect to vertical restraints and predatory 
pricing (collusion is the main offense, on its view)—and 
for its relative distrust of government intervention in 
comparison with the Harvard tradition. Still, by the early
1970s it was felt that in many instances theory was more 
a way of explaining statistical results or of buttressing 
particular intellectual positions than a rigorous and sys- 
tematic investigation. For instance, Paul Joskow offered 
the following observation: 


In a sense, the ultimate test of the utility of the various models
is whether they prove useful to people involved in analyzing
problems involving actual markets or groups of markets. I
suggest that not only aren't they particularly useful but also 
that they aren't really used... Somehow one gets the distinct 
feeling that the important messages are being carried by the 
informal theories, stories, and behavioral observations, and 
that the formal models are trotted out ex-post to demonstrate 
that some kind of formal apparatus can explain or incorporate 
some of what is actually being observed. (Joskow 1975, p. 273) 


The second wave of interest, which was mainly theo- 
retical, started in the 1970s. It can be attributed to three 
factors. I have already mentioned, on the “demand side,” 
the growing dissatisfaction with the limits of the cross- 
sectional empirical analysis that had come to dominate 
the field of industrial organization. There was a wide- 
spread feeling, exemplified by the above quote, that em- 
pirical work did not appeal to particular formal models of 
oligopoly markets. On the “supply side,” two factors can 
be discerned. First, until the 1970s, economic theorists 


4. For instance, the returns to scale have been measured by various methods.
The most common may be the estimation of a cost function (estimation of the
parameters of a cost function of output level and input prices). Bain obtained
“engineering production functions” from engineering data at the plant level;
he also used “engineering managerial analysis” (asking managers what the
optimal plant size is, for instance). Still another method is George Stigler's
survivorship technique, which looks at the size of surviving firms.
(with a few exceptions) pretty much ignored industrial
organization, which did not lend itself to elegant and
general analysis the way the theory of competitive general
equilibrium did. Since then, a fair number of top theorists
have become interested in industrial organization.⁵
Second, and crucial for the topics reviewed in part II
of this book, noncooperative game theory imposed itself
(not without some dissension) as the standard tool
for the analysis of strategic conflict, thus bringing a
unified methodology to the field. Furthermore, it made
serious progress in two crucial areas: dynamics and
asymmetric information. The stage was thus set for a
reappraisal of the many informal stories that were floating
around.

This book is mainly concerned with the accomplishments
of the second wave, but I have tried not to forget
that earlier contributions laid the foundations for this
theoretical work. I think there is now sufficient agreement
about the methodology to be employed in the theoretical
study of industries to warrant such a text.

I have mentioned some historical developments without
defining industrial organization (IO) or emphasizing its
importance. I would actually like to avoid giving a precise
definition of the field, as its frontiers are fuzzy. IO certainly
begins with the structure and behavior of firms
(market strategy and internal organization). This business-strategy
aspect may explain why a few of the outstanding
contributors in the area have appointments in business
schools. But there is more to IO than business strategy.
The other side of the coin is the outsider's (the academic
economist's, the civil servant's, or the antitrust practitioner's)
assessment of market efficiency. Imperfectly competitive
markets (that is, most real markets) are unlikely to
maximize social welfare. This does not necessarily mean
that the government (the “social planner”) can improve
on the private outcome given its structure of information,
nor does this observation indicate when and how the
government should intervene. It does tell us that analyses
that rely on models of perfect competition may be quite

unsatisfactory from a positive and from a normative 
perspective. 

The scope of government intervention is itself ill de- 
fined. Roughly, the promotion of competition through 
antitrust action, as well as certain forms of “market regulation”⁶
(taxes and subsidies, minimum quality standards,
etc.), will be considered to lie within the scope of this
book. Other instruments, such as price and entry or 
economic regulation at the firm level (including monop- 
oly franchising, governmental procurement, and national- 
ization) will not. This division has some unfortunate effects. 
In particular, I will not be considering all modes of intervention
in my models. My only excuse (and one that I
will use for other purposes) is that the book is already 
very long. Considering selective regulation could well 
have doubled its length. 


Theory versus Evidence 


Industrial organization has become a fairly theoretical 
field in recent years. At first sight, even a theorist should 
regret the very high ratio of theory to evidence in a field 
in which theoretical models are often lacking in generality 
and in which practical implications are so crucial. While I
feel there is an imbalance in the field, I also think the 
theoretical evolution has been very healthy. 

The “new theoretical IO,” having drawn from the old
oral tradition of behavioral stories as well as from stylized
facts,⁷ can, I believe, help the people involved in analyzing
actual markets. Not only has it formalized some
of the old informal stories; it also has rejected others. I
do not want to overemphasize the practical contributions
of the theory. It may have put too much emphasis on
positive (explanatory) analysis, to the detriment of normative
(welfare) analysis, and it has done too little to help
practitioners distinguish between competing theories. But
it definitely has practical content.

Furthermore, the theoretical contributions should soon
feed back to empirical analysis.⁸ They suggest what evidence
to look for, separate the endogenous from the
exogenous variables, and highlight the hypothesis to be
tested.


5. This hardly explains the “performance” of recent theoretical research in IO.
The “structure” of the market for theoretical research in the field (e.g., the
number of top theorists with permanent or temporary interest in the area) is
itself endogenous. One must look for more basic conditions that explain the
inflow.


6. By “market regulation” I mean regulation that treats all firms in an industry
(including potential ones) symmetrically.


7. Scherer 1980, a remarkable collection of facts about firm behavior, has been
very instrumental in the development of the current theory.


8. The new intraindustry empirical studies are a good sign that such an
evolution might take place. Some excellent contributions along these lines are
Bresnahan 1987a, Joskow 1985, and Porter 1983. See Bresnahan 1987b for a
useful survey.

Econometric analysis certainly isn't the only way of
doing empirical research in IO. Because of unsatisfactory
data, many applied researchers are paying more attention
to the development of evidence on firm and industry
behavior and performance through detailed case studies
of firms or industries (to which one can add the evidence
accumulated for antitrust purposes). Although these studies
have their own drawbacks, they have yielded many
interesting insights. Indeed, IO theorists have often felt
more comfortable with case studies than with statistical
analysis, perhaps because it may be easier to recover the
industry's basic conditions and behavior from rich case
studies than from selective statistics about profit, concentration,
advertising, and so on drawn from a very large
sample of disparate industries.

Still another method of collecting evidence that can
benefit from the theoretical developments is the running
of controlled experiments in laboratories.⁹

Thus, it is hoped that these three approaches to empirical
work will be strengthened by the new theoretical
developments. The book does not list the empirical implications
of each model and does not explain how competing
models might be distinguished. However, I hope
that the presentation of the models is intuitive enough to
highlight their testable features.


Scope of the Book 


The book does not cover the empirical side of the
field (including the antitrust experience). It also ignores
some of the broad theoretical issues, such as economic
regulation, international industrial organization,¹⁰ imperfect
competition in general equilibrium,¹¹ and the link
between IO and macroeconomics.¹²

The methodology is also defined narrowly. Part I assumes
optimizing behavior, and part II uses a generalization
of optimization to multiperson decision making: the
theory of noncooperative games. The book does little
justice to alternative approaches, such as satisficing
(bounded rationality). The gain from this omission is a
unified treatment.

To simplify things, I treat firms as single decision
makers that maximize profits. In most of the book, prob- 
lems of managerial control by shareholders, banks, or the 
capital market are assumed away. Delegation and control 
within a firm are also ignored. The preliminary chapter on 
the theory of the firm discusses these assumptions. Some
allusions are made to agency problems in chapters 4 and
9. Because a fair treatment of agency theory would require
a book in itself, I content myself with mentioning
the issues. The topics will cry out loud for further development.
And, indeed, I believe that the intersection between
organization theory and IO is one of the most
interesting areas for theoretical research in the years to
come.

I clearly had to choose which topics to emphasize—a 
sometimes painful exercise. Although my choices reflect 
my own preferences, they should not necessarily be mis- 
taken for value judgments. First, they are contingent on 
my current state of knowledge and reflection. I apologize
to the authors whose contributions I underemphasized
or left out because of ignorance, imperfect recall, or insufficient
perspective. Second, the choices are sometimes
guided by an expositional strategy. Some interesting con- 
tributions that would require long or technically difficult 
exposition are relegated to footnotes, remarks, or exercises. 


How to Use the Book 
General Organization 


Part I (chapters 1 through 4) looks at those features of
market behavior that are not related to (but are certainly
not inconsistent with) strategic behavior. It considers a
monopolist's choices of price and quality, the spectrum of
goods, advertising, and the distribution structure. Most of
the conclusions obtained there carry over to oligopolies.
Part II analyzes the choice of price, capacity, product
positioning, research and development, and other strategic
variables in oligopoly. It makes heavy use of some [...]

[...] game theory would be useful, but it is by no means a
prerequisite for part II. I suggest that those unfamiliar
with game theory read up to section 11.4 of the “user's
manual” while progressing through part I of the book and
read sections 11.4 and 11.5 before studying chapter 9.


9. For a survey of this approach to IO, see Plott 1982.


10. See Helpman and Krugman 1985 for a recent contribution to IO in an open
economy.


11. See Hart 1985 for a good survey of the corresponding literature.
12. See Carlton 1987.

As was mentioned above, part I is preceded by a discussion
of the theory of the firm. The firm is the basic
object of the book, and therefore we ought to inquire into
its nature and objectives. The study of the firm is a preliminary
to the analysis of markets. I fear (and, at the same
time, hope) that the reader will find the discussion somewhat
unsatisfactory. As it stands, it may seem intended to
ease my conscience (“Now that we have talked about the
firm, we can ignore it and treat it as a profit-maximizing
black box”) by allowing me to pay lip service to the
relationship between internal organization and market
structure. In fact, I include this discussion not because it
enhances the book through its complementarity with subsequent
chapters but because I believe that the theory of
the firm is a crucial topic in economics as well as an
integral part of industrial organization.


Relationships among Chapters 


The chapters are relatively self-contained. Nonetheless,
some connections are to be drawn. For instance, chapters
5 and 6 generalize parts of chapter 1 to strategic behavior,
and chapter 7 does so with respect to chapter 2. The
rent-dissipation hypothesis, mentioned in chapter 1, is [...]

[...] stressed when it is appropriate.


Organization within Chapters 


Each chapter is divided into a main text and a supple- 
mentary section. Undergraduates, first-year graduate 
students, and scholars unfamiliar with IO are advised to
focus on the main text. Other graduate students and 
scholars familiar with IO will find some more advanced 
material in the supplementary section. 


Exercises 


Exercises have been included to help the reader become 
familiar with the concepts and to broaden his or her 
knowledge. Some exercises appear within a chapter; their 
solutions are sketched at the end of the chapter. In addi- 
tion, review exercises (without answers) are offered at the 
back of the book. Readers who are not able to solve 
exercises in the text should not be discouraged; some of 
them are difficult. Those readers will find easier work in 
the review exercises. I have indexed the exercises by
asterisks to reflect their difficulty: 


*: simple application of concepts developed in the text
**: more difficult; requires more thought
***: advanced; the most challenging.


Prerequisites 
Economics 


An intermediate microeconomics course is desirable. 
Otherwise, the book is fairly self-contained. I have tried 
to give some motivation for the models when needed. 
Some familiarity with stylized facts, however, would give 
the reader a better perspective on these models. A pre- 
liminary or simultaneous reading of Scherer’s classic text 
(1980) would be valuable. Books on business strategy 
(e.g., Porter 1980) and antitrust policy (e.g., Areeda 1974,
Areeda and Turner 1976, Blair and Kaserman 1985, Posner
and Easterbrook 1981) may also prove useful in this
respect.


Mathematics 


I try to present theories in a “reader-friendly” form. I 
often choose specific functional forms (such as linear de- 
mand) over general ones, two-period models over general 
dynamic ones, and duopoly situations over oligopolies. 
My hope is that the intuition behind the phenomena 
studied here will emerge strongly enough to convince the
reader that the results have some robustness.

Very little mathematical knowledge is required. For
the most part, elementary notions of calculus (e.g., unconstrained
optimization) will suffice. The reader should
know how to derive the first-order and second-order conditions
associated with a maximization problem, be aware
of the envelope theorem¹³ and of the chain rule of differentiation,
and have a few notions about concavity.¹⁴
Some further notions will be used for specific, isolated
[...] integration by parts, Bayes' rule, dynamic programming
[...] The required notions can generally be found
[...] form in the mathematical appendixes of Dixit
[...] 1980 or those of Varian 1978.¹⁵


Market Definition, Partial Equilibrium, and Welfare
Criteria
The Competitive Paradigm


The best-developed and most aesthetically pleasing model
in the field of economics is the competitive-equilibrium
paradigm of Arrow and Debreu.¹⁶ In brief, this model
goes as follows: The model starts with a very fine description
of available goods. An economic good is characterized
by its physical properties, the date on which and
the state of nature in which it is available, its location,
and so forth. Consumers are perfectly informed about all
goods' properties and have preferences over bundles of
goods. Producers (firms), which are owned by consumers,
are endowed with production-possibility sets. A paradigm
of market organization is then added. All agents are
price takers. The consumers maximize their welfare given
that their expenditures must not exceed their income
(which stems from their endowment and their ownership
of firms). This gives rise to demand functions (“correspondences,”
if there are several welfare-maximizing bundles).
Producers maximize profits over their technological
possibilities, giving rise to supply functions (or correspondences).
A competitive equilibrium is a set of prices,
with associated demands and supplies, such that all the
markets (one for each good) clear (i.e., total demand does
not exceed total supply).
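
The price-taking logic just described can be illustrated with a one-good sketch in Python. This is a partial-equilibrium toy, not the Arrow-Debreu model: the linear demand, the quadratic cost function, the number of firms, and all names below are assumptions made only for the example. Each firm takes the price as given and chooses output to maximize profit, and the market-clearing price is found by bisection on excess demand.

```python
A, B = 10.0, 1.0   # hypothetical demand parameters: D(p) = A - B*p
C = 2.0            # hypothetical cost parameter: cost(q) = C*q**2/2
N_FIRMS = 4        # hypothetical number of identical price-taking firms


def demand(p):
    """Aggregate demand of price-taking consumers."""
    return max(A - B * p, 0.0)


def firm_supply(p):
    """A price taker maximizes p*q - C*q**2/2, so price = marginal cost: q = p/C."""
    return p / C


def supply(p):
    """Aggregate supply of the N_FIRMS identical firms."""
    return N_FIRMS * firm_supply(p)


def clearing_price(lo=0.0, hi=100.0, tol=1e-10):
    """Bisection on excess demand, which is decreasing in the price."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if demand(mid) > supply(mid):
            lo = mid   # excess demand: the price must rise
        else:
            hi = mid   # excess supply: the price must fall
    return (lo + hi) / 2.0


p_star = clearing_price()
marginal_cost = C * firm_supply(p_star)  # marginal cost at each firm's output
print(p_star, marginal_cost)
```

With these made-up numbers the market clears where 10 - p = 2p, and each firm's marginal cost at its chosen output coincides with the clearing price.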

Weak assumptions about preferences and technological 
possibilities yield general results on competitive equilib- 
rium. The best-known of these may be the two funda- 
mental welfare theorems. Roughly stated, the first says 
that a competitive equilibrium is Pareto optimal (that is, a 
benevolent and fully informed social planner could not 
replace the competitive allocation of goods with another 
feasible allocation that would increase all the consumers’ 
welfare) and the second asserts that, under convexity 
assumptions (which rule out increasing returns to scale), 
any Pareto-optimal allocation can be decentralized (imple- 
mented by a market organization) by a choice of the right 
prices and an appropriate redistribution of income among 
consumers. 

A key property of competitive equilibrium is that each 
good is sold at marginal cost. A producer would increase 
profit by expanding production of the good if its price 


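13. The envelope theorem states that the derivative of the value of a maximization
problem (i.e., of the maximized objective function) with respect to an
exogenous parameter is equal to the partial derivative of the objective function
with respect to the parameter. That is, only the direct effect of the change in
the parameter should be taken into account (and not the indirect effect through
the change in the endogenous (control) variables, which has only a second-order
effect). For instance, if

\[ V(a) = \max_x f(x, a), \]

then

\[ \frac{dV}{da} = \frac{\partial f}{\partial a}\bigl(x^*(a), a\bigr), \]

where $x^*(a)$ is the optimal control variable.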


14. A function $f(x)$, where x is a vector of $\mathbb{R}^n$, is concave if, for any $\lambda$ in (0, 1)
and all x and x',

\[ f(\lambda x + (1 - \lambda) x') \ge \lambda f(x) + (1 - \lambda) f(x'). \]

For a differentiable function, an alternative characterization of concavity is the
following: For all x and x',

\[ f(x) \le f(x') + \sum_i \frac{\partial f}{\partial x_i}(x')\,(x_i - x_i'). \]

(For x in $\mathbb{R}$, the reader can check this inequality by making a diagram.)

The function f is quasi-concave if the sets in $\mathbb{R}^n$ defined by $\{x \mid f(x) \ge y\}$ are
convex for all y. For x in $\mathbb{R}$, a slightly stronger and sufficient condition for
quasi-concavity is that $f'' < 0$ whenever $f' = 0$. As a diagram will easily show,
this stronger notion of quasi-concavity (which is weaker than concavity) is all
that is needed for the second-order conditions in an optimization program to
be satisfied. We will rarely need this notion, however.


15. More detailed treatments of optimization for economists include Dixit
1976 and Kamien and Schwartz 1981. 


16. See the beautiful treatments by Debreu (1959), Arrow and Hahn (1970),
and Mas-Colell (1985). The reader will find simpler versions in Varian 1978 (at 
the graduate level) and in various undergraduate microeconomics texts. 
exceeded his marginal cost. Conversely, if he produced
the good at all, he would contract production if the
marginal cost were to exceed the price. This trivial result
has important implications. When deciding whether to
consume one more unit of the good, a consumer faces
a price that is socially the “right one” and internalizes
the cost of producing this extra unit. This is part of the
intuition behind the Pareto optimality of competitive
equilibrium.

The first fundamental welfare theorem strongly limits
the scope of industrial organization. The organization of
industries in such a world is necessarily efficient. The
only potential concern for policy is income distribution
among consumers, which the social planner may not
judge optimal.¹⁷

The competitive-equilibrium paradigm makes relatively
weak assumptions about preferences and production possibilities,
but only within a given class. Among the conditions [...]

Partial Equilibrium, Downward-Sloping Demand 
Curves, and Consumer Surplus 


Once some of the assumptions of competitive-equilibrium 
analysis are relaxed, very little can be said about eco- 
nomic allocations without more specific assumptions, as 
the “theory of the second best” has taught us. One of 
the costs of moving toward more realistic models of the 
organization of industries is the adoption of a partial- 
equilibrium setup. A good (or a group of related goods) 
is singled out, and the interaction with the rest of the 
economy is ignored. 

We will come back to the notion of market shortly; for 
the moment, let us consider the validity of two assump- 
tions that will be made throughout the book: that the 
demand for a good decreases with its price and that 
changes in consumer welfare can be measured by the 
so-called consumer surplus. 

First, the notion of consumer surplus: Consider the 
market for a single good. The demand for this good 
is assumed to decrease with its own price and to be 
independent of the prices of other goods and of the 
consumers’ income. To make this rigorous, consider 
“quasi-linear” utility functions: 


\[ U(q_0, q_1, \ldots, q_m) = q_0 + \sum_{h=1}^{m} V_h(q_h), \]


where good 0 is the numéraire and the functions $V_h$ are
increasing and concave. Maximizing U subject to the
budget constraint


\[ q_0 + \sum_{h=1}^{m} p_h q_h \le I, \]


where I is the consumer's income, yields $V_h'(q_h) = p_h$ for
all h. Thus, each consumer's demand function for good h,
and therefore the aggregate demand function, satisfies the
above conditions. (For the more general quasi-linear
preferences $U(q_0, q_1, \ldots, q_m) = q_0 + W(q_1, \ldots, q_m)$,
the demand functions exhibit cross-price effects but no income
effect.)


... for IO purposes. Some positive implications can be derived from simple
competitive models. For instance, there exists an interesting literature (not reviewed
in this book) that uses the competitive paradigm to study, in a dynamic
..., the process of entry and exit in an industry and/or to find theoretical
foundations for Gibrat's law (according to which the rate of growth of firms


18. An externality arises when the consumption of a good by a consumer
directly affects the welfare of another consumer, or when a firm's production
affects other economic agents. A consumer who increases the size of a
telephone network by connecting to the network exerts a positive externality on
other consumers. A firm that pollutes a river exerts a negative externality on
consumers and other firms.


19. A public good is a good that can be consumed simultaneously by several
consumers (e.g., national defense or a TV program).


20. For an ingenious estimation of the divergence between price and marginal
cost in various industries, see Hall 1986.
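The absence of an income effect under quasi-linear utility can be checked numerically. The sketch below is only an illustration under assumed primitives: the function $V(q) = 2\sqrt{q}$, the price, and the two income levels are hypothetical choices, not taken from the text. It maximizes $q_0 + V(q)$ along the budget line by ternary search and compares the result with the solution of $V'(q) = p$, which should not depend on income as long as the optimum is interior.

```python
import math


def utility_on_budget_line(q, price, income):
    """U = q0 + V(q), with q0 = income - price * q and V(q) = 2 * sqrt(q)."""
    return (income - price * q) + 2.0 * math.sqrt(q)


def optimal_quantity(price, income, iterations=200):
    """Ternary search for the maximizer of a concave function on [0, income / price]."""
    lo, hi = 0.0, income / price
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if utility_on_budget_line(m1, price, income) < utility_on_budget_line(m2, price, income):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)


price = 0.5  # hypothetical price of the good
for income in (5.0, 50.0):  # hypothetical income levels
    q = optimal_quantity(price, income)
    # V'(q) = 1 / sqrt(q) = p implies q = 1 / p**2, whatever the income.
    print(income, round(q, 4), 1.0 / price**2)
```

A tenfold increase in income leaves the demand for the good unchanged; all additional income is spent on the numéraire.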

Consider a homogeneous good. Dupuit (1844) introduced
the first welfare measurement. (Dupuit's consumer
surplus is also sometimes called the Marshallian consumer
surplus; see Marshall 1920, p. 811. Henceforth,
we will call it simply the consumer surplus.²¹) In figure 1,
the consumer surplus is defined as the area between the
demand curve and the horizontal line at the price level $p^0$.


purchase either 0 or 1 unit of the good. The consumers
are heterogeneous, in that they have different valuations
or willingnesses to pay ($v_i$) for the good, expressed in
terms of money (that is, for the quasi-linear utility
functions discussed above, $V_h(\cdot)$ is a step function for each
consumer, equal to zero for a consumption of good h
lower than 1 and equal to the consumer's willingness to
pay for good h for a consumption equal to or greater than
1). Without loss of generality, let us rank these consumers
by order of decreasing valuations: $v_1 \ge v_2 \ge \cdots$. A
consumer with valuation $v_i$ purchases if and only if $v_i \ge p^0$.
The first consumer realizes a surplus $v_1 - p^0$, because he
was willing to pay $v_1$. The second consumer realizes a
surplus $v_2 - p^0$, and so forth until the marginal consumer
(call him n), who realizes approximately no surplus. The
total consumer surplus is then

$$S^n = \sum_{i=1}^{n} (v_i - p^0) = \int_{p^0}^{\bar{p}} D(p)\,dp. \tag{1}$$

The gross consumer surplus, $S^g$, is equal to the net consumer surplus plus the
consumer expenditure $p^0 D(p^0)$. $\bar{p}$ denotes the choke-off
price (the lowest price at which there is no demand). It is
equal to $v_1$ in the discretized version, but it could also be
taken to be infinity without any change in the formula.

Figure 1
Consumer surplus.
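The unit-demand construction can be made concrete with a small numerical sketch. The valuations and the price below are made-up numbers, and the function names are hypothetical; the point is only to check that adding up $v_i - p^0$ over the buyers gives the same number as the area between the step demand curve and the price line.

```python
def net_surplus_sum(valuations, price):
    """Sum of (v_i - p) over the consumers who buy, i.e. those with v_i >= p."""
    return sum(v - price for v in valuations if v >= price)


def demand(valuations, p):
    """Unit-demand aggregate demand: number of consumers with v_i >= p."""
    return sum(1 for v in valuations if v >= p)


def net_surplus_area(valuations, price):
    """Area under the step demand curve between `price` and the choke-off price.

    Demand is constant between consecutive valuations, so the integral is an
    exact sum of rectangles.
    """
    cuts = sorted({price} | {v for v in valuations if v > price})
    area = 0.0
    for lo, hi in zip(cuts, cuts[1:]):
        # For prices in (lo, hi], demand is the number of consumers with v_i >= hi.
        area += (hi - lo) * demand(valuations, hi)
    return area


valuations = [10.0, 8.0, 7.0, 5.0, 3.0]  # hypothetical v_1 >= v_2 >= ...
p0 = 4.0                                 # hypothetical price
print(net_surplus_sum(valuations, p0), net_surplus_area(valuations, p0))
```

Here four consumers buy, and both computations give a net surplus of 14.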

Let us now consider a single consumer with a
downward-sloping demand $D(\cdot)$ for the good. Dupuit's reasoning
is that this consumer can be thought of as composed
of consumers with unit demands. That is, he is willing to
pay $v_1$ for the first unit purchased, $v_2$ for the second, and
so forth. Overall, his net surplus from consuming $q^0$ units
of the good at price $p^0$ is given by equation 1. From now
on, we will consider a single consumer. Only later will we
come back to multiple consumers.

The changes in net and gross consumer surplus when
the consumer price moves from $p^0$ to $p^1$ are defined by
the following equations:

$$\Delta S^n = -\int_{p^0}^{p^1} D(p)\,dp,$$

$$\Delta S^g = -\int_{p^0}^{p^1} D(p)\,dp + [p^1 D(p^1) - p^0 D(p^0)]. \tag{2}$$


The producer surplus is defined as the profit of the firm 
in the industry. Figure 2 shows the marginal-cost curve 
(which coincides with the supply curve under perfect 
competition). Profit is equal to revenue ($p^0 D(p^0)$) minus


21. See Auerbach 1986 for an extensive review of various measures of surplus
and excess burden. See also the classic discussions of consumer surplus by Hicks
(1941) and Samuelson (1947).

A famous application of this is the derivation of a
monetary measure of the welfare loss (“dead-weight loss”)


23. Hotelling (1938) and Harberger (1964) later proved the result more
formally and generalized it to several goods. Boiteux (1951) and Debreu
(1951) gave important measures of dead-weight loss in a general-equilibrium
context.


the total cost from the gross consumer surplus. Monetary
transfers among consumers, producers, and government
are irrelevant to the computation of the total surplus.)
As Dupuit suggested, the dead-weight loss can thus be
approximated by

$$\tfrac{1}{2}\, t\, |q^1 - q^0| = \tfrac{1}{2}\, t^2\, |D'(p^0)|$$

for a small tax and for a constant marginal cost.²³
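The quality of this triangle approximation can be illustrated numerically. The sketch below rests on assumptions that are not in the text: t is taken to be a per-unit tax fully passed on to consumers (constant marginal cost c, so the price moves from c to c + t), and the demand curve $D(p) = 100/p^2$ is a hypothetical example chosen because its integral has a closed form. The exact loss is the fall in consumer surplus minus the tax revenue.

```python
def demand(p, scale=100.0):
    """Hypothetical demand curve D(p) = scale / p**2."""
    return scale / p**2


def demand_slope(p, scale=100.0):
    """Derivative D'(p) of the hypothetical demand curve."""
    return -2.0 * scale / p**3


def exact_dwl(c, t, scale=100.0):
    """Lost consumer surplus minus tax revenue when the price rises from c to c + t."""
    lost_surplus = scale * (1.0 / c - 1.0 / (c + t))  # integral of D(p) from c to c + t
    tax_revenue = t * demand(c + t, scale)
    return lost_surplus - tax_revenue


def triangle_dwl(c, t, scale=100.0):
    """Approximation (1/2) * t**2 * |D'(c)|."""
    return 0.5 * t**2 * abs(demand_slope(c, scale))


for tax in (0.01, 0.1, 0.5):
    print(tax, exact_dwl(1.0, tax), triangle_dwl(1.0, tax))
```

With this demand curve the ratio of the exact loss to the approximation is $c^2/(c+t)^2$, so the two agree closely for a small tax and drift apart as the tax grows.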


The rest of this section is more technical than most of 
the introduction and can be skipped in a first reading. It 
discusses the extension of consumer surplus to the multi- 
product case, and then goes on to find conditions under 
which the demand for each good is downward sloping 
and under which the consumer surplus is a good approxi- 
mation of welfare. 


There are two obvious questions about consumer surplus:
Does it generalize to more than one good? Can it be
expressed in terms of classical demand theory? To answer
these questions, consider general demand functions
$q_h = D_h(p, I)$, where the demand for good h depends on the
price vector p and the consumer's income I.

The generalization to a group of several goods seems
straightforward; one can simply add up the consumer
surpluses for the various goods. The variation in net
consumer surplus from vector $p^0$ to vector $p^1$ is

$$\Delta S^n = \sum_h \Delta S_h^n = -\int_{p^0}^{p^1} \sum_h D_h(p, I)\,dp_h. \tag{3}$$

An unfortunate feature of this consumer surplus with
many goods is that equation 3 does not always define a
unique number. The integral in general depends on the
path of integration from the initial price $p^0$ to the final
price $p^1$, as is easily checked. It is path independent (and
thus well defined) only if the demand functions exhibit no
income effect (or, more generally, if the cross-partial
derivatives of the demand functions are equal).²⁴
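Path dependence is easy to exhibit numerically. The following sketch uses a hypothetical two-good consumer with utility $\ln q_1 + q_2$ and income I = 10 (neither is from the text), whose ordinary demands are $D_1 = p_2/p_1$ and $D_2 = I/p_2 - 1$. The cross-partial derivatives differ ($\partial D_1/\partial p_2 = 1/p_1$, whereas $\partial D_2/\partial p_1 = 0$), so the integral in equation 3 should depend on which price is moved first.

```python
import math

INCOME = 10.0  # hypothetical income, large enough that good 2 is consumed


def d1(p1, p2):
    """Demand for good 1 under U = ln(q1) + q2."""
    return p2 / p1


def d2(p1, p2):
    """Demand for good 2: the income left after spending p2 on good 1, divided by p2."""
    return INCOME / p2 - 1.0


def integrate(f, a, b, steps=20000):
    """Midpoint-rule integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def surplus_change(start, end, first):
    """Minus the line integral of demand along an L-shaped price path.

    `first` says which price (1 or 2) is moved first; the other one follows.
    """
    (a1, a2), (b1, b2) = start, end
    if first == 1:
        leg1 = integrate(lambda p: d1(p, a2), a1, b1)
        leg2 = integrate(lambda p: d2(b1, p), a2, b2)
    else:
        leg2 = integrate(lambda p: d2(a1, p), a2, b2)
        leg1 = integrate(lambda p: d1(p, b2), a1, b1)
    return -(leg1 + leg2)


start, end = (1.0, 1.0), (2.0, 2.0)
gap = surplus_change(start, end, first=1) - surplus_change(start, end, first=2)
print(gap, math.log(2.0))
```

For these demands the two paths differ by $(p_2^1 - p_2^0)\ln(p_1^1/p_1^0)$, which equals $\ln 2$ in this example, so the "change in consumer surplus" is not a single well-defined number.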

This drawback is actually related to the link with foun- 
dations in demand theory. Hicks (1946) introduced two 


24. To see this, consider two prices: change one first and then the other,
and conversely. Taking the difference between the two measures and writing
demand functions as the integral of their partial derivatives yields a term in
$\partial D_1/\partial p_2 - \partial D_2/\partial p_1$. This term would be equal to zero if the demands were
compensated ones (from the symmetry of the Slutsky matrix); however, these
demands are ordinary ones. For a reminder of the notion of compensated
demand, see Varian 1978. For a good presentation of the problems studied
here, see Auerbach 1986.

further monetary measures for changes in consumer utility ...


... called the substitution effect. The second term, the
income effect, is negative for a normal good and positive
for an inferior good (and is equal to zero in the case of
quasi-linear utility functions). It stems from the


Figure 3
Compensating and equivalent variations and consumer surplus.


Consider a single price change (that of good h, say).

It is easily seen that the equivalent (respectively,
compensating) variation from price $p^0$ to price $p^1$ is equal to
the area under the compensated-demand curve at utility
level $u^1$ (respectively, $u^0$).²⁶ Using the facts that
$D(p^0, I) = D^c(p^0, u^0)$ and $D(p^1, I) = D^c(p^1, u^1)$, we can represent
the consumer surplus and the equivalent and compensating
variations as in figure 3 (which depicts the case of
a normal good). The change in consumer surplus is equal
to the area A + B, the equivalent variation is equal to the
area A, and the compensating variation is equal to the
area A + B + C.²⁷
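The ordering of the three measures can be verified in closed form for a simple normal good. The sketch below assumes a Cobb-Douglas consumer who spends a fixed share of income on the good (an illustrative assumption, as are the numbers used). With demand $D(p, I) = aI/p$, the expenditure function is proportional to $p^a$, so for a price increase from $p^0$ to $p^1$ the magnitudes are $EV = I[1 - (p^0/p^1)^a]$, $CS = aI\ln(p^1/p^0)$, and $CV = I[(p^1/p^0)^a - 1]$.

```python
import math


def variations(income, share, p_old, p_new):
    """Magnitudes (EV, CS, CV) of a price increase for a Cobb-Douglas consumer.

    `share` is the constant expenditure share of the good whose price changes;
    the prices of all other goods are held fixed.
    """
    ratio = p_new / p_old
    ev = income * (1.0 - ratio ** (-share))  # I - E(p0, V(p1, I))
    cs = share * income * math.log(ratio)    # integral of a*I/p from p0 to p1
    cv = income * (ratio ** share - 1.0)     # E(p1, V(p0, I)) - I
    return ev, cs, cv


ev, cs, cv = variations(income=100.0, share=0.3, p_old=1.0, p_new=1.2)
print(ev, cs, cv)
```

The consumer-surplus measure lies between the two Hicksian variations, as in figure 3, and the three numbers move closer together as the expenditure share falls.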


25. Let

$$E(p, u) = \min_q \{p \cdot q\},$$

subject to $U(q) \ge u$, denote the consumer's expenditure function, where $U(\cdot)$
denotes the utility function. This is the amount of income required to reach
utility level u at price vector p. Let

$$V(p, I) = \max_q U(q),$$

subject to $p \cdot q \le I$, denote the indirect utility function. For a price change from
$p^0$ to $p^1$, the compensating variation is

$$CV = E(p^1, V(p^0, I)) - I$$

and the equivalent variation is

$$EV = I - E(p^0, V(p^1, I)).$$

The equivalent variation is an acceptable measure of welfare in that comparing
welfare at prices $p^1$ and $p^2$ (i.e., $V(p^1, I)$ and $V(p^2, I)$) is equivalent to
comparing the equivalent variations from $p^0$ to $p^1$ and from $p^0$ to $p^2$. This
property, in general, does not hold for the compensating variation.

The two variations do not exhibit path dependence in the case of multiple
price changes.


26. Formally, the equivalent variation is

$$E(p^1, V(p^1, I)) - E(p^0, V(p^1, I)) = \int_{p_h^0}^{p_h^1} D_h^c(p, V(p^1, I))\,dp_h,$$

where $I = E(p^1, V(p^1, I)) = E(p^0, V(p^0, I))$ and where the envelope theorem
and the definition of the expenditure function are used to obtain the derivative
of the expenditure function with respect to price. Similarly for the compensating
variation.

27. Here the two Hicksian variations bracket the consumer surplus. This may
not hold when dead-weight losses, rather than surpluses or variations, are
considered; see Hausman 1981.

For generalizations of the dead-weight loss to multiple price changes, see
Mohring 1971 and Diamond and McFadden 1974. The equivalent and the
compensating variation, respectively, are used in those papers.
How well does the consumer surplus approximate the
Hicksian (equivalent or compensating) variations? Because ...


... function, respectively, the Slutsky equation suggests
that the discrepancy between consumer surplus and
the Hicksian variations is small when income effects are small.


... consumer surplus, which depend on the income elasticity
of demand and/or the expenditure share of the good.²⁸

Along these lines, note that the income effect is likely
to be small if the good in question represents only a small
share of expenditure. If we let


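$$\varepsilon_h \equiv -\frac{\partial D_h}{\partial p_h} \Big/ \frac{D_h}{p_h},$$

$$\varepsilon_h^c \equiv -\frac{\partial D_h^c}{\partial p_h} \Big/ \frac{D_h^c}{p_h},$$

and

$$\eta_h \equiv \frac{\partial D_h}{\partial I} \Big/ \frac{D_h}{I}$$

... $D_h$ on good h is small relative to income, the
income effect is negligible. Two very useful facts follow: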


+ The demand curve for good h is downward sloping, 
because the compensated-demand curve is downward 
sloping. 

+ The consumer surplus, and the dead-weight loss com- 
puted from it, are good welfare approximations. 


This intuition goes back at least to Marshall (1920, 
p. 842), who argued that the above two statements 
should hold on the basis that the consumer's “expenditure 
on any one thing, as, for instance, tea, is only a small part 
of his whole income.” Vives (1987) confirms Marshall's 
intuition. Under some assumptions,²⁹ he shows that when
the consumer consumes a large number of goods n, the
following statements hold:


+ The income derivative of demand on one good is small
(of order $1/\sqrt{n}$, and even 1/n if preferences are additively
separable), and the demand curves are downward
sloping.³⁰

+ For a single price change, the percentage error in
approximating the Hicksian variations by the consumer surplus
is small (of order $1/\sqrt{n}$ as well). Furthermore, the
same thing holds for the approximation of the dead-weight
loss.


+ For a multiple price change, the Hicksian variations are 
also well approximated by the multigood consumer sur- 
plus, independent of the order of prices with respect to 
which integration takes place. (Recall that consumer sur- 
plus may not be defined uniquely when the prices of 
several goods change; it is path dependent.) 
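The role of the number of goods can be illustrated with a deliberately simple case, which is a sketch under assumptions of its own and not Vives's general argument. Suppose the consumer has symmetric Cobb-Douglas preferences over n goods, so each good takes an expenditure share 1/n. These preferences are additively separable in logarithms, so the faster 1/n rate is the relevant benchmark. For a single price change, the gap between the compensating and equivalent variations, relative to the consumer surplus, depends only on $x = \ln(p^1/p^0)/n$.

```python
import math


def relative_gap(n_goods, price_ratio):
    """(CV - EV) / CS for one price change under symmetric Cobb-Douglas utility.

    Each of the n goods has expenditure share 1/n; income cancels out of the ratio.
    """
    x = math.log(price_ratio) / n_goods
    cv = math.expm1(x)    # CV / income = exp(x) - 1
    ev = -math.expm1(-x)  # EV / income = 1 - exp(-x)
    cs = x                # CS / income
    return (cv - ev) / cs


for n in (1, 10, 100, 1000):
    print(n, relative_gap(n, price_ratio=1.2))
```

In this example, multiplying the number of goods by ten divides the relative gap by roughly ten, in line with an approximation error of order 1/n.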


The goods and industries considered in this book gen- 
erally represent only a small share of consumer expendi- 
ture. Price changes are therefore likely to generate small 
income effects, and it may be appropriate to assume that 


1981. 


... symmetrical enough (so as to avoid the possibility that one good picks up most
of the income effects), that no two goods are close to being perfect substitutes
(to avoid the possibility that one good picks up most of the demand), and that
the consumer's utility function satisfies a curvature condition. Then income
effects vanish while the substitution effects remain large as the number of
goods tends to infinity.


30. There exists a different literature on finding conditions under which the
demand curve for a good is downward sloping. For instance, Hildenbrand
(1983) shows that if all consumers have the same demand function and the
distribution of income is given by a decreasing density, all demand functions
are downward sloping. (See also Chiappori 1985.) This approach does not
require a large number of goods (and the associated assumptions; see note
29). However, the assumptions of identical preferences and decreasing income
density are quite strong.

The need for strong assumptions is not surprising in view of the earlier 
contributions of Sonnenschein (1973), Debreu (1974), and Mantel (1976) on the 
subject of excess-demand functions. Those authors showed that, as long as 
there are at least as many consumers as there are goods, absolutely no restric- 
tion (beyond homogeneity of degree 0 with respect to prices and Walras’ law) 
could be put on aggregate demand functions. 
demand is downward sloping and that the consumer surplus
is a good approximation of welfare.


Extending the single-consumer case to multiple consumers
creates new difficulties. One can, for instance,
define the aggregate equivalent variation as the sum of
individual equivalent variations without creating difficulty;
however, the issue is that the aggregate equivalent
variation is not, in general, insensitive to redistributions
of income between the consumers. Only under strong
conditions can one ignore the distribution of income.³¹
1986 for more on this.) 

... I will treat income distribution as irrelevant.
In other words, the redistribution of income from
one consumer to another is assumed to have no welfare
effect. (The marginal social utilities of income are equalized.)
I certainly do not feel that actual income distributions
are optimal, even with an optimal income-tax
structure (because there are limits and costs to income
taxation, as is emphasized by the optimal-taxation literature).
Market intervention does have desirable or undesirable
income-redistribution effects. But I will focus on the
efficiency of markets, using Musgrave's (1959) framework
in which the distribution branch of government worries
about distribution and the allocation branch (the one
... this book) deals with efficiency. The "com...


...le need only be concerned about efficiency;
... surplus increases, the winners can compensate


... in mind in ... our welfare conclusions. For instance, the
conclusion ... chapter 3 that allowing a monopolist to
discriminate perfectly improves welfare would be


What Is a Market? 


The notion of a market is by no means simple. Obviously 
we do not want to restrict ourselves to the homogeneous- 
good case. If we posit that two goods belong to the same 
market if and only if they are perfect substitutes, then 
virtually all markets would be served by a single firm— 
firms produce goods that are at least slightly differentiated 
(either physically or in terms of location, availability, con- 
sumer information, or some other factor). But most firms 
actually do not enjoy pure monopoly power. An increase 
in price leads consumers to substitute somewhat toward a 
small number of alternative goods. Therefore, the definition
of a market should not be too narrow.³²

The definition should not be too broad either. Any 
good is potentially a substitute for another, if only in 
an infinitesimal way. However, a market should not be
the entire economy. In particular, it should allow
partial-equilibrium analysis. It should also allow a single
description of the main interactions among firms.

It is also important to realize that the “right” definition 
of a market depends on the use to which it will be put. For 
instance, consider the case of coal. If one is interested in 
broad issues of energy policy (such as the effect of sub- 
sidizing certain types of energy), the relevant market is 
the energy market, including coal, gas, oil, and nuclear 
power. The analysis of long-term contracting and vertical 
integration between U.S. coal producers and electric utilities
is best conducted at the level of the region (e.g., the
Northeast, the Midwest, and the West; see the chapter on
the theory of the firm). To assess the competitive effects
of a merger between two coal suppliers, one looks at a 
much narrower definition of a market, because of the high 
transportation costs. 

There is no simple recipe for defining a market, as is 
demonstrated by the many debates among economists 
and antitrust practitioners about the degree of monopoly 
power in specific industries. Several useful (though imper- 
fect) criteria have been offered, however. Robinson (1933) 


31. More specifically, the demand function of consumer j must take the
“Gorman polar form”

$$D^j(p, I^j) = \Theta^j(p) + A(p)\, I^j,$$

where $\Theta^j$ is homogeneous of degree 0 in prices and $A$ is homogeneous of degree
−1.

32, This subsection draws on lectures given by Paul Joskow and Richard 
Schmalensee. 
suggested beginning with a given good and then looking
at the good's substitutes, and the substitutes for these
... significant gap in
the chain of substitutes. These gaps, she asserted, define ...


... and that therefore their prices tend to be correlated. However,
price correlation is at best a necessary condition for
belonging to the same market. Boston Edison and Electricité
de France, which both distribute electricity, are by
no means competitors, although their prices are likely to
be correlated because the prices of their fuels are. Concluding
they belong to the same market just because their
prices are highly correlated would be erroneous.

For the purpose of the present book, this empirical
difficulty of defining a market will be ignored. It will
be assumed that the market is well defined, and that it


Introduction


The Theory of the Firm 


The basic actors in this book are firms. But what is a firm, 
and how do firms behave? These two questions are the 
subjects of this preliminary chapter. 

The notions of a “firm,” of “vertical integration,” and 
of “authority” are by no means simple, and consequently 
they have been given various meanings in the literature. 
The frequently used definitions that will be mentioned 
and classified here reflect different facets of the notions 
and can be thought of as complementary. They share the 
idea that a firm should be able to produce (or sell) more 
efficiently than would its constituent parts acting separately.
These definitions thus entail very explicit
optimizing approaches.

The behavior of firms is not a simple matter, either. 
There are many ways in which business decision makers 
may deviate from profit-maximizing behavior and many 
mechanisms that, in turn, limit managerial discretion. Fur- 
thermore, most of the recent developments in the study 
of firms’ decision making and behavior are based on the 
theory of incentives, a full treatment of which is outside 
the scope of this book. Here, again, the main arguments 
will be introduced and classified. 

This chapter is at worst a hodgepodge of received 
ideas; at best it is a road map of the relevant contribu- 
tions. A large number of theories are reviewed at a fast 
pace. Numerous references are included for the interested 
reader, who may also find the surveys by Hart and
Holmström (1987) and Holmström and Tirole (1987)
useful.¹

It is customary to distinguish—not always without 
ambiguity—the horizontal and vertical aspects of a firm's 
size. The horizontal dimension refers to the scale of pro- 
duction in a single-product firm or to its scope in a multi- 
product one. The vertical dimension reflects the extent to 
which goods and services that can be purchased from 


1. Hart and Holmström 1987 develops some of the themes of this chapter in
more detail. Holmström and Tirole 1987 is less methodological and more
issue-oriented, and covers a substantially broader selection of topics than the
present chapter.


outsiders are produced in house. A wallpaper manufacturer
merging with another wallpaper manufacturer or a
tile producer engages in horizontal integration; a wine
producer acquiring a bottle or cork factory engages in
vertical integration.

This chapter is divided into two sections. Section 1
looks at various definitions of a firm. It does not focus on
the advantages (limited liability, issuing of shares, etc.)
brought about by the legal constitution of some form of
firm (e.g., a limited partnership or a corporation).² Rather,
it focuses on what determines the size of the firm. However,
one legal aspect should be mentioned at the outset:
Sometimes the internalization of transactions within a
firm is a way of concealing practices deemed illegal under,
say, antitrust laws. A couple of such examples are given,


... is concerned mainly with the efficiency (or
“non-monopoly”) reasons for integration or disintegration.


ment to trade) there may be many suppliers and buyers, 
but ex post (once investments have been made) the parties 
may be in a bilateral monopoly situation. The supplier 
may not find alternative outlets, and the buyer may not 
be able to contract with a new supplier on time. This 
absence of ex post competition raises the possibility of 
“hold-up” or “opportunism” (the confiscation of the gains
associated with one party's investment by the other party).
A long-term contract must ex post guarantee the parties a
fair return in order to ex ante encourage specific investment.
It must also ex post guarantee the efficient volume
of trade by prohibiting monopoly pricing.

These first two views have relatively little to do with 
the legal definition of a firm (or with each other). Firms 
that agree to share some fixed costs (for instance, the 
study of export markets, the procurement of inputs, or 
some research and development), a manufacturer and a 
retailer who sign a contract specifying stringent vertical 
restraints, and a power plant and a coal mine located next 
to each other that sign a detailed thirty-year supply con- 
tract can all be considered at least partially integrated, 
although legally they remain separate entities. (Consider, 
in contrast, an employee hired for a particular task, or one 
who can quit or be fired without advance notice and 
without penalties for breach). 

The third view of the firm—the incomplete-contracting 
view—comes somewhat closer to the legal definition(s). 
It emphasizes that firms and contracts are rather different 
“governance modes.” It looks at the firm as a particular 
way of specifying what is to be done in the event of 
contingencies not foreseen in a contract. It starts from the 
idea that contracts are necessarily incomplete, because 
some contingencies are unforeseeable or because there 
are too many of them to specify in writing, so that 
cost minimization requires the original contract to define 
only the broad lines of the relationship. The ownership 
of machines and the employment relationship give the 
owner of the firm the power (“authority”) to choose
the way machines and personnel are to be employed,
within certain limits. The degree of integration can then


analyses of the various ownership structures.


3. Unless there are production externalities between firms. Such externalities
arise, for instance, when the input markets are noncompetitive. Or, as in the
Hart model of managerial incentives mentioned below, the control of a firm's
managers by its shareholders depends (for informational reasons) on the
internal organization of other firms in the industry.


4. Very roughly, economies of scale exist when the production cost of a single
product decreases with the number of units produced; economies of scope are
cost-saving externalities between product lines (e.g., the production of good A
reduces the production cost of good B).

firm's size according to the various definitions of a firm.
Among these factors are exhaustion of returns to scale
and scope, rare factors, the drawbacks of long-run
relationships, and excessive concentration of power.

Section 2 takes firms as well-defined entities. Rather
than analyze their size, it investigates how firms
behave. Do firms actually maximize profits, as is presumed
throughout this book? If they do not, is the
profit-maximization hypothesis still a useful guide for predicting
monopolistic behavior or strategic interaction? The various
mechanisms that put a lid on managerial discretion
(monitoring; the incentives of the product, labor, and
capital markets) are reviewed, and the profit-maximization
postulate is then discussed.


1 What Is a Firm?


In subsections 1.2-1.4 we shall examine three views of
the firm as a cost-minimizing device. Before doing so,
however, let us dispose of some motives for integration.


1.1 The Firm as a Loophole for the Exercise of
Monopoly Power


Various trade practices are, rightly or wrongly, banned by
... regulations as evil manifestations of monopoly


Price Discrimination


As will be shown in chapter 3, a firm may want to sell
a given product in different markets at different prices.
This raises the possibility of arbitrage among the retailers
serving these markets. To avoid arbitrage, the manufacturer
may integrate into distribution and serve the
low-price market himself. A similar phenomenon, but one
induced by the law rather than by arbitrage, arises when
the law forces a firm to treat two customers in “similar


situations” symmetrically (as is the case in principle in the 
United States or in France) but the manufacturer desires 
to treat them differently or to foreclose one customer's 
access. By integrating with one customer, the manufac- 
turer can treat the other differently or perhaps shut the 
intermediate-good market altogether. 


Intermediate Price Controls 


Suppose the price of an intermediate good traded be- 
tween a large number of suppliers and a large number of 
buyers is set by regulatory agencies below the market 
equilibrium price determined by supply and demand. The 
good must then be distributed to the buyers by some 
rationing mechanism. This gives the suppliers an incen- 
tive to merge with some of their buyers. Why? The trans- 
action (legal) price of the intermediate good is lower than 
the shadow price that the buyers are willing to pay. 
Therefore, there exist appropriable rents to be distributed 
between the suppliers and the rationed users. However, 
the suppliers, who could obtain their share of appropriable 
rents by increasing the price in an unconstrained market, 
cannot do so in the regulated case. Vertical integration 
allows suppliers to circumvent the law by generating 
internal—and consequently unobservable—transactions. 

There are other legal reasons that may lead vertically 
related firms to integrate. One is the existence of a sales 
tax. The internalization of transactions avoids the payment
of such a tax at the intermediate stage. (This phenomenon
does not occur when the tax base is the value
added rather than the accumulated value of the product 
up to the transaction. A value-added tax is neutral with 
respect to the integration decision.) Another reason is 
the existence of a rate-of-return regulation. A firm that 
is subject to such a regulation can earn more profits 
by integrating backwards into the equipment-supply industry
if equipment-transfer prices are not regulated.⁶

Chapter 4 is dedicated to the study of the economic 
rather than the legal motives for vertical integration that 
are related to the exercise of monopoly power. 


Remark Horizontal integration may also be motivated 
by the desire to exercise monopoly power. The obvious 
example is that of a merger of two firms that produce the 


5. Furthermore, the law often distinguishes quite explicitly between internal
and external transactions.


6. The regulated firm can inflate transfer prices so as to relax the rate-of-return
regulation. See Dayan 1972.

same products. Such mergers, prominent during the great
merger wave of 1887-1904 in the United States,⁷ are
meant to eliminate the dissipation of monopoly
profit through product competition (see chapter 5).⁸


1.2 The Firm as a Static Synergy


An old theme in industrial organization, following Viner's
(1932) classic investigation of cost curves, is that the size
and the number of the firms in an industry are related to
the degree of returns to scale.

One of the main determinants of the size of a firm is
the extent to which it can exploit economies of scale or
of scope. As has been well documented by engineers,⁹
higher levels of production permit the use of more efficient
techniques. They vindicate the investment in
cost-reducing technologies, and they allow workers to be
more specialized. Unit costs decrease. Such economies of
scale, related to the volume of a single product, are called

Figure 1
An example of unitary-form organization: a chief executive above
four functional units (manufacturing, sales, finance, engineering).
Source: Williamson 1975, p. 134.


Alternatively, the gathering of activities, be they rela- 
tive to the same product or to different products, may 
relate not to production in the narrow sense but to all the 
services that accompany production: auditing, marketing, 
personnel service, finance, distribution, materials procurement,
research and development, and so forth. Such a
gathering avoids duplication of fixed costs associated
with these services, or at least it reduces these costs on
the average. Demand complementarities may also be a
motive for coordinating activities.¹¹

The late nineteenth century witnessed the emergence 
of large, multifunctional firms, which were organized in a 
unitary (U) form,¹² with the units regrouped by their
functions as shown in figure 1. This sort of organization 
can be seen as an attempt to exploit potential economies 
of scale. Within each unit, the large size reduces the unit 
cost of exercising the function (that is, producing the 
good or the service). 

However pervasive they may be, returns to scale have 
their limits. Machines or functional divisions related to 
two production units can be advantageously pulled to- 
gether only if they are not employed to their capacities. 
Similarly, the savings in peak-load capacity associated 
with the pooling of risk and the law of large numbers 
become smaller and smaller as the size of the firm grows. 
Furthermore, it is sometimes argued that there may exist 
tends to a constant-returns-to-scale one when the number of machines tends to
infinity.


11. One could think of firms specialized in making left shoes and others in
making right shoes; however, the required coordination of designs creates a
synergy.

12. For discussions of unitary and multidivisional forms, see Chandler 1966
and Williamson 1975. (The U-form, the logical one from a technological
viewpoint, faces serious coordination and incentive problems and has often
been replaced by the M-form. See page 47.)

rare factors, such as managerial talent, that cannot be
duplicated as the firm expands.¹³

Let us now briefly consider the formalizations of
returns to scale and to scope. (A much more complete
treatment can be found in Baumol et al. 1982; see
chapter 2 for the single-product case and chapters 3 and
4 for the multiproduct case.)

Let us begin with the single-product firm. Let C{q) 
denote } firm's total cost of producing output q; that is, 
let C(q)|be the minimum cost of a bundle of inputs that 
allows the production of q units of output. For simplic- 
ity, assume that the cost function is twice differentiable, 
except ossibly at zero output: 


@ 
r+[ C'(x)dx forqg>0 


0 


Cq) = 
0 otherwise 


where f]> 0 denotes a fixed production cost. 

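Marginal costs are strictly decreasing if C''(q) < 0 for
all possible q. Average costs are strictly decreasing if, for
all q_1 and q_2 such that 0 < q_1 < q_2,

\[
\frac{C(q_2)}{q_2} < \frac{C(q_1)}{q_1}.
\]

The cost function is said to be strictly subadditive if, for
any n-tuple of outputs q_1, ..., q_n,

\[
\sum_{i=1}^{n} C(q_i) > C\Big(\sum_{i=1}^{n} q_i\Big).
\]

[Figure 2: panels a, b, and c; horizontal axis q.]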


marginal cost. Average costs are everywhere declining 
(except at zero production), although at a vanishing rate. 
Figure 2b shows a U-shaped average-cost curve. The
average cost declines until it reaches the “most efficient
scale,” MES, where it intersects the marginal-cost curve;
then it increases. An example of such a cost curve is
C(q) = F + aq^2 (where a > 0). The flat-bottomed average-
cost curve depicted in figure 2c is intermediate between
the first two curves. Average cost stabilizes over a range of
outputs before decreasing returns make it grow again.14

Everywhere-decreasing marginal costs imply every-
where-decreasing average costs,15 and everywhere-
decreasing average costs imply subadditivity.16 The con-
verses of these two propositions are, however, false. (See
figure 2a for a counterexample to the first converse, and
proposition 2A1 of Baumol et al. 1982 for a counter-
example to the second.)
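
The chain of implications can be checked numerically on a grid of outputs. The sketch below is only an illustration under assumed functional forms: an affine cost with a fixed cost and constant marginal cost (the kind of case that breaks the first converse) and a hypothetical concave cost F + sqrt(q). The function names, the grid, and the parameter values are assumptions, and subadditivity is tested for pairs of outputs only.

```python
import itertools
import math

F = 1.0  # fixed cost (illustrative value, not from the text)

def affine_cost(q):
    """C(q) = F + 2q for q > 0 and 0 at q = 0: constant marginal cost."""
    return F + 2.0 * q if q > 0 else 0.0

def concave_cost(q):
    """C(q) = F + sqrt(q) for q > 0: strictly decreasing marginal cost."""
    return F + math.sqrt(q) if q > 0 else 0.0

GRID = [0.5 * k for k in range(1, 21)]  # outputs 0.5, 1.0, ..., 10.0

def average_cost_decreasing(cost):
    """Check C(q2)/q2 < C(q1)/q1 for every grid pair with q1 < q2."""
    return all(cost(q2) / q2 < cost(q1) / q1
               for q1, q2 in itertools.combinations(GRID, 2))

def subadditive(cost):
    """Check C(q1) + C(q2) > C(q1 + q2) for every pair of grid outputs."""
    return all(cost(q1) + cost(q2) > cost(q1 + q2)
               for q1 in GRID for q2 in GRID)

def marginal_cost_decreasing(cost, h=1e-4):
    """Check that a central second difference is clearly negative on the grid."""
    return all(cost(q + h) - 2.0 * cost(q) + cost(q - h) < -1e-12
               for q in GRID)
```

With these assumed forms, the affine cost has decreasing average cost and is subadditive although its marginal cost is not strictly decreasing, while the concave cost satisfies all three properties.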


Remark Several meanings can be given of a natural 
monopoly, depending on the application that is being 
made. Suppose that there exists a commonly available 
technology C(q) that produces some output q. Baumol 
et al. (1982) define an industry as a natural monopoly if, 
over the relevant range of outputs, the cost function is 
subadditive. This definition is the correct one for a well- 
informed planner (a planner who knows the cost function 


adjustment cost with learning about the characteristics of the employees. The
firm observes the employees' performance over time and slowly finds out
which tasks suit each of them best. The trade-off is between growing quickly,
with errors in job assignment, and growing more slowly and capturing
the gains from more accurate information about the employees' comparative
advantage. This theory reflects the common feeling among executives that


scarce managerial talents and identify firms with managers.


14. See Bain 1954 and chapter 4 of Scherer 1980 for discussions of the
evidence on average-cost curves. 


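15.
\[
\frac{d}{dq}\left(\frac{C(q)}{q}\right)
= -\frac{F}{q^2} + \frac{1}{q}\left(C'(q) - \frac{\int_0^q C'(x)\,dx}{q}\right) < 0,
\]
since C'(q) < C'(x) for all x in (0, q) implies that C'(q) - (1/q) ∫_0^q C'(x) dx < 0.

16. Let q = Σ_i q_i (with q_i > 0). Then C(q_i)/q_i > C(q)/q, which implies
\[
\sum_i C(q_i) > \sum_i q_i \frac{C(q)}{q} = C(q).
\]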
perfectly). The planner has no incentive to have several
firms produce the output when the aggregate output
could be produced more cheaply by a single firm.

In a more positive sense, one can look at behavior in an
unregulated industry. Let Π(n) denote the profit of a
single firm when the number of firms in the industry is n.
(For simplicity, assume that all the firms make the same
profit.) This profit is the outcome of whatever type of
competition the firms wage, and is net of all costs (in-
cluding the fixed cost). It is natural to assume that Π(n)
decreases with n. An industry is a natural monopoly if
Π(1) > 0 > Π(2); that is, if one firm is viable, but not
two or more.17
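
As a rough illustration of this viability criterion, the sketch below computes per-firm profit under one particular type of competition. Everything specific here is an assumption made for the example and not part of the text: symmetric Cournot competition, linear inverse demand P = a - Q, a constant marginal cost, and the numerical values.

```python
def cournot_profit(n, a=10.0, marginal_cost=2.0, fixed_cost=10.0):
    """Per-firm profit with n symmetric Cournot firms and inverse demand P = a - Q.

    Each firm produces (a - marginal_cost) / (n + 1) in the symmetric equilibrium.
    """
    output = (a - marginal_cost) / (n + 1)
    price = a - n * output
    return (price - marginal_cost) * output - fixed_cost

def is_natural_monopoly(profit):
    """Positive criterion: one firm is viable, two are not."""
    return profit(1) > 0 > profit(2)
```

With the default values, a single firm earns a positive profit while a duopolist does not cover the fixed cost; lowering the assumed fixed cost to 5 makes a second firm viable.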

One can also consider the case of a regulator who
has incomplete information about the cost function (or
demand). Such a regulator may want to trade off returns
to scale (which would favor a unique firm) and the extrac-
tion of the relevant information through competition
among firms (on this see the discussion of tournaments in
section 2 of the present chapter and the complementary
section of chapter 4). A natural monopoly then arises
when the regulator prefers production by a single firm.


For a multiproduct firm, subadditivity generalizes
naturally. Suppose that q is now a production vector or
plan: q = (q_1, ..., q_m) for the m outputs. Let q^1, ..., q^n
denote n such vectors. (Superscripts index production
plans.) The cost function is strictly subadditive if

\[
\sum_{j=1}^{n} C(q^j) > C\Big(\sum_{j=1}^{n} q^j\Big)
\]

for all q^j such that Σ_j q^j ≠ 0. This definition applies to the
single-product cost function as a special case. (Let the m
goods be homogeneous, i.e., be the same good.) More
interestingly, it gives one formulation of economies of
scope. For example, let q_1 and q_2 denote two quantities of
two different goods. Economies of scope hold if, for the cost
function,

\[
C(q_1, 0) + C(0, q_2) > C(q_1, q_2).
\]


(C(q_1, 0) and C(0, q_2) are called stand-alone costs.) For
instance, a single railroad company producing both pas-
senger travel and freight transport (two different eco-
nomic goods) is technologically more efficient than two
companies specialized in one of the two productions.
Similarly, the production of peak-load and base-load
demands is less costly when performed by a single
electric-power company.18
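
The stand-alone comparison can be made concrete with a toy cost function. In the sketch below, the functional form (a fixed cost shared by both outputs plus product-specific constant marginal costs) and all parameter values are hypothetical; they merely stand in for something like a track network used by both passenger and freight services.

```python
def joint_cost(q1, q2, shared_fixed=5.0, c1=1.0, c2=2.0):
    """Hypothetical two-product cost: one fixed cost shared by both outputs."""
    if q1 == 0 and q2 == 0:
        return 0.0
    return shared_fixed + c1 * q1 + c2 * q2

def scope_saving(q1, q2):
    """Sum of the stand-alone costs minus the joint cost.

    A positive value means economies of scope at (q1, q2).
    """
    return joint_cost(q1, 0.0) + joint_cost(0.0, q2) - joint_cost(q1, q2)
```

Under this assumed form the saving from joint production equals the shared fixed cost, which two specialized firms would each have to incur.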


Is the Technological View a “Theory of the Firm”?


The technological view aims at defining the size of 
a firm. Economies of scale encourage the gathering of 
activities. And the limit to the size stems from the fact 
that average-cost curves rise at high output, as in figures 
2b and 2c. There are two related reasons why this view, 
as it stands, does not quite constitute a theory of firm size. 

First, it is not clear why economies of scale should 
necessarily be exploited within the firm. They could, 
a priori, also be obtained through contracting between 
legally separate entities. For example, consider the above 
“economies of massed reserves” argument. As was noted, 
a firm serving several markets faces less uncertainty than 
a number of separate firms serving these markets inde- 
pendently. It can thus reduce costly investment in peak- 
load equipment. But a similar outcome could be obtained 
through an agreement to transfer the good at some (pos- 
sibly contingent) prices. Indeed, electric utilities do engage 
in arrangements to pool electric power. Similarly, the “re- 
pairman problem” of Arrow et al. (1972), which involves 
an increasing-returns-to-scale production function, does 
not imply that all production should take place in a single 
firm. The various units could sign a contract in which they 
agree to share the services of repairmen, or an indepen- 
dent repair firm could be established that would supply 
the services to the units. 


The returns-to-scale argument looks at cost or demand interdependencies.
There can also be synergies between unrelated units. Two such synergies,
based on uncertainty and the diversification of risks, have been put forward as
attempts to explain conglomerate mergers (i.e., mergers not motivated by cost
or demand interactions, waves of which occurred in the United States in the
1960s and the 1980s). First, it has been argued that a diversification into
unrelated activities allows firms to reduce risk. (This argument does not explain
why firms diversify their activities when their shareholders could duplicate this
operation by diversifying their portfolios.) Second, when diversification still
leaves the top managers in charge of all activities, the measurement of these
managers' performance is less garbled by the randomness in each activity's
profit (by the law of large numbers), so that managerial incentive schemes can
relate more closely to actual performance (see Aron 1984).
Second, we should not take it for granted that average-


limits to firm size? (An element of the answer is
in subsection 1.4 below.)


1.3 The Firm as a Long-Run Relationship


In subsection 1.2 we investigated some reasons why, at a
given point in time, some units might want to merge or
coordinate their activities through a static contract. We
now study the new problems associated with long-run
relationships; in particular, why the rules that govern
trade tomorrow ought to be determined today whenever
this is feasible.19 For simplicity, this discussion will be
restricted to a vertical relationship between a supplier and
a buyer. In the discussion of uncertainty, it will also be
assumed that both parties are risk-neutral.


Idiosyncratic Investment and Asset Specificity 


Long-run relationships are often associated with either
switching costs or specific investments. Switching costs


that may prevent a regulator from using repeated bidding
to allocate the monopolistic supply of a good are likely
also to prevent a buyer from repeatedly using the spot
market to purchase certain goods or services from a

investment. Once the two parties have traded, staying 
together can yield a surplus relative to trading with other 
parties. More generally, idiosyncratic investment can be 
associated with the prospect of future trading rather than 
with current trading. This is the case, for instance, when 
a supplier must design equipment the characteristics of 
which are specific (dedicated) to a buyer's particular order 
(as is often the case with machine tools), or when a buyer
spends money and effort to sell or promote a final pro-
duct before an intermediate good used to produce this
final product is delivered by the supplier (as when a
concert organizer rents a concert hall before the singer
performs), or when a user of raw materials buys machines
that are adapted to the use of certain materials. Williamson
(1975) distinguishes two further types of specificity: site
specificity and specific investments in human capital. Site
specificity is associated with the gain in trading with
a nearby supplier or buyer. For instance, in the steel
industry, the integration of smelting (upstream) and lami-
nating (downstream) operations affords a reduction in
transportation costs and avoids the need to reheat the
steel. Similarly, the location of a power plant near a coal
mine saves on transportation costs. Specific investments
in human capital involve, for example, the learning of
processes and teamwork.21

All these types of specificity have the same outcome: 
The parties that contract now know that later on there 
will be gains from trade between them to be exploited. It 
is important that these gains from trade be exploited 
correctly (i.e., that there be an efficient amount of trade ex 
post) and that they be divided properly in order to induce 
the efficient amount of specific investment ex ante. 

A crucial aspect of specific investment is that even 
though the supplier and the buyer may select each other 
ex ante in a pool of competitive suppliers and buyers, 
they end up forming an ex post bilateral monopoly in that 
they have an incentive to trade between them rather than 
with outside parties. Under bilateral monopoly, each party 
wants to appropriate the common surplus ex post, thus 
jeopardizing the efficient realization of trade ex post and 
the efficient amounts of specific investments ex ante.22


19. For a discussion of incomplete contracts, see subsection 1.4.


20. Williamson responded to earlier contributions by Demsetz (1968), Stigler
(1968), and Posner (1972), who suggested the use of repeated bidding as a
means to prevent excessively deviant behavior by the regulated monopoly.

21. For an analysis of specific investments in a labor context, see Williamson
et al. 1975.


22. The following discussion draws, in particular, from Farrell 1985, Grossman 
and Hart 1984, and Hall and Lazear 1984. 
Bilateral Monopoly Pricing and the Ex Post Volume of Trade


To be concrete, let us assume that there are two periods:
t = 1 (ex ante) and t = 2 (ex post). A supplier and a buyer
may or may not contract in period 1. To focus on the ex
post issue, we ignore for the moment first-period specific
investments. At the beginning of period 2, the two parties
learn how much they will gain from trading in period 2
(these variables can be seen at date 1 as being random). If
they choose to do so, they trade one unit of an indivisible
good (or the supplier realizes a “project”). Thus, the
volume of trade is either 0 or 1 (in a more complicated
model, it could be a continuous variable). The value of the
good to the buyer is v, and its production cost to the
supplier is c.23 So, the gains from trade (if any) to be split
between the two parties are equal to v - c. If p is the
trading price, the buyer's surplus from the relationship is
v - p and the seller's surplus is p - c. In the absence of
trade, the surplus is nil for the two parties.


bargaining occurs in period 2 to determine whether
to trade and at what price.

If v and c are common knowledge (i.e., if both parties
know v and c before bargaining), we should expect the
efficient amount of trade (i.e., that trade occurs if and only
if v > c). To see why, suppose that v > c and that no


efficient


surplus.


refusing to trade. More generally, bargaining
under symmetric information is efficient, so the issue of


[1960] theorem.) As we now see, asymmetric information
may yield bargaining inefficiencies.

Often the buyer's value v and the supplier's cost c are
“private information.” The value is observed only by the
buyer, the cost only by the seller.24 The efficient volume
of trade may then not be reached because of the (bilateral) 
monopoly pricing problem. The inefficiency stems from 
the fact that both parties would like to appropriate the 
gains from trade but, because of asymmetric information, 
run the risk of forgoing trade in order to get a larger share 
of the pie in case of trade. They may be too demanding. 

A simple example helps to illustrate how the “wrong
distribution of bargaining power” creates trade inefficien-
cies: Suppose that the cost c is known to both parties, that
the value v is known to the buyer only, and that the
supplier's beliefs about v are represented by a cumulative
probability distribution F(v) with density f(v) > 0 on an
interval $[\underline{v}, \bar{v}]$ (with $F(\underline{v}) = 0$ and $F(\bar{v}) = 1$). Assume that
gains from trade exist with positive probability (i.e., $\bar{v} > c$)
and, for simplicity, that this probability is less than 1 (i.e.,
$\underline{v} < c$). Suppose further that the supplier has all the bar-
gaining power in period 2; that is, the supplier can make
a “take it or leave it” price offer p to the buyer. (Of course,
in a bilateral monopoly situation one would expect bar-
gaining power to be more evenly distributed; but this
extreme case offers simple and illustrative conclusions.) If
the supplier offers p, the buyer accepts only if v > p.
Thus, the probability of trading is 1 - F(p) and the sup-
plier's expected profit is


\[
(p - c)[1 - F(p)].
\]

Maximizing with respect to p yields the first-order condi-
tion25

\[
[1 - F(p)] - (p - c) f(p) = 0. \tag{1}
\]

Equation 1 says that an increase in price from p to p + dp
yields extra profits dp with probability 1 - F(p) and leads
to a loss of trade, and hence to a loss of net profit p - c,
with probability f(p)dp. At the optimum, these two effects


23. The value and the cost may reflect outside opportunities as well. For
instance, v may be the difference between the value to the buyer in this
relationship and that in an alternative relationship.


24. In some instances, these values could be inferred ex post from accounting
data in the case of trade. However, such accounting data may be garbled. For
instance, the supplier's cost may be hard to distinguish from costs associated
with other projects; also, it may be garbled by “moral hazard” (the unobserv-
able level of cost-reducing effort exerted ex post by the supplier). Similarly, how
much the trade in this intermediate good contributes to the buyer's overall
profit may be hard to discern.


25. The local second-order condition is -2f(p) - (p - c)f'(p) < 0. Using the
first-order condition, the reader will check that the local second-order condition
is satisfied if and only if the “hazard rate” of the distribution, f/(1 - F), is
increasing.
cancel. Notice that the volume of trade is suboptimal.26
The efficient volume of trade would arise if and only if
p = c (the buyer pays exactly the supplier's cost and thus


yields no profit to the supplier; raising the price above
cost yields a profit with some probability; the forgone
volume of trade associated with this increase is costless
because the initial price-cost margin is zero.

Equation 1 is nothing but the familiar monopoly pric-
ing formula (see chapter 1) for demand curve q = D(p)
= 1 - F(p). For a risk-neutral seller, a continuum of
buyers with unit demands and valuations distributed
according to some cumulative distribution F(·) is equiva-
lent to a single buyer with unit demand and a random
valuation determined by F(·).27
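
To see the size of the distortion in a concrete case, the sketch below assumes that the buyer's value is uniformly distributed on [0, 1] and that the known cost c lies in that interval. The uniform distribution, the grid search, and the function names are assumptions made for this illustration. Under them the first-order condition [1 - F(p)] - (p - c)f(p) = 0 reduces to p = (1 + c)/2.

```python
def uniform_cdf(x):
    """F(x) for a value uniformly distributed on [0, 1] (assumed distribution)."""
    return min(max(x, 0.0), 1.0)

def expected_profit(p, c):
    """Supplier's expected profit (p - c)[1 - F(p)] from a take-it-or-leave-it offer."""
    return (p - c) * (1.0 - uniform_cdf(p))

def optimal_price(c, steps=10_000):
    """Grid search for the profit-maximizing offer on [c, 1]."""
    grid = [c + (1.0 - c) * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda p: expected_profit(p, c))

def expected_surplus(p, c):
    """Expected gains from trade when trade occurs for v >= p, with c <= p <= 1.

    This is the integral of (v - c) over [p, 1] under the uniform density.
    """
    return ((1.0 - c) ** 2 - (p - c) ** 2) / 2.0
```

For example, with c = 0.2 the search returns a price close to 0.6, and `expected_surplus(0.6, 0.2) / expected_surplus(0.2, 0.2)` shows that, under these assumptions, pricing above cost realizes three quarters of the efficient expected surplus.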


Remark More generally, it can be shown that as long as
value and cost are private information, as long as
gains from trade do not arise with certainty (i.e., there
is some probability that v < c), and as long as the parties
are free not to trade (i.e., can guarantee themselves a
zero surplus by quitting the bargaining process), there
exists no efficient bargaining process (see Myerson and
Satterthwaite 1983).28 Bargaining creates some ineffi-
ciency (in general, in the direction of too little trade).
The intuition is that the previous monopoly-pricing
inefficiency is bound to arise because each bargainer has
incomplete information about the other party.


Contracting 


Ex post trade inefficiency gives the parties incentives to 
contract ex ante to avoid or limit this inefficiency. In the 
preceding case, in which only the buyer's value is private 
information, there is a simple way to do so. It suffices
to give the “informed party” (the buyer) the right to
choose the price (i.e., to reverse the bargaining power).
Since c is known, no inefficiency will arise. The buyer's
monopoly price is equal to the price that leaves the seller
indifferent about either accepting or refusing to trade:
p = c. The buyer ex post appropriates all the gains from
trade. An unconditional ex ante payment from the buyer 
to the supplier can be negotiated to create any division of 
this optimal joint surplus. (More generally, the contract 
aims to create the largest possible “pie.” The division of 
this pie depends on ex ante relative bargaining powers.) 

Similarly, if the buyer’s value is common knowledge 
and the supplier's cost is private information, giving the 
supplier the right to fix the price is efficient. Such rights 
are somewhat similar to a sequential authority relationship 
(see subsection 1.4) in which one party has the authority 
to choose the price and the other party has only the 
authority over the trade decision. 

In the case in which one of the parties’ value or cost 
is already common knowledge at the contract date, this 


26. If it is common knowledge that there are gains from trade (i.e., $\underline{v} > c$), the
volume of trade may or may not be efficient.


27. Similarly, dynamic bargaining between a buyer and a seller in which the
seller makes offers sequentially can be reinterpreted as an intertemporal monop-
oly price discrimination problem (see the supplementary section at the end of
chapter 1). Indeed, the monopoly pricing problem examined here is the one-
period version (or the multiperiod version with commitment) of the inter-
temporal price discrimination problem.


28. The advanced reader may check this in the following way. Let the sup-
plier's cost c be distributed on $[\underline{c}, \bar{c}]$ according to the c.d.f. G(·) with density
g(·), and let the buyer's valuation v be distributed on $[\underline{v}, \bar{v}]$ according to the c.d.f. F(·),
with density f(·). For exchange to be efficient, the parties must trade if and only
if v > c. Thus the total ex ante expected surplus is

\[
W = \int_{\underline{v}}^{\bar{v}} \left[ \int_{\underline{c}}^{v} (v - c)\, g(c)\, dc \right] f(v)\, dv.
\]

Next, let V(v) denote the buyer's expected profit in the bargaining process
(the expectation being taken over the seller's cost). One has V'(v) = G(v); i.e.,
the expected profit grows at a rate equal to the probability that a buyer
with valuation v trades. (To see this, note that a buyer with valuation v can
always behave like a buyer with valuation v + dv, so V(v + dv) - V(v)
≤ G(v + dv)dv. And conversely.) Similarly, if C(c) denotes the supplier's ex-
pected profit when his cost is c, then C'(c) = -[1 - F(c)]. Last, free participa-
tion implies that $V(\underline{v}) \ge 0$ and $C(\bar{c}) \ge 0$. Hence,

\[
V(v) \ge \int_{\underline{v}}^{v} G(x)\, dx
\]

and

\[
C(c) \ge \int_{c}^{\bar{c}} [1 - F(x)]\, dx.
\]

A simple integration by parts then shows that

\[
\int_{\underline{v}}^{\bar{v}} V(v) f(v)\, dv + \int_{\underline{c}}^{\bar{c}} C(c) g(c)\, dc
\]

strictly exceeds W if $\bar{c} > \underline{v}$ (overlapping supports), a contradiction.

Myerson and Satterthwaite (1983) characterize the optimal bargaining
mechanisms (of course, the actual bargaining process need not be one of these).
Cramton et al. (1987) show that inefficiency need not result if ownership of the
good is initially distributed in a more symmetric way (as in a partnership), and
generalize the Myerson-Satterthwaite characterization of optimal mechanisms.
efficient arrangement takes the even simpler form of a
fixed-price contract. For instance, if c is common knowledge
at the contracting date, the contract can be written in the
following manner: “The buyer determines the quantity to
be traded (here, 0 or 1). The delivery price is c.”

With bilateral asymmetric information, assigning one
party the right to choose the price, or to choose the
quantity at a previously agreed-upon price, in general is
no longer efficient. The parties will want to consider
alternative arrangements. A particularly simple rule, the
“rigid norm,” fixes the amount of trade and the price in
advance. It specifies that the parties will trade, whatever
the realizations of v and c. Such a rule, of course, is
efficient if and only if the parties are ex ante certain that
there will be gains from trade. (In this case, the rigid norm
is generally strictly superior to giving either party the
right to choose the price.29)

In summary: Ex post bargaining may not lead to the
efficient volume of trade. Some constraints (if possible,
simple ones) on the second-period decision process must
be contracted for. When one party's information is (re-
spectively, becomes after contracting) common knowl-
edge, the other party should be given the right to choose
whether to trade at a given price (respectively, to choose
the price). The power should go to the informed party.


a fixed order for the book at an observable price), the 
printer should be given the authority to pick the price. 


Specific Investment and the Hold-up Problem 


Suppose that at date 1 a supplier invests in cost reduction 
(his investment reduces c) and a buyer invests in value 
enhancement (his investment increases v). These invest- 
ments are specific in that they would not reduce cost or 
increase value if the parties were to trade with other 
parties. 


Bargaining 


Let us, as above, start by assuming that no contract is
signed in period 1. Thus, the two parties bargain in period
2 over whether to trade and at what price.

Obviously, if the ex post volume of trade is not effi- 
cient, investments will be affected accordingly. If there is 
too little trade, the supplier and the buyer have an incen- 
tive to invest less than under efficient trade, because the 
probability that their investments are “used” is smaller 
than the optimal one. To separate the issues of efficient 
trade and efficient investment, we will assume that, ex 
post, v and c are common knowledge. Thus, the parties 
trade if and only if v > c. We can thus focus on the
dependence of ex ante specific investments on the ex post
split of gains from trade. What fraction of his own value-
enhancing (cost-reducing) investment can the buyer (sup-
plier) recoup?

Assume that the buyer's value is known at the contract
date to be v = 3. The supplier can “invest” (spend I = 2)
or “not invest” (spend I = 0). If he invests, his ex post
marginal cost is c = 0. If he does not invest, his marginal
cost is “high” (>3). Suppose that ex post the parties'


does not arise if the parties sign an optimal contract before
learning their private information, as was shown by Arrow (1979) and d'Aspre-
mont and Gérard-Varet (1979). The advanced reader may check this as follows:
Suppose that v is, a priori, distributed on $[\underline{v}, \bar{v}]$ according to the c.d.f. F with
density f and c on $[\underline{c}, \bar{c}]$ according to the c.d.f. G with density g, and that the two


after learning it in the second period. Let $\hat{v}$ and $\hat{c}$ denote the announcements
(which need not be truthful). The rule specifies that trade occurs if and only if
$\hat{v} \ge \hat{c}$. Thus, it is efficient if the parties tell the truth. The payment from the
buyer to the seller is $p(\hat{v}, \hat{c})$ and is unconditional on trade (is made even if
$\hat{v} < \hat{c}$). It is given by

\[
p(\hat{v}, \hat{c}) = \int_{\underline{c}}^{\hat{v}} c\, g(c)\, dc - \int_{\underline{v}}^{\hat{c}} v\, f(v)\, dv + \text{constant}.
\]

This payment induces the parties to announce the truth. It is called an expected
externality payment because, for instance, the buyer, by increasing the announce-
ment from $\hat{v}$ to $\hat{v} + dv$, increases the payment by $\hat{v} g(\hat{v}) dv$, which is nothing but
the expectation of the cost of the extra volume of trade created. (The new trade
occurs when $c = \hat{c}$ belongs to $[\hat{v}, \hat{v} + dv]$, which occurs with probability $g(\hat{v}) dv$.)
For extensions of this mechanism (which violates the ex post nonnegative
surplus condition for some values) see Maskin 1985, Pratt and Zeckhauser
1985, and Johnson et al. 1986.

The expected-payment mechanism described here is always efficient, but is
more complex to set up than the ones envisioned in the text. (Subjective
distributions must be described, and the private information must be conveyed
to a third party, or a recording machine, ex post. The good to be traded must
also be precisely described at date 1.)


30. The effect of this reduction in the volume of trade is emphasized in Tirole 
1986a. 
Let us next consider specific investment. The supplier


makes profit 0 if he does not invest. He makes profit


To simplify, assume that v > c(0). Let the price
be ex post determined by the Nash bargaining solution:
p(I) = [c(I) + v]/2 (so that v - p(I) = p(I) - c(I)), once
I has been invested (using v > c(I)). The supplier's profit


to the supplier. The other 50 cents are held up by the
buyer. The privately optimal investment is -c'(I) = 2. In
contrast, the socially optimal investment solves


emphasizes, ex post bilateral monopoly plus bargaining
yields underinvestment in specific assets. A power com-


of the investment once it is sunk, an employee will not
invest in learning firm-specific skills if he has no guarantee
that the firm will not exploit his captive position later on,
and so forth.

This simple model also allows us to see the effect of the
degree of asset specificity and the existence of outside oppor-
tunities. Introduce (ex ante and ex post) a large number of
buyers who all are willing to pay v for the good. But there
is still asset specificity in that the investment, I, is geared
to a particular buyer (“the specific buyer”). If the supplier
trades with any other buyer, his production cost corre-
sponds to a fictitious investment of λI, where λ belongs
to [0, 1]. (λ = 0 is the most extreme form of asset specific-
ity, and λ = 1 corresponds to the absence of specificity.)
Suppose that the supplier has invested I. By not trading
with the specific buyer, he obtains price v (because of the
competitive behavior of other buyers) and gets surplus
v - c(λI). Hence, still under the assumption that bar-
gaining with the specific buyer leads to an equal division
of the gains from trade, trade takes place with this specific
buyer at price p such that

\[
v - p = [p - c(I)] - [v - c(\lambda I)].
\]

One immediately sees that the supplier has a higher
incentive to invest than in the absence of outside oppor-
tunities. Outside opportunities for the supplier raise his
status quo (disagreement) payoff and put him in a better
bargaining position. The supplier's intertemporal payoff
is now

\[
v - \tfrac{1}{2}[c(I) + c(\lambda I)] - I.
\]

The ex ante choice of investment yields

\[
-[c'(I) + \lambda c'(\lambda I)] = 2.
\]

When λ = 1 (no asset specificity), p = v and the invest-
ment is socially optimal. When λ = 0 (full asset specific-
ity), the investment level is the same as in the absence of
outside opportunities. For λ in (0, 1), as long as the curva-
ture of the cost function is not too high, the investment
increases with λ.
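
The comparative statics in the degree of specificity can be traced numerically. The sketch below assumes a hypothetical cost function c(I) = K exp(-I) with K = 2.5, which is not taken from the text, and solves the supplier's first-order condition -[c'(I) + λc'(λI)] = 2 by bisection. The social optimum solves -c'(I) = 1, that is, I = ln K under this assumed form.

```python
import math

K = 2.5  # scale of the hypothetical cost function c(I) = K * exp(-I)

def marginal_cost_saving(investment):
    """-c'(I) for the assumed cost function c(I) = K * exp(-I)."""
    return K * math.exp(-investment)

def private_investment(lam, lo=0.0, hi=20.0, tol=1e-10):
    """Solve -[c'(I) + lam * c'(lam * I)] = 2 for I by bisection.

    lam in [0, 1] is the degree to which the investment is useful with
    other buyers (0 = fully specific, 1 = not specific at all).
    """
    def excess(i):
        # Left-hand side minus 2; strictly decreasing in i for this cost function.
        return marginal_cost_saving(i) + lam * marginal_cost_saving(lam * i) - 2.0

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

SOCIAL_OPTIMUM = math.log(K)  # solves -c'(I) = 1
```

Evaluating `private_investment` on a grid of λ from 0 to 1 shows the investment rising from ln(K/2), the level without outside opportunities, to the social optimum ln K. With a larger K the assumed cost function is curved enough that the relationship need not be monotonic, in line with the caveat in the text.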


31. See Nash 1953. Nash used an axiomatic approach to derive his bargaining
solution; he also justified it as the outcome of a simultaneous-move “demand
game.” One justification of the Nash solution in terms of sequential, non-
cooperative bargaining is that given by Rubinstein (1982) and Binmore (1982)
for the case where the parties have no outside opportunity. (See the section on
dynamic games of complete information in the Game Theory User's Manual at
the end of the present book.)


32. For an application of this type of reasoning to labor markets, see Grout
1984.
contracts specifying the process through which the
amount of trade and the transfer are determined ex post.
We will suppose that specific investments are observable


common gains from trade in bargaining situations, it may
be desirable to contract ex ante to impose penalties for
breach. These penalties bind the parties to each other and
prevent opportunism. The extreme case of penalties for
breach is when the parties agree in advance to trade at a
given price “no matter what” (this corresponds to infinite
penalties). Because the price is given, specific investments
cannot be expropriated (mathematically, the dependence
of p on I is eliminated).

High penalties for breach are in a sense a measure of a
lock-in relationship. They automatically keep the rela-


mechanisms must be contracted for when there is a non-
negligible probability of no gains from trade. There are
some simple cases, however. For instance, suppose that v

and c are random from a first-period viewpoint (with the
possibility that v < c) but become common knowledge
between the two parties at the beginning of the second
period. Suppose further that only one party (the supplier,
say) invests (so that c depends stochastically on invest-
ment I). In this case, the sequential authority mechanism
by which the supplier chooses the price and the buyer
accepts or refuses to trade at this price is efficient (i.e.,
yields the optimal levels of investment and trade). Be-
cause of ex post symmetric information, the volume of
trade is optimal. The supplier offers price v if v > c and
any price strictly greater than v if v < c. Trade takes place
if and only if v > c and all the ex post gains from trade go
to the supplier. Because the volume of trade is optimal
and the price does not depend on investment, the supplier
chooses the socially optimal investment.33 If v is already
known at date 1, this mechanism amounts to fixing the
price at v at that date and letting the supplier choose
whether to trade in period 2. Thus, the rough rule is that
the party investing should have the authority over the price,
or over the trading decision if the other party's informa-
tion is known in advance.34 (Gains from trade can be
divided ex ante through a lump-sum transfer.)


Exercise 1** In the text, it is assumed that the supplier's 
investment reduces his production cost. Assume instead 
that the ex ante investment affects the quality of the pro-
duct and thus the value to the buyer. The buyer's ex post
value is v(I) = 3I - ½I². Hence, the buyer's surplus in
case of trade is v(I) - p. The supplier's surplus is then
p - c - I (where c < ½ is now a constant production
cost). Suppose that I (and, hence, v) is observable by the
buyer; however, it is not verifiable by a court, so that it
cannot be specified by a contract.

(i) Determine the efficient amount of investment. 

(ii) Suppose that there is no contract and the two parties 
bargain ex post according to the Nash bargaining solution. 
Is the investment optimal? Point out the externality. 

(iii) Suppose that the parties sign a contract specifying 
that the buyer has the right to buy the good at a given 


33. Let F(v), f(v), G(c|I), and g(c|I) denote the cumulative distributions and
densities of v and c (with $\partial G/\partial I > 0$). The supplier solves

$$\max_I \left\{ \iint_{v > c} [p(v, c) - c]\, f(v)\, g(c \mid I)\, dv\, dc - I \right\},$$

which is also the socially optimal program, because p(v, c) = v whenever v > c.


34. Bilateral investment or supplier's investment and asymmetric information
about v calls for even more sophisticated mechanisms. The
expected-externality-payment mechanism described in footnote 29 yields the efficient
volume of investment (as well as trade) even when both parties invest and there
is ex post bilateral asymmetric information.
price p. Is this contract efficient? What if the supplier has
the right to sell at a given price?

(iv) What happens if the supplier is given the right to
choose the price ex post?


Remark The preceding discussion suggests that v and
c be audited when that is feasible. In bargaining, perfect
auditing makes the value and the cost common knowledge
and thus prevents inefficiencies associated with
incomplete information. In a contracting framework, it
allows the disconnection of the trading decision and the
transfers so that the trading decision can be based entirely
on auditing. More information gives rise to gains in
efficiency wherever the contract is inefficient in the absence
of auditing.35 Williamson (1975, p. 29) has argued that
integrated firms may be more susceptible to auditing than
nonintegrated ones.36 In particular, he suggests that it
is easier to audit an internal division than an outside
contractor offering the same services, because external
auditors are regarded suspiciously and face collusive
behavior on the part of employees, who may impede
the release of information.37 Another argument may be
that a firm has the legal right to audit its divisions
but no such right to audit outside contractors (except
in extreme circumstances). Grossman and Hart (1986)
argue that integration per se is unlikely to change the
structure of information, on the grounds that nonintegrated
parties can sign a contract that mimics the auditing
possibilities of the integrated firm. In particular, each
party can waive his right not to be audited by the other


The Limitations of Long-Run Relationships 


The most obvious and important limitation of a long-run 
relationship is the presence of outside opportunities. As 
was noted above, forcing the parties to stick to each other 
through high penalties for breach may hurt them if there 
are no gains from trade or, equivalently, if better outside 
opportunities are available to one or both of the parties. 
As breach may be desirable, the contract must find the 
optimal trade-off between flexibility and the prevention 
of opportunism.39

The loss of advantageous outside opportunities is not 
the only hazard of long-run contracts. Long-run relationships
tend to promote collusion between the units’
personnel (Tirole 1986b). A long time horizon gives them 
time to reciprocate favors and to become confident that 
collusion is sustainable. For instance, Pettigrew (1972) 
notes how managers of a business firm may, in the long 
run, identify with particular suppliers.40 The possibility
that such collusion will create inefficiencies calls for the 
rotation of personnel within each unit, or, when that is 
costly (because of job-related human capital, say), the 
occasional switching of suppliers or buyers. 

Another limitation of long-run relationships may be 
due to the fact that a short-term relationship is generally 
more advantageous to a party who knows he will have 
good outside opportunities in the future. Because good 
outside opportunities tomorrow are related to his general 
ability to perform well and thus improve his bargaining 
position today, this party has an incentive to signal them 
through the signature of a short-term contract (possibly 
disguised as a long-term contract).41


37. This is not to say that collusive behaviors do not impair accounting within
a firm. See, e.g., Dalton 1959, p. 206.


on the possibility of writing a contract that specifies 
items related to the parties’ trade in case a party does 


ing specific auditing rights is costly or hard to perform ex ante, internal
organization may be superior in auditing respects. Williamson (1975, p. 146) applies
a similar argument to argue that a firm’s general office has superior auditing
possibilities with regard to its divisions relative to the capital market. See
subsection 1.4 on incomplete contracts and authority. For a discussion of why,
in an incomplete-contract framework, the structure of authority may affect the
structure of information, see Holmstrom and Tirole 1987.


39. For a model of search over the possibility of superior outside opportunities,
see Harris and Holmstrom 1983.


40. In a similar spirit, some Department of Defense officials become friendly
with the firms they have repeatedly dealt with, and some consultants lose their
objectivity when they stay with a firm for a long time. For analogous reasons,
auditing firms rotate their personnel.


41. This point is most strikingly made by Hermalin (1986) in the context of
labor markets. Hermalin examines the implications of this signaling behavior for
training. See also, in Aghion and Bolton's (1987) reconsideration of the
market-foreclosure doctrine, the case of a current monopoly supplier of a good who
tries to signal that the technology makes entry unlikely in his market by
lowering penalties for breach for the buyer. A short-term contract is a
long-term contract that does not specify penalties for breach.
An Application: Repeated Franchise Bidding


We can now come back to Williamson’s warning against
the hazards of franchise bidding.42 First, he argues that
the incumbent's specific investments, in order to be
transferred to an alternative supplier at a fair rate (so as to
induce the incumbent to make them), must be observable
and verifiable. But equipment is often hard to evaluate.
Good records must have been kept, physical depreciation
must be measured adequately, and no kickbacks may have
been received by the incumbent in exchange for an
inflated price for equipment. Even harder to measure are
specific investments in the human capital of the
incumbent's personnel. So a fair reimbursement of investment
cannot come from simple accounting data. Rather, it must
be ensured by bidding over the equipment.43 After all,
the market assigns higher values to firms that have
committed more investment. But this also creates problems.
On the one hand, the incumbent's transferable
investments may not be observable by the other bidders. On
the other hand, some of these investments (such as human
capital) may not be recoverable by them.44 The use of
franchise bidding is thus likely to be costly in industries
with high specific investments (such as the electric power,
gas, telephone, and cable television industries) and more
advantageous in industries in which only a small
fraction of investments is sunk (as, perhaps, in the case
of the allocation of airline routes, or as in Demsetz’s
hypothetical example of the production of automobile
license plates under increasing returns with
general-purpose equipment).45


Empirica] Investigations 


The Williamsonian theory of long-run relationships
suggests that firms should write long and detailed contracts
where that is feasible and not too costly, and that the
incentive to do so increases with the lack of ex post outside
opportunities and the specificity of investments. Joskow
(1985, 1987) has made a detailed study of contracts
between coal mines and electric utilities in the United States.
(Most such relationships are run by contracts; vertical
integration is rare.) The specific investments that run the
risk of being expropriated here are, for the coal mines
(suppliers), investments in mining capacity; for the electric
utilities (buyers), they are investments in generating units
and in boilers adapted to a particular type of coal.46

One can distinguish between two polar geographical 
cases (of course, in practice things are not that clear cut). 
In one case, there are a large number of electric utilities 
and coal mines in the region, and transportation facilities 
for coal (railroads, barges) are numerous and competitive 
so that parties have a large choice of their partners (even 
ex post). Furthermore, the coal produced by the various
mines is quite homogeneous, so boiler design is fairly 
irrelevant. In such a region, the hazard of expropriation 
of specific assets is small, so spot markets (short-run con- 
tracts) are relatively efficient. In the other case, there 
are few coal mines, transportation facilities are limited, 
and the quality of coal is very variable. One would then 
expect complex long-term contracts (or vertical integra- 
tion) to support exchange. 

As Joskow shows, the regional differences in the United 
States offer a dramatic illustration of Williamsonian pre- 
dictions. In the East, underground mining (60 percent of 
production) offers few returns to scale; as a consequence 
there are many small mines. Transportation facilities are
numerous, and the quality of coal is relatively homo- 
geneous. In the West, where surface mining is prevalent, 
there are large returns to scale, so there are only a few large
mines. There is also less competition for transportation 
than in the East, and the quality of coal is very variable. 
Joskow’s study shows that contracts are of a much longer 
duration in the West than in the East (and that the spot 
market for coal is very important in the East and virtually 


43. Here we must distinguish between bidding for the equipment (the proceeds
of which go to the incumbent) and bidding for the monopoly position (the
proceeds of which go to the regulator).


44. To these inconveniences can be added the pre-bidding costs associated
with the "ratchet effect." The incumbent firm has little incentive to be efficient
before the new award of the monopoly situation, because this would convey
favorable information about its technology, inducing competitors to bid higher
and possibly inducing the regulator to fix a higher minimum price in the
auction. For related analyses of the ratchet effect, see Freixas et al. 1985 and
Laffont and Tirole 1988; and for an application to the labor market, see Gibbons
1987.


45. See Laffont and Tirole 1987 for an analysis of franchise bidding with 
unobservable transferable and nontransferable investments. 


46. Coal-specific boilers are more efficient than boilers designed to allow 
flexibility in the use of coal.

nonexistent in the West). Note here that the lack of
outside opportunities is related to asset specificities.

Another check of the theory involves “mine-mouth
plants” (electricity-generating plants located near coal
mines). Lack of competition and high transportation costs
induce an electric utility to locate near a coal mine,
which creates site specificities. Joskow found that mine-


ex ante causes of incompleteness are hard to
formalize. We have no well-established theories of complexity
and individual decision-making under unforeseeable
contingencies (two economically important phenomena).
Most existing contracts do not specify many relevant
contingencies. When these unspecified contingencies
occur, the actions of the concerned parties are likely to
lead to conflicts. We can distinguish two polar cases of
decision processes that, ex post, handle the unforeseen 
contingencies. 

The simplest decision process, bargaining, was con- 
sidered in subsection 1.3. There, we first assumed that no 
contract at all was signed ex ante, and that the parties 
bargained over the trading and transfer decisions after 
investing and learning their valuations and costs. The 
bargaining process was unconstrained by previous con- 
tracting; there was only the legal constraint that ex post 
trading be voluntary. We contrasted the solution with the 
one obtained under a complete contract. 

Intermediate forms of contracting which exist between 
no contract (and unconstrained ex post bargaining) and a 
complete contract are also available. These intermediate 
forms can save on transaction costs relative to complete 
contracts, but without the perverse effects of unconstrained
bargaining. We distinguish two possibilities. The
first is that the two concerned parties resort to a third 
party. This third party is supposed to make the efficient 
decisions that most closely resemble those that a complete
contract would have specified. His ex post trading
and transfer decisions must, therefore, yield ex post the 
efficient volume of trade and encourage ex ante the right 
amounts of specific investments. The second possibility 
gives one of the two concerned parties, rather than a 
third party, the right to determine what happens in an 
unspecified contingency. 


Arbitration 


The first type of resort to a third (unconcerned) party 
is external arbitration. For instance, a union and a firm 
may agree to go to binding arbitration if negotiations 
on a labor contract become stalled. Similarly, a supplier 


47. A broader perspective on incomplete contracts can be found in Holmstrom
and Tirole 1987.


48. A “complete contract” is a contract that has the relevant decisions (transfer,
trade, etc.) depend on all verifiable variables, including possibly announcements
by the parties (concerning their valuation, cost, etc.). By abuse of the
terminology, we can also call a contract that is a priori incomplete a “complete contract”
if it yields the parties the same payoffs as the optimal complete contract.


49. Empirically it may be difficult to distinguish between the two. For instance,
the adaptation of secretarial tasks to the introduction of word processors may
well not have been included in labor contracts before their introduction, not
because nobody had thought about the possibility of their advent but because
thinking about all the potential forms of word processing and including them
in the contract would have been prohibitively time consuming and costly.
and a buyer may agree to accept arbitration by outside


ions or its employees. Day-to-day contact with 
rations and personal experience with the em- 


rbitrator must be able to learn about and under- 
e situation at relatively low cost, and must be 


is jeopardized when executives are poorly informed.52
The second condition, independence, requires that the
arbitrator not be both judge and party. He must make
decisions that conform to the aggregate interests of the
two parties, and not favor one to the detriment of the
other. Independence may fail, for instance, when a chief
executive officer has kept close ties with a particular
division. More generally, the internal arbitrator, like the
external one, must be trusted, or else must develop a
reputation for settling disputes “fairly” (that is, the
arbitrator must confirm the parties’ expectation that he will
make the efficient decisions, the ones that would have
been specified in a complete contract).53


Authority 


The power to fill unspecified contingencies —authority— 
may be given to one of the concerned parties rather than 
to an arbitrator. As Grossman and Hart (1986) and Hart 
and Moore (1985) note, authority does not mean that the 
concerned parties do not negotiate ex post. The preferred 
decision of the party who has authority may be very 
costly to the other party. Some alternative decision may 
be mutually advantageous, and the party with authority 
may coerce some benefits by not exercising this authority. 
An important insight of the Grossman-Hart-Moore anal- 
ysis is that authority changes the status-quo point in the 
bargaining process—it puts the party that has it in a better 
bargaining situation. In turn, the ex post division of the
gains from trade will affect ex ante investments. 

To see how authority redistributes the gains from
trade, suppose with Grossman and Hart that, ex post,
the parties (the buyer and the supplier) must make some
decision d in a set D. They have ex post monetary payoffs
$B_i(d)$, where i = 1, 2.54 Giving authority to party 1 (the
buyer or the supplier) means that he is allowed to choose
ex post the d that he prefers. Thus, if the two parties
cannot agree on an alternative decision, party 1 chooses
$d_1^*$ so as to maximize $B_1(d)$. But if $d_1^*$ does not maximize
party 2's payoff, the two parties will, in general, have an
incentive to renegotiate to implement $d^*$ (the decision
that maximizes their joint payoff $B_1(d) + B_2(d)$).
Assuming that the transfer t from party 2 to party 1 is such
that gains from renegotiation are evenly distributed (i.e.,
assuming the Nash bargaining solution), we have

$$[B_1(d^*) + t] - B_1(d_1^*) = [B_2(d^*) - t] - B_2(d_1^*).$$

Letting $B_1$ and $B_2$ denote the final benefits (given the
transfer and decision $d^*$), we have

$$B_1 = B_1(d_1^*) + \tfrac{1}{2}\,[B_1(d^*) + B_2(d^*) - B_1(d_1^*) - B_2(d_1^*)]$$
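
The even split of the renegotiation gains can be illustrated with a small computation. This is a hedged sketch rather than anything from the text: the function name `payoffs_under_authority`, the three-element decision set, and the payoff numbers are all hypothetical. Given payoff tables for the two parties, it finds the status-quo decision chosen by the party with authority, the jointly efficient decision, and the final payoffs when the gain from renegotiation is shared equally.

```python
def payoffs_under_authority(b1, b2, holder):
    """Final payoffs when party `holder` (1 or 2) has authority.

    b1, b2 map each decision d in D to the monetary payoffs B_1(d), B_2(d).
    Returns (status_quo, efficient, transfer, final1, final2), where
    `transfer` is the payment t from party 2 to party 1 (possibly negative).
    """
    decisions = list(b1)
    own = b1 if holder == 1 else b2
    # Status quo: the decision the authority holder would impose on his own.
    status_quo = max(decisions, key=lambda d: own[d])
    # Efficient decision d*: maximizes the joint payoff B_1(d) + B_2(d).
    efficient = max(decisions, key=lambda d: b1[d] + b2[d])
    gain = (b1[efficient] + b2[efficient]) - (b1[status_quo] + b2[status_quo])
    # Nash bargaining: each party gets its status-quo payoff plus half the gain.
    final1 = b1[status_quo] + gain / 2
    final2 = b2[status_quo] + gain / 2
    transfer = final1 - b1[efficient]
    return status_quo, efficient, transfer, final1, final2


# Hypothetical payoff tables over D = {"low", "mid", "high"}.
B1 = {"low": 0.0, "mid": 3.0, "high": 4.0}
B2 = {"low": 5.0, "mid": 4.0, "high": 0.0}

with_party_1 = payoffs_under_authority(B1, B2, holder=1)
with_party_2 = payoffs_under_authority(B1, B2, holder=2)
print(with_party_1)
print(with_party_2)
```

In both cases the efficient decision is implemented after renegotiation, so the joint payoff is the same; only its division changes with the identity of the party holding authority.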


that this resort need not be contracted for in advance. However, courts
(beyond enforcing "reasonable contracts") only rule on "unfair practices" (i.e.,
rule out some types of behavior in the absence of agreement). Arbitrators can
be given more extensive powers (e.g., to choose the outcome in the absence of
Conversely, internal arbitration may adopt procedures similar to those
of external arbitration. In particular, some firms have internal grievance
procedures.


52. Authority may then have to be delegated to lower levels, e.g., the divisions.


53. Managers’ reputations for settling disputes fairly (i.e., efficiently) make up
part of what is meant by “corporate culture.”


54. The decision d may be a trading decision, and B may be a value or a cost.
But the issue is much more general. For instance, d could denote a quality or a
design decision, or $B_1$ could denote the effort exerted by party 1. Note that d can
be multidimensional.
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t] > By (a3), 
< B,(d3), 


es to invest. Under renegotiation, the status quo,
even if it is not observed, influences final payoffs; thus,
the distribution of authority still affects the incentives for
investment. To be more specific, one must describe how
the investments and the decisions affect payoffs (see


accumulated customers to new employees, the customers 
belong to the firm rather than to the employee; this distri- 
bution of ownership is enforced by a noncompetition 
clause. (Sometimes an implicit distinction is made between 
old clients and those brought in by the employee, as the 
theory would predict.) An engineer cannot easily quit 
his firm and patent an invention made possible by his 
firm's research effort. Grossman and Hart also cite the fact 
that insurance companies tend to use direct writers (i.e., 
employees who do not own their lists of clients) when the 
agent's specific investment in keeping customers is weak 
(as in life insurance) and independent agents (who do own
their lists) when it is substantial. 


Example 


In the spirit of the Grossman-Hart analysis, this
example shows how the distribution of authority affects the
division of the gains from trade and the incentive to
invest.

A buyer and a supplier contract to trade tomorrow.
Trading is not an issue, and the parties agree that the
good is to be exchanged in any case. The only
uncertainty concerns the final specification of the good. A basic
design is contracted for at date 1, but an opportunity to
improve its quality may arise at date 2 that cannot be
described at date 1 (one can imagine that there is an
“infinity” of such potential improvements, of which only
one will prove relevant). Whether the quality
improvement will be made cannot be directly determined at date
1. The relevant quality improvement is learned by both
parties at date 2. The second-period cost to the supplier,
c, is greater than zero. For simplicity, we suppose that c is
known at date 1 and is independent of the particular
improvement. The buyer picks an investment in period 1.
His second-period value for the improvement is v > c
with probability x and 0 with probability 1 − x; the cost
of investment, I, is equal to $x^2/2$. It is not clear to
outsiders which level of investment has been chosen, so
that the parties cannot contract on it. Note that v and
c are extra valuation and cost (beyond the values
corresponding to the basic design).
dimensionality of decision making is high, nonintegration with respect to
all decisions is prevalent. To make the distinction more relevant, it is natural to
focus on a small dimension of the authority structure, such as those decisions
that result from the ownership of a few physical assets; this is what Grossman
and Hart do.
The interpretation of the investment technology is as
follows: The buyer invests in flexibility. A higher
investment increases the probability that the improvement that
comes about can be used. For instance, an employer (a
buyer) can train his workers to adjust to changing
technologies, as in Piore and Sabel’s (1984) theory of
flexibility, or a power company (a buyer) can choose a more
costly and flexible boiler designed to adapt to some
variations in the coal from a mine (the supplier).

Let us first look at the social optimum in this model.
Obviously the quality improvement should be made if
and only if the buyer has value v. The optimal investment
is then given by

$$\max_x \left[ x(v - c) - x^2/2 \right].$$

Thus, $x^* = v - c$. The joint surplus is $W^* = (v - c)^2/2$.
Hence,

$$x^{B} = (v - c)/2 = x^*/2.$$

This result is nothing but our previous underinvestment
result under unconstrained bargaining. Joint surplus is
equal to

$$W^{B} = x^{B}(v - c) - (x^{B})^2/2 = 3(v - c)^2/8 = 3W^*/4.$$


Supplier control in this simple model is equivalent to 
unconstrained bargaining, because the status-quo point is 
the same: If the parties disagree, the supplier chooses not 
to make the improvement (the less costly action for him). 
His right allows him to use strict compliance with the 
original contract as a threat to get a good bargaining 
position. This situation describes roughly what happens 
in negotiations between the Department of Defense and 
defense contractors over design changes. The contractors 
are, in general, able to use their right not to make the 
changes to derive large profits from such changes. Because
of the equivalence, $x^{SC} = x^{B}$ and $W^{SC} = W^{B}$
(where SC stands for supplier control). Again the buyer's
investment is “half-expropriated,” and the buyer conse- 
quently underinvests. 

Under buyer control, the improvement would always be 
made if the status quo were not renegotiated. (Actually,
when the value to the buyer is zero, the buyer is indif- 
ferent between imposing the improvement and not doing 
so; we assume he does. On the one hand, he would do so 
if the value were only slightly positive; on the other hand, 
our conclusions would be even stronger if he did not 
impose the improvement.) If the value is v, the status quo 
is efficient and no bargaining occurs. The buyer gets v by 
imposing the improvement. If the value is 0, the status 
quo is inefficient and bargaining divides evenly the gain 
(c) of not making the improvement. In particular, the 
buyer gets c/2. Thus, the optimal choice of investment for 
the buyer is given by 


$$\max_x \left[ x v + (1 - x)\frac{c}{2} - \frac{x^2}{2} \right],$$

which yields

$$x^{BC} = v - c/2 > x^*$$

and

$$W^{BC} = \tfrac{1}{2}\,(v - c/2)(v - 3c/2).$$


The Theory of the Firm 


The striking result here is that the buyer now overinvests.
This is due to the fact that his authority will allow him not
to bear the production cost c if his value is v. Because he does
not internalize this production cost, he overinvests in the
activity that makes production more likely.

It is easy to see that it may be optimal to give the
authority to the buyer or to the supplier. When c = 0,
buyer control is socially efficient (there is no
noninternalized production cost); supplier control is not. If v = c > 0,
then no investment is optimal, and supplier control (or
unconstrained bargaining) is optimal; buyer control
encourages investment and yields a negative joint surplus.
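
The comparison of the three regimes in this example can be reproduced in a few lines. The sketch below is illustrative only: the function names and the numerical values of v and c are assumptions, and the closed-form investment levels are simply those derived in the text (v − c at the social optimum, (v − c)/2 under supplier control or unconstrained bargaining, v − c/2 under buyer control). Parameters are assumed to satisfy 0 ≤ c ≤ v and v − c/2 ≤ 1, so that x is a probability.

```python
def joint_surplus(x, v, c):
    """Joint surplus x(v - c) - x^2/2 when the improvement is made iff value is v."""
    return x * (v - c) - x ** 2 / 2


def regime_outcomes(v, c):
    """Investment x and joint surplus W in each regime of the example."""
    if not (0 <= c <= v and v - c / 2 <= 1):
        raise ValueError("need 0 <= c <= v and v - c/2 <= 1 so that x is a probability")
    investments = {
        "social_optimum": v - c,          # buyer internalizes the full gain v - c
        "supplier_control": (v - c) / 2,  # buyer keeps half of v - c
        "buyer_control": v - c / 2,       # buyer gets v, or c/2 when value is 0
    }
    return {name: (x, joint_surplus(x, v, c)) for name, x in investments.items()}


# Hypothetical parameter values.
generic = regime_outcomes(v=0.8, c=0.4)
zero_cost = regime_outcomes(v=0.8, c=0.0)   # buyer control is efficient
no_gains = regime_outcomes(v=0.5, c=0.5)    # buyer control yields negative surplus
for label, result in (("generic", generic), ("c = 0", zero_cost), ("v = c", no_gains)):
    print(label, result)
```

With c = 0 the buyer-control investment coincides with the social optimum, and with v = c the buyer-control surplus equals −c²/8, in line with the two polar cases just discussed.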


The Jcope of Authority 


We have assumed that the decision set D from which the
party with the authority can choose is well delineated ex
ante. This may be inconsistent with the transaction-cost
hypothesis. If contingencies are unforeseeable or are too
numerous to be included in the original contract, the
decision set that includes these contingencies is also likely
to be unforeseeable or too complex to be described in
that contract (indeed, the contingencies may be the
feasible actions themselves). But how is D defined if it is
not precisely specified in the original contract? Grossman
and Hart trace authority to the ownership of some
physical asset; the idea is that the owner of a machine has
the right to use the machine as he wishes. Kreps (1984)
notes that ownership may also pertain to intangible
assets, such as reputation. It can also be identified with
a function within a firm, through delegation of
authority. The foreman may have some authority over shop
management.

Although ownership (possibly in a delegated form)
helps define the set D, it cannot fully determine it. Even if
the owner of a machine is free to allocate it between
different uses, his elimination of its noise dampener may
not be considered by the workers or the courts to be
within the scope of his authority. A division manager
may not have the authority to require his
secretaries to use word processors with
sight-damaging screens. The Department of Defense may have
the authority to impose some uncontracted but standard
safety devices on defense contractors, but not to require
major design changes. As Kreps rightly notes, there must
be some common understanding as to the scope of
authority, even when this scope is not specified in advance;
and such an understanding (an aspect of corporate
culture) is facilitated when unforeseen contingencies follow
patterns.

The ex post definition of the scope of authority creates 
a role for arbitration similar to that created by the defini- 
tion of the decision d itself: An internal or external arbitra- 
tor may not be competent to fine-tune the decision but 
may have enough information to delineate the set of 
acceptable decisions. Indeed, in a typical firm the exercise 
of delegated authority (foreman over worker, say) is made 
possible by the presence of a superior authority (a man- 
ager, say) with powers of arbitration. 

Another possible safety valve against the abuse of
authority is to allow the party without authority to
terminate the relationship, so that, ex post, he has the
authority to reject the other party's authoritarian decision.


Empirical Investigations 


Vertical integration, like long-term contracts, is more 
likely the more specific the investments are. This may 
make it difficult to distinguish between the two empiri- 
cally. The main conclusion of the Coase-Williamson anal- 
ysis in this respect is that vertical integration is more 
likely (relative to a long-term contract) when “transaction 
costs” are high. One can, of course, conjecture that trans- 
action costs are high in situations with considerable tech- 
nological uncertainty. (This may explain why coal mines 
and power plants are rarely integrated, although another 
explanation for the low level of vertical integration in 
this industry is the regulator's resistance to it.) Unfortu- 
nately, degrees of “unforeseeability” and complexity 
are hard to measure empirically; good proxies must be 
found. 

Most case studies and regression analyses have fo- 
cused on the effect of the specificity of investment on the 
likelihood of vertical integration. Klein, Crawford, and 
Alchian (1978) offer a particularly interesting analysis of 
how asset specificities led to vertical integration between 
General Motors and Fisher Body and between pipeline 
firms and oil explorers or refineries. Monteverde and 
Teece (1982) look at the procurement of automotive com- 
ponents by U.S. automakers. To explain why some are 
purchased whereas others are manufactured internally, 
they use (in particular) the following variables: the
specificity of a component to the automaker (Is the component
designed specifically for that automaker? Can it be bought
from other suppliers on short notice?) and the complexity
of the system in which the component must be inserted
(engine, chassis, etc.).56 They show that, in particular,
the specificity variable is a significant determinant of
the integration decision. Masten (1984) finds analogous
results for aerospace manufacturers. In a similar spirit,
Anderson and Schmittlein (1984) study the integration
of the sales force (manufacturer's representative versus
employee salesperson) and show that the degree of
human-specific capital (measured by the manager's assessment of
the difficulty in learning the ins and outs of the company,
the nature of the product, the nature of the consumers,
and so forth) is negatively related to the probability of
using independent representatives.57


Reputation as a Substitute for Contracts or Integration 


The concept behind this and previous subsections is that,
in order to avoid future hazards, parties should sign
complete contracts, or, if contracts are too costly or
impossible to write, should at least make a correct use of the
authority structure (restricted contract). In practice,
however, Macaulay (1963) found that relations between firms
tended to be more informal than was predicted by the
theory. This is often true even when firms engage in
long-run relationships. Efficiency is then sustained by the
firms’ reputation. A firm that cheats at some date (i.e.,
makes decisions that are not jointly efficient) runs the
risk of losing future profitable deals with its partner (see
Williamson 1975, chapter 6; Kreps 1984; subsection 2.2
of this chapter; and chapters 2 and 6 below). Reputation
allows firms to save on the costs of writing complete
contracts or even on the costs of distributing authority.
On the other hand, informality exposes the firms to the
threat of opportunism. Thus, one would expect
informality to be most prevalent when specific investments are
limited and when trade is sufficiently frequent that the
incentive to cheat is low.


Dual Sourcing as a Substitute for Contracts 


An alternative way of avoiding the ex post hold-up prob- 
lem is to introduce ex post competition whenever that 
is feasible. Farrell and Gallini (1986) and Shepard (1986) 
have analyzed Williamsonian models in which the buyer 
invests in specific assets and the seller chooses, ex post, 
some ex ante noncontractible variable (call it “quality”).58
Ex post, the seller has an incentive to choose low
quality59; therefore, ex ante, the buyer invests little in the
relationship. Dual sourcing consists in having two or more
suppliers, who compete ex post on quality. This raises 
the equilibrium level of quality and the ex ante invest- 
ment. Competition can thus alleviate the ex post bilateral 
monopoly problem and raise efficiency. Farrell and Gallini 
and Shepard argue that this is a persuasive explanation of 
why Intel licenses its microprocessor technologies and 
why IBM adopts an “open architecture” policy in regard 
to its personal computers. 


2 The Profit-Maximization Hypothesis 


It is a postulate of this book, and of most economic 
theory, that firms maximize expected profits. There is, 
however, a widespread feeling that in practice their man- 
agers have other objectives (e.g., maximizing the firm's 
size and growth and the perquisites of the managerial po- 
sition).⁶⁰ This section presents arguments for and against 
the profit-maximization hypothesis. It also discusses the 
power of the current theory of industrial organization 
in the presence of non-profit-maximizing firms. 

The shareholders of a firm are claimants for its revenue, 
net of various input costs. Thus, if they were able to 
run the firm, they would choose decisions that would 


behind the introduction of this variable is as follows. Both 
Williamson (1975) and Scherer (1980, p. 90) have argued that vertical inte- 
gration permits executive fiat to obtain better coordination (because the exact 
timing of the production process is generally left indeterminate in the incom- 
plete contract). A complex system may require more coordination of inputs. 


57. See Williamson 1985 for other interesting examples. 

58. This variable is a price in Farrell and Gallini 1986 and a delivery lag in 
Shepard 1986. 

59. In these models, a high price or a long delivery lag. 


60. See the models of firm behavior of the 1950s and the 1960s (e.g., Baumol 
1962, Marris 1964). 
minimize cost and maximize profit.⁶¹ Thus, non-profit- 
maximization is mainly associated with the separation of 


taken the approach that firms' deviations from 
profit-maximizing behavior should be explained rather 


growth may be desired by the managers not for its own 
sake but because it allows them and their subordinates to 
enjoy greater opportunities for promotion. 

Reviewing the principal-agent literature and alterna- 
tive approaches is beyond the scope of this chapter.⁶³ 
This treatment will be restricted to the main issues. First 
we will consider the basic moral-hazard problem and how 
direct monetary incentives, yardstick competition, take- 
over bids, product-market competition, and supervision 
can reduce managerial slack or discretion. The limits of 
these control mechanisms will be pointed out. It will 


61. There are two reasons why shareholders might not want to maximize 
expected profit (or net market value). First, they may be risk-averse; they may 
then want the firm to make decisions that lead the firm's profit to be negatively 
correlated with the economy's fluctuations (so as to get a less risky portfolio) 
even if those decisions do not quite maximize expected profit. Second, and also 
in a general-equilibrium context, a firm's price for one of its products, say, 
influences the shareholders' welfare not only through the firm's profit but also 
through their consumption of the firm's good as long as the firm is not perfectly 
competitive. (Actually, shareholders may not even agree on the firm's best 
decision in such cases.) These two general-equilibrium effects are important 
theoretically, but they do not look very strong empirically. The shareholders' 
portfolios are relatively well diversified, and it is not clear how most decisions of 
the firm relate to aggregate shocks in the economy. Furthermore, the share- 
holders' (at least the influential ones') consumption of their firms' products is 
usually very small, so that price effects are small relative to the income effect 
generated by the firm's profit levels. 


62. For a review of some evidence on the separation of ownership and control, 
see Scherer 1980, pp. 32-33. 


then be argued that even if managerial slack invalidates 
the profit-maximization hypothesis, the implications of 
this hypothesis for industrial organization need not be 
erroneous. 

The managerial rewards that will be discussed below 
should be taken in a broad sense. Such rewards can be 
monetary (as in the models below), but could also consist 
of promotions, tolerance of perquisites, prestige, assign- 
ment of cash flow to one's division, and so on. Further- 
more, it should be noted that the principal-agent rela- 
tionship will be discussed mainly in the context of the 
separation of ownership and control between shareholders 
and managers. Clearly, many of the incentive devices 
described below apply to other tiers of the firm's hier- 
archy; indeed, some of them apply more to lower tiers 
than to executive officers. 


2.1 The Basic Incentive Problem 


The agency problem in its moral-hazard form (the focus 
of this subsection) stems from a basic conflict between 
insurance and incentives. On the one hand, the theory of 
optimal insurance demonstrates that the optimal division 
of a pie of a random size (the profit) between a risk- 
neutral party (the shareholders) and a risk-averse one 
(the manager)⁶⁴ has the risk-neutral party bear all the risk, 
if incentive issues are left aside. (See, e.g., Arrow 1970 and 
Borch 1963.) Suppose there is a pie of random size $\Pi$ to 
be divided between the two parties, and that this random 
variable is not affected by the parties' actions. Let $\Pi$ 
take values in a discrete set $\Pi_1 < \cdots < \Pi_i < \cdots < \Pi_n$, 


63. See Arrow 1985 for an informal introduction to the subject. One 
generally distinguishes between "hidden-action" (or "moral-hazard") models 
and "hidden-knowledge" models. In a hidden-action model, the agent takes 
some action that is unobservable to the principal. In a hidden-knowledge 
model, he has superior information about some exogenous environmental vari- 
able. Hidden-knowledge models are subdivided into two groups, depending on 
whether the agent obtains his information after or before signing the contract 
(the latter case refers to an "adverse selection" situation). Hart and Holmström 
(1986) offer a useful discussion of non-adverse-selection models (as well as of 
incomplete contract theory). For recent surveys of the adverse-selection litera- 
ture in a context of procurement or regulation, see Baron 1986, Caillaud et al. 
1988, and Sappington and Stiglitz 1987. Most of this section is concerned with 
moral-hazard issues. 


64. With respective objective functions $Ex$ and $Eu(x)$, where $x$ is the income, 
$u$ is an increasing and strictly concave utility function, and $E(\cdot)$ denotes the 
expectation over the random variable $x$. 




with probabilities $p_1, \ldots, p_i, \ldots, p_n$ (where $p_i > 0$ and 
$\sum_{i=1}^{n} p_i = 1$). Let $\Pi - w(\Pi)$ and $w(\Pi)$ denote the alloca- 
tions to the risk-neutral party and the risk-averse party 
when the realization is $\Pi$ (write $w_i \equiv w(\Pi_i)$). The parties' expected utilities 
are 

$$E[\Pi - w(\Pi)] = \sum_{i} p_i (\Pi_i - w_i)$$

and 

$$E\,u(w(\Pi)) = \sum_{i} p_i\, u(w_i),$$


(i.e., should have a constant income over all states of 
nature). This is where the issue of incentives arises. Sup- 
pose the risk-averse party takes some unobservable action 
that affects the size of the pie to be divided (in a stochastic 
sense) and that this action is costly to him. Think of this 
action as a level of effort (it could be a more general 
discretionary choice). Suppose, further, that the risk- 
neutral party observes only the realization of the pie (the 
level of profit). The risk-averse party, if given an income 
that does not depend on this realization, has no incentive 
to exert effort, because his effort does not affect his 
income. So full insurance conflicts with incentives. Indeed, 
the trade-off between insurance and incentive objectives 
generally leaves the parties with both suboptimal insur- 
ance and suboptimal profits. 

There is one case in which this trade-off does not arise. 
Assume that both parties are risk-neutral (in particular, $u'$ 
is constant), so that the party who takes the unobservable 
action (the agent) does not need to be insured. The other 
party (the principal) can ensure that the agent takes the 
jointly optimal action by “selling” the pie to him—that is, 
the principal receives a transfer price independent of the 
size of the pie, and the agent becomes the residual claimant 
for the remainder of the pie. Because the agent's expected 
income is equal (up to the fixed transfer price) to the 
expected size of the pie, the agent has all the incentive to 
pick the optimal action, i.e., the action that maximizes the 
expected size of the pie net of the action's cost (see 
section 3). The agent bears all the risk under this arrange- 
ment, but this does not matter because he is risk-neutral. 
Residual claimancy for the party that takes the unobserv- 
able action offers a very general solution to the incentive 
problem, and it will be encountered again in chapter 3 and 
especially in chapter 4. However, it is clear that for a 
risk-averse agent the residual claimancy conflicts with the 
insurance objective. 

Finding the optimal incentive scheme when the agent 
is risk-averse is a complex task. (The supplementary sec- 
tion contains a few results.) The following simple exam- 
ples illustrate the issues. 


Example 1 


A firm's profit may take one of two values: $\Pi_1$ and $\Pi_2$ 
(with $\Pi_1 < \Pi_2$). The firm is run by a manager, who 
chooses between two levels of effort: high ("work") and 
low ("shirk"). The manager has utility $U = u(w - \Phi)$ 
when he works and $U = u(w)$ when he shirks, where $w$ is 
the manager's wage, $u$ is an increasing, concave function 
(with $\lim_{w \to -\infty} u'(w) = +\infty$), and $\Phi$ (the monetary dis- 
utility of high effort) is strictly positive. The manager's 
objective function is the expectation of $u$. Working out- 
side the firm, he would get $U_0 = u(w_0)$. So, to ensure his 
participation, the shareholders must give him an expected 
utility of at least $U_0$. ($w_0$ is called the reservation (net) 
wage.) The shareholders' objective function is the expecta- 
tion of the net profit $\Pi - w$. 

The technology is as follows: If the manager works, the 
profit is $\Pi_2$ with probability $x$ and equal to $\Pi_1$ with 
probability $1 - x$. If he does not work, the profit is $\Pi_2$ 
with probability $y$ and $\Pi_1$ with probability $1 - y$. One 
has $0 < y < x < 1$. 

Assume that the manager's contract is chosen by the 
shareholders. 
First, suppose that the manager's effort is observed by 
the shareholders, who can then choose any level of effort 
they want and impose it on the manager (with the threat 
of a large punishment if he disobeys). Because effort is 
observable, there is no incentive issue; hence, the optimal 
contract calls for full insurance. Suppose first that they 
demand a low level of effort. Optimal insurance implies 
that $w_1 = w_2 = w_0$, where the second equality is due to 
the fact that the shareholders neither want nor need to 
give the manager more than his reservation wage. The 
shareholders' profits are 

$$y\Pi_2 + (1 - y)\Pi_1 - w_0.$$

Now, suppose the shareholders demand a high level of 
effort. Optimal insurance again calls for a constant net 
wage for the manager, so 

$$w_1 - \Phi = w_2 - \Phi = w_0.$$

The shareholders' expected profits are then 

$$x\Pi_2 + (1 - x)\Pi_1 - (w_0 + \Phi).$$

To make things interesting, assume that it is optimal for 
the shareholders to demand the high effort: 

$$x\Pi_2 + (1 - x)\Pi_1 - (w_0 + \Phi) > y\Pi_2 + (1 - y)\Pi_1 - w_0,$$

or 

$$(x - y)(\Pi_2 - \Pi_1) > \Phi. \tag{2}$$

In words, the increase in expected profits exceeds the 
disutility of effort. 

Now consider the more interesting case in which the 
manager's effort is not observable by the shareholders. As 
has been noted, a high effort cannot be induced by a 
constant wage structure. Instead, the shareholders must 
reward the manager when profits are high. Suppose that 
the shareholders still want to induce the high effort. They 
must design a wage structure that satisfies the "incentive- 
compatibility" constraint: 

$$x\,u(w_2 - \Phi) + (1 - x)\,u(w_1 - \Phi) \ge y\,u(w_2) + (1 - y)\,u(w_1), \tag{3}$$

where $w_i$ is the wage paid when realized profits are $\Pi_i$. 
(Equation 3 implies that $w_2 > w_1$.⁶⁵) 

To the incentive-compatibility constraint we must add 
the "individual-rationality" or "participation" constraint: 

$$x\,u(w_2 - \Phi) + (1 - x)\,u(w_1 - \Phi) \ge u(w_0). \tag{4}$$

The shareholders' expected profit is then 

$$x(\Pi_2 - w_2) + (1 - x)(\Pi_1 - w_1).$$


It is easily seen that, in the maximization of the 
shareholders' profits subject to equations 3 and 4, 
both constraints are binding. (Suppose the incentive- 
compatibility constraint is not binding. Maximization of 
expected shareholders' profit subject to the participation 
constraint yields full insurance ($w_1 = w_2$), as has been 
shown, but this wage structure does not satisfy the 
incentive-compatibility constraint. Conversely, suppose 
that only the incentive-compatibility constraint is binding. 
Then the shareholders can reduce $w_1$, say, and keep this 
constraint satisfied; if the decrease in $w_1$ is not too large, 
the participation constraint is still satisfied.) Thus, in this 
simple case, the optimal wage structure, given that the 
high effort is to be induced, is obtained from equations 3 
and 4 satisfied with equality: 

$$x\,u(w_2 - \Phi) + (1 - x)\,u(w_1 - \Phi) = y\,u(w_2) + (1 - y)\,u(w_1), \tag{3'}$$

$$x\,u(w_2 - \Phi) + (1 - x)\,u(w_1 - \Phi) = u(w_0). \tag{4'}$$

The shareholders' profit is lower under unobservabil- 
ity: equation 4' and the concavity of $u$ imply that the 
expected wage bill, $x w_2 + (1 - x) w_1$, strictly exceeds 
$w_0 + \Phi$, as figure 3 shows.⁶⁶ Hence, to induce the high 
effort and obtain the high profit with probability $x$, the 
wage bill must be higher than under effort observability. 
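
As a numerical illustration of the two binding constraints, the sketch below solves equations 3' and 4' in closed form under an assumed CARA utility $u(w) = -e^{-rw}$ (this functional form, the function names, and the parameter values are illustrative assumptions, not taken from the text). With CARA utility, $u(w - \Phi) = e^{r\Phi}u(w)$, so the two equations become linear in $e^{-rw_1}$ and $e^{-rw_2}$.

```python
import math


def u(w, r=1.0):
    """CARA utility; an illustrative assumption, not the text's general u."""
    return -math.exp(-r * w)


def optimal_wages(x, y, phi, w0, r=1.0):
    """Solve the binding constraints (3') and (4') for (w1, w2) under CARA utility."""
    k = math.exp(r * phi)   # u(w - phi) = k * u(w) under CARA
    c = math.exp(-r * w0)   # c = -u(w0)
    # Unknowns A = exp(-r*w2) and B = exp(-r*w1). The constraints read:
    #   x*A + (1 - x)*B = c / k   (participation, 4')
    #   y*A + (1 - y)*B = c       (incentive compatibility 3', using 4')
    A = c * ((1 - y) / k - (1 - x)) / (x - y)
    B = c * (x - y / k) / (x - y)
    if A <= 0:
        raise ValueError("high effort cannot be induced with these parameters")
    return -math.log(B) / r, -math.log(A) / r


# Hypothetical parameter values chosen only for illustration.
x, y, phi, w0 = 0.8, 0.4, 0.5, 1.0
w1, w2 = optimal_wages(x, y, phi, w0)
expected_wage = x * w2 + (1 - x) * w1
participation = x * u(w2 - phi) + (1 - x) * u(w1 - phi)

print(f"w1 = {w1:.4f}, w2 = {w2:.4f}")
print(f"expected wage bill = {expected_wage:.4f}, w0 + phi = {w0 + phi:.4f}")
```

Under these assumed numbers the high-profit wage exceeds the low-profit wage and the expected wage bill is larger than $w_0 + \Phi$, which is the cost of unobservability described above.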

On the other hand, if the shareholders wanted to induce 
the low effort under effort observability, they would not 
suffer from effort unobservability. The full-information 
wage is constant ($w_1 = w_2 = w_0$), and it also induces 
the low effort under unobservability. Thus, the relative 
desirability of inducing the high effort is lower under 
unobservability; that is, the shareholders may be happier 


65. The left-hand side of equation 3 is strictly lower than $x\,u(w_2) 
+ (1 - x)\,u(w_1)$, which is lower than the right-hand side if $w_2 \le w_1$ (recall that 
$x > y$). 

66. Figure 3 is an illustration of Jensen's inequality, according to which the 
expectation of a concave function of a random variable is lower than the value 
of this function evaluated at the expectation of the random variable. 
[Figure 3: the participation constraint $x\,u(w_2 - \Phi) + (1 - x)\,u(w_1 - \Phi) = u(w_0)$ for a concave $u$.]


points: Effort, if it is not observed, must be induced 
through incentives. The manager's wage must grow with 


e shareholders not wish to induce effort; that is, 
ay tolerate slacking. 


gent on performance (e.g., contracts that read "If 
agent's performance meets such and such standard, 
will pay him so much") cannot be made, as the courts 


procedures may measure only the team's performance, 
not individual contributions. However, an insider (the 
chief executive officer or a supervisor, say) may be able 
to disentangle these contributions, whereas an outsider 
(a judge) cannot. This applies equally well to the per- 
formance of complementary divisions (manufacturing and 
marketing, for instance) or to that of team workers. 

Now, suppose that in an agency problem $\Pi$ is observ- 
able by the principal but is not verifiable, so the contract 
cannot depend directly on the agent's performance. Can 
the principal be trusted to announce truthfully what he 
observes? A priori, no: Suppose that in the previous model 
the optimal contract induces effort when profit is verifi- 
able. When profit is only observable, the principal has an 
incentive to claim that it is low ($\Pi_1$) even when it is high 
($\Pi_2$), since $w_1 < w_2$. There is a simple conflict of interest 
for the principal. 

The picture changes dramatically when the principal 
oversees many agents (division managers, workers, etc.). 
To simplify, consider a large number N of agents, each of 
whom produces an observable but unverifiable profit. As 
in the previous model, the probability that the individual 
performance is $\Pi_2$ rather than $\Pi_1$ is $x$ or $y$, depending on 
whether or not the agent exerts effort. The probabilities 
are independent.⁶⁸ Consider the following commitment 
by the principal: "I will pay a wage $w_2$ to a fraction $x$ of my 
agents (the ones I announce to be the most productive 
ones), and a wage $w_1$ to the rest," where $w_1$ and $w_2$ solve 
equations 3' and 4' (i.e., are the optimal wages under 
verifiability). Clearly, the total wage bill, 

$$N[x w_2 + (1 - x) w_1],$$


67. In the model discussed above, $\Pi$ need not represent the firm's profit; it 
could also represent an employee's performance. 

68. Correlation between the production processes is not required here, as it is 
in the case of the tournament mechanism given below. 
is fixed, and the principal does not have an incentive to 
misrepresent the individual performances.⁶⁹ Conversely, 
if the agents all exert effort, they know that a fraction $x$ of 
them will yield profit $\Pi_2$ (by the law of large numbers) 
and will receive wage $w_2$. Those yielding profit $\Pi_1$ will 
receive wage $w_1$. Hence, the incentive-compatibility con- 
straint and the individual-rationality constraint are satis- 
fied. With many agents, the principal can be given the 
authority to choose rewards, because he can commit 
himself to an overall reward policy. Thus, verifiability is 
obtained indirectly.⁷⁰ 


Remark The rewarding of observable but unverifiable 
performance through authority is a more general phe- 
nomenon. Here, the fixed size of the aggregate reward 
prevents authority from becoming arbitrary by removing 
the principal's incentive to misrepresent the agents' per- 
formances. An alternative but similar mechanism, which 
works even with a single agent, exists if the principal has 
a reputation to defend. For instance, an employer who has 
a reputation for treating his employees fairly (rewarding 
them according to their performance), that is, an em- 
ployer who does not abuse authority, is able to offer 
his employees better incentives, and therefore may be 
reluctant to milk his reputation by not rewarding them 
fairly simply to increase short-run profit. 


Limited Punishment and Managerial Rents 


In the previous model, it was shown that the manager's 
individual-rationality constraint is binding. The argument 
was that if it were not binding, the shareholders could 
reduce the wage $w_1$ a bit; this would not impair incentives 
and would still induce the manager to participate. In some 
circumstances, however, reducing the wage may not be 
possible. Suppose that, because of limited liability and 
laws against slavery, the harshest punishment that can be 
imposed on the manager is for him to receive $w_0$ (so 
$w_i \ge w_0$ for all $i$ is a new constraint in the design of the 
wage structure). One can interpret $w_0$ as the equivalent in 
terms of utility of the wage that the manager can get 
elsewhere net of the search costs.⁷¹ Alternatively, one 
might imagine that the manager becomes infinitely risk- 
averse under $w_0$; $w_0$ could be like a subsistence level, a fall 
slightly below which would yield utility $-\infty$ ("death") to 
the manager. 

Because $w_1$ and $w_2$ necessarily exceed $w_0$ (weakly), and 
because the manager can always choose not to work, the 
participation constraint is automatically satisfied. Suppose 
the shareholders still want to induce effort (which will be 
the case if $\Pi_2 - \Pi_1$ is large enough). To do so they need 
to impose a wage differential between the two levels of 
profit: $w_2 > w_1 \ge w_0$. Again, because the manager can 
always choose not to work, his expected utility is no less 
than 

$$y\,u(w_2) + (1 - y)\,u(w_1) > u(w_0).$$

The participation constraint is not binding, which means 
that the manager enjoys a rent within the firm.⁷² (Here 
$w_1$ is equal to $w_0$, and $w_2$ is then given by equation 3'.) 
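
The rent can be made concrete with a small numerical sketch. It assumes a CARA utility $u(w) = 1 - e^{-w}$ and hypothetical parameter values (neither comes from the text), sets the low-profit wage at the floor $w_0$, and finds the high-profit wage by bisection on the binding incentive-compatibility constraint.

```python
import math


def u(w):
    """CARA utility; an illustrative assumption."""
    return 1.0 - math.exp(-w)


def ic_gap(w2, w1, x, y, phi):
    """Left-hand side minus right-hand side of the incentive constraint."""
    work = x * u(w2 - phi) + (1 - x) * u(w1 - phi)
    shirk = y * u(w2) + (1 - y) * u(w1)
    return work - shirk


def high_wage_with_floor(x, y, phi, w0, upper=50.0, tol=1e-12):
    """Smallest w2 satisfying incentive compatibility when w1 is held at w0."""
    lo, hi = w0, upper
    if ic_gap(hi, w0, x, y, phi) < 0:
        raise ValueError("effort cannot be induced with w1 = w0")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ic_gap(mid, w0, x, y, phi) < 0:
            lo = mid   # reward still too small to induce effort
        else:
            hi = mid
    return hi


# Hypothetical parameter values chosen only for illustration.
x, y, phi, w0 = 0.8, 0.4, 0.5, 1.0
w2 = high_wage_with_floor(x, y, phi, w0)
rent = x * u(w2 - phi) + (1 - x) * u(w0 - phi) - u(w0)

print(f"w2 = {w2:.4f}, rent in utility terms = {rent:.4f}")
```

With these assumed numbers the manager's expected utility inside the firm strictly exceeds $u(w_0)$, so the participation constraint is slack, as argued above.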

The possibility of rents (Calvo 1977; Calvo and Wellisz 
1978, 1979) underlies much of the efficiency-wage hy- 
pothesis for involuntary unemployment, according to 
which workers inside a firm are given a rent relative to 
unemployed ones as a way of giving them work incen- 
tives in the face of limited punishments (see, e.g., Shapiro 
and Stiglitz 1984).⁷³ 


Example 2 


The purpose of this example is to confirm our earlier 
intuitions on a simple example with a continuous choice 
of effort.⁷⁴ The manager chooses a level of effort $e$ on the 
real line. His utility is equal to $u(w - Re^2/2)$, where $R$ is a 
parameter of disutility of labor and $u$ satisfies the assump- 
tions of example 1. His net reservation wage is $w_0$, so the 
participation constraint is 


69. As long as he does not collude with some of the agents. 


70. A similar argument is made in Bhattacharya 1983, Carmichael 1983, and 
Malcomson 1984. It still holds with a finite number of agents (i.e., not a large 
number), though not in such a stark way. 


71. There are some subtle issues of timing of production and profit accounting 
here, but a dynamic model would complicate the argument. Note also that the 

72. Technically, the existence of a rent is associated with the impossibility of 
imposing punishments beyond a given level. For a sufficient condition to rule 
out rents, see the earlier reasoning and (more generally) proposition 2 in 
Grossman and Hart 1983. 


73. See Milgrom 1986 for a further discussion of rents attached to jobs. 


74. This example is taken from Parsons 1984, where it is attributed to Berhold 
(1971) and Stiglitz (1975). 
$$E\,u(w - Re^2/2) \ge u(w_0), \tag{5}$$


ing that randomness occurs after the choice of effort, 
ugh in the current model it could occur and be 


the shareholders can observe effort, the optimal con- 
involves a fixed wage w = i. For a given effort 


maximization of the shareholders' expected profit 

$$E(e + \varepsilon - w_0 - Re^2/2) = e - w_0 - Re^2/2$$

yields $e^* = 1/R$ (assuming $w_0 < 1/(2R)$). Suppose that 
effort is not observable, but profit is. 

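We will restrict ourselves to linear incentive schemes,⁷⁵ 
so let $w(\Pi) = a + b\Pi$. Now determine the optimal 
scheme in this class. The manager's expected utility is 

$$E\,u(a + be + b\varepsilon - Re^2/2).$$

Maximization with respect to $e$ yields $e = b/R$. Effort 
grows with the slope of the incentive scheme, and for 
$b = 1$ the manager is residual claimant and $e = e^*$. The 
manager's expected utility is, thus, 

$$E\,u(a + b^2/2R + b\varepsilon).$$

The shareholders' expected net profit (recall that profit is $\Pi = e + \varepsilon$ with $E\varepsilon = 0$) is 

$$\Pi^e = E(\Pi - w) = \frac{b}{R}(1 - b) - a,$$

and the participation constraint is 

$$E\,u\!\left(a + \frac{b^2}{2R} + b\varepsilon\right) \ge u(w_0).$$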


Substituting $a$ into the participation constraint (which is 
binding here) yields 

$$E\,u\!\left(\frac{b}{R} - \frac{b^2}{2R} - \Pi^e + b\varepsilon\right) = u(w_0). \tag{6}$$

It is clear that if $\Pi^e$ is to be maximized, the shareholders 
must choose $b$ so as to maximize the left-hand side of 
equation 6. Hence, we have 

$$(Eu')\,\frac{1 - b}{R} + E(u'\varepsilon) = 0. \tag{7}$$


If the manager is risk-neutral, $u'$ is a constant independent 
of $\varepsilon$, and equation 7 yields $b = 1$. This confirms the princi- 
ple of residual claimancy for risk-neutral agents. If $u$ is 
strictly concave, we claim that $b$ lies between 0 and 1. 
Suppose that $b \le 0$. Then the first term on the left-hand 
side of equation 7 is strictly positive. The second term, 
which is equal to the covariance of $u'$ and $\varepsilon$ (recall that 
$E\varepsilon = 0$), is non-negative, so equation 7 cannot hold. The 
reason the covariance is non-negative is that for $b \le 0$ 
the manager's income is nonincreasing in $\varepsilon$, so that his 
marginal utility, which is a decreasing function of income, 
is nondecreasing in $\varepsilon$. The reasoning for $b > 1$ is similar, 
but here the first term is negative, and the covariance is 
also negative. 

We thus infer that the optimal linear wage structure is 
a profit-sharing scheme—a compromise between a fixed 
wage (b = 0), which yields optimal insurance, and residual 
claimancy (b = 1), which yields optimal incentives. 
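
A small numerical check of equation 7 can make the compromise visible. The sketch below assumes CARA utility with risk-aversion coefficient `r` and a two-point shock $\varepsilon = \pm s$ with equal probabilities (both are illustrative assumptions, as are the function names and parameter values). Under these assumptions $Eu'$ is proportional to $\cosh(rbs)$ and $E(u'\varepsilon)$ to $-s\sinh(rbs)$, so equation 7 reduces to a scalar equation in $b$ that is solved by bisection.

```python
import math


def foc(b, R, r, s):
    """Left-hand side of equation 7, up to a positive factor.

    Assumes CARA utility with coefficient r and eps = +s or -s with
    probability 1/2 each.
    """
    return (1 - b) / R * math.cosh(r * b * s) - s * math.sinh(r * b * s)


def optimal_slope(R, r, s, tol=1e-12):
    """Solve equation 7 for b on [0, 1] by bisection."""
    lo, hi = 0.0, 1.0   # foc(0) = 1/R > 0 and foc(1) <= 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if foc(mid, R, r, s) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical parameter values chosen only for illustration.
b_moderate = optimal_slope(R=2.0, r=1.0, s=1.0)
b_very_averse = optimal_slope(R=2.0, r=3.0, s=1.0)
b_risk_neutral = optimal_slope(R=2.0, r=0.0, s=1.0)

print(f"b (r = 1) = {b_moderate:.4f}")
print(f"b (r = 3) = {b_very_averse:.4f}")
print(f"b (r = 0) = {b_risk_neutral:.4f}")
```

In this parameterization the slope lies strictly between 0 and 1 for a risk-averse manager, falls as risk aversion rises, and approaches 1 (residual claimancy) as the manager becomes risk-neutral.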


Remark In the above example, the manager was assumed 
to be rewarded on the basis of profit. In practice, manage- 
rial compensation is contingent on the value of the firm as 
well as on its current profits. Lewellen (1971) documents 
that a firm’s stock options are often a large portion of its 
managerial portfolios. The general idea behind rewarding 
managers on the basis of stock value rather than profits is 
that profits are a very garbled measure of managerial 
performance (Lewellen 1968, chapter 4; Grossman and 
Hart 1980, p. 48). For instance, a profitable investment 
reduces current profits without reflecting managerial slack 
or ineptitude. But such factors, which are unverifiable 
because of accounting manipulations, may be observable 


75. This is a strong restriction. The supplementary section at the end of this 
chapter looks at optimal nonlinear schemes. Holmstrom and Milgrom (1987) 
and Laffont and Tirole (1986) offer two different special contexts in which the 
optimal incentive scheme indeed turns out to be linear. 

by the market and thus may be reflected in the firm's 
valuation. Stock options in particular are seen as an incen- 
tive for an otherwise transient manager to care about the 
firm's future profits as well as its current profits.⁷⁶ 


2.2 Limits to Discretion 


In the preceding subsection we considered the use of 
performance measures to limit managerial discretion. In 
practice, the shareholders may want to use other pieces of 
information as well. We will look at other factors that 
restrict managerial discretion even further. 


Yardstick Competition 


An agent's individual performance, even if it is verifiable, 
is only a garbled measure of the agent's effort (see exam- 
ples 1 and 2 above). For instance, a firm's low profit 
may be due to a decrease in demand or an increase in 
costs rather than to managerial slack. Such effects can 
be detected, to some extent, by comparing the agent's 
performance with that of other agents placed in similar 
conditions.⁷⁷ 

To see how yardstick competition works, consider 
example 1 above. Suppose that the shareholders oversee 
two managers in charge of two similar divisions. The 
shareholders' profits are equal to the sum of the profits 
generated by each manager, net of the expected wage bill. 
As before, the probability of generating profit $\Pi_2$ rather 
than $\Pi_1$ is $x$ or $y$ depending on whether the manager 
works or not. Furthermore, the uncertainties facing the 
managers are perfectly correlated, in that the same level 
of effort yields the same profit. Thus, if both managers 
choose to work, the realized profit is either $\Pi_2$ for both 
(with probability $x$) or $\Pi_1$ for both (with probability 
$1 - x$), and similarly when they both choose not to work. 
One may have in mind, for instance, the case of two 
divisions serving two distinct geographical markets 
whose demands are perfectly correlated. 

In these circumstances, the shareholders can use the 
following contract: "If both managers reach the same 
level of profit (be it $\Pi_1$ or $\Pi_2$), both receive the full- 
information wage $w_1 = w_2 = w_0 + \Phi$; if profits differ, 
the high-profit manager gets $w = w_0 + \Phi$ and the low- 
profit manager gets heavily punished." Each manager's 
wage thus depends on the other manager's performance 
as well as on his own. Clearly, both managers exerting 
effort is an equilibrium of the subsequent game between 
the managers. If a manager is expected to work, and 
yields the high profit, the other manager automatically 
reveals that he did not work by obtaining the low profit. 
He cannot attribute his poor performance to "adverse 
circumstances," and he is heavily fined.⁷⁸ 
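
The equilibrium logic of this contract can be checked with a short sketch. It assumes a CARA utility, a hypothetical punishment wage `punish`, and one particular way of modeling perfect correlation: a common shock $\theta$ uniform on $[0, 1]$, with a working manager yielding the high profit when $\theta < x$ and a shirking manager when $\theta < y$. None of these modeling details is taken from the text.

```python
import math


def u(w):
    """CARA utility; an illustrative assumption."""
    return 1.0 - math.exp(-w)


def expected_utility(own_work, other_work, x, y, phi, w0, punish):
    """Expected utility of one manager under the yardstick contract.

    Assumed common shock theta ~ U[0, 1]: a worker obtains the high profit
    when theta < x, a shirker when theta < y (with y < x).
    """
    cost = phi if own_work else 0.0
    reward = u(w0 + phi - cost)     # wage w0 + phi, net of effort cost
    if own_work == other_work:
        return reward               # profits always coincide
    if own_work:
        return reward               # the worker is never the low-profit manager
    p_differ = x - y                # theta in [y, x): the shirker is exposed
    return (1 - p_differ) * reward + p_differ * u(punish)


# Hypothetical parameter values chosen only for illustration.
x, y, phi, w0, punish = 0.8, 0.4, 0.5, 1.0, -5.0
args = (x, y, phi, w0, punish)

both_work_is_equilibrium = (
    expected_utility(True, True, *args) >= expected_utility(False, True, *args)
)
both_shirk_is_equilibrium = (
    expected_utility(False, False, *args) >= expected_utility(True, False, *args)
)

print("both working is an equilibrium:", both_work_is_equilibrium)
print("both shirking is an equilibrium:", both_shirk_is_equilibrium)
```

Under these assumptions a lone shirker is exposed with probability $x - y$ and a heavy enough punishment deters the deviation, while joint shirking also survives as an equilibrium, in line with the multiplicity noted in footnote 78.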


Exercise 2** In this exercise, which concerns sole sourcing 
versus dual sourcing, we build a simple model in which 
the managers' objective function is such that monetary 
incentives are rather ineffective. A firm has a project of a 
given size. The cost of the project is $C = \beta - e$. The 
variable $\beta$ is random on $[\underline{\beta}, \bar{\beta}]$ with expectation $E\beta$. The 
variable $e$ denotes the effort exerted by the manager 
assigned to the project. A manager has utility function 
$U(w, e) = u(w) - \Phi(e)$, where $\Phi' > 0$, $\Phi'' > 0$, $\Phi(0) > 0$, 
and 

(wo) = 4 
Ee are Ui SB) 


ifw<@ 
ifw> ow. 


Thus, $\bar{w}$ can be interpreted as a subsistence wage. $\lambda$ is a 
"small," positive parameter; mathematically, $\lambda < \Phi'(e)$ for 
all $e$. The manager is infinitely risk-averse over the states 
of nature $\beta$, so he is only interested in his utility in the 
worst state of nature: $\min_\beta U(w, e)$. The manager learns $\beta$ 
after signing the contract and before choosing $e$. The 
principal observes $C$, but not $\beta$ or $e$. So the wage structure 
is a function of $C$, $w(C)$, and the manager's objective can 
be written 

$$\min_{\beta} \Bigl\{ \max_{e} \bigl[ u(w(\beta - e)) - \Phi(e) \bigr] \Bigr\}.$$


Let $U_0$ denote the manager's reservation utility, and let 


76. Of course, to preserve incentives it is important to prevent the manager 
from diversifying away the risks associated with the firm's performance. Stock 
options that cannot be sold serve this purpose. 


77. The theory of yardstick competition and tournaments was developed 
by Lazear and Rosen (1981), Green and Stokey (1983), Nalebuff and Stiglitz 
(1983), and Shleifer (1985). 


78. This suggests the possibility of multiple equilibria. Indeed, with our for- 
mulation, both not working is also an equilibrium. On the issue of multiplicity 
and incentive design to prevent it, see Mookherjee 1984. 
$e^* > 0$ be defined by $u(\bar{w}) - \Phi(e^*) = U_0$. The principal 
wants to minimize the expected cost of the project. 
(i) Show that if $\beta$ and/or $e$ were observable by the 

(ii) Under asymmetric information, show that the opti- 
mal contract is 

$$w(C) \begin{cases} = \bar{w} & \text{if } C \le \bar{\beta} - e^* \\ < \bar{w} & \text{otherwise} \end{cases}$$

and that the expected cost of the project is $\bar{w} + \bar{\beta} - e^*$. 
(iii) Suppose that the project can be given to two man- 
agers. The cost to the principal (net of the wage bill) is 
$\min(C_1, C_2)$, where $C_i = \beta - e_i$ and $e_i$ is manager $i$'s effort 
($i = 1, 2$). That is, $\beta$ is the same for both managers. Show 
that an optimal contract is 


= ifG=G 


wil, G) 4 <i FG >G 


=B+ MC \(G-—CGVA ifG<G, 


It is concluded that the principal prefers dual sourcing to 
sole sourcing if and only if $\bar{w} < \bar{\beta} - E\beta$. Interpret. 


Remark The preceding example and the exercise as- 
sumed perfect correlation between the agents' technol- 
ogies. But the idea of yardstick competition carries over 
to environments with imperfect correlation (a more rea- 
sonable assumption). Indeed, Baiman and Demski (1980) 
and Holmstrom (1982) have used the Holmström-Shavell 
sufficient-statistics result (see section 3) to show that 
an agent's optimal wage structure depends only on his 
performance if and only if performances are independent. 


Remark Yardstick competition is somewhat analogous 
to the use of authority to reward agents when their per- 
formance is observable but not verifiable, in that both 
insist on a comparison of the agents' performances. How- 
ever, the two arguments differ in spirit. Yardstick competi- 
tion relies on the correlation of the agents' technologies, 
but not on the nonverifiability of performances; further- 
more, comparison can be made with outside parties, such 
as competing firms. With authority the result stems from 
nonverifiability, and does not rely on the correlation of 
the technologies, and the comparison is made within a 
group of agents overseen by a principal. 


The potential applications of yardstick competition are 
numerous. The performances of managers of divisions 
facing similar cost or demand conditions may be com- 
pared by chief executive officers. Similarly, the rewards 
of one company’s managers can be made contingent 
on the performance of a competitor's managers. More 
generally, managerial rewards can be based on average 
industry profit. The Department of Defense and many 
private firms sometimes use dual sourcing to procure their 
supplies, even in spite of the possible loss in returns to 
scale. Medicare pays hospitals a fixed fee for treating all 
patients within a diagnostically related group. The size of 
this fee is based on the average cost of treating patients 
in this group at comparable hospitals (Shleifer 1985). 

Yardstick competition also has its limits. The units
that are to be compared may face different conditions
(e.g., the correlation between the situations may be fairly
imperfect). Furthermore, their performance may be garbled
by accounting idiosyncrasies or measurement errors.
Last, the managers’ performance depends on the assets
they have inherited. (Although it may not be a problem
in theory, this effect requires yardstick competition to
be more sophisticated and, therefore, makes it more
unlikely.) This may explain why there is little yardstick
competition in the electricity-generation industry (see
Joskow and Schmalensee 1986).⁷⁹


Takeovers 


Manne (1965) and Marris (1964) have argued that failure 
to maximize profits lowers the stock value of a firm, and 
that it induces outside entrepreneurs (raiders) to buy the 
firm, replace its management, and direct the firm toward 
profit maximization. The threat of such takeovers serves 
to discipline managers. 

But why should managers be concerned with the threat 
of a takeover? For the argument to hold, managers must 
suffer greatly from the takeover. This could be true either 
because managers are immediately punished when their 


79. See Antle and Smith 1986 for an empirical assessment of the use of
yardstick competition in executive compensation.
firm is taken over (since a takeover is an indicator of poor
management) or because, by being removed, they lose
the rents they had enjoyed within the firm. The first
reason does not seem convincing; because of limited liability
and a prohibition on slavery, direct punishments
are hard to impose. Actually, far from being punished,
managers usually receive princely amounts of money
(“golden parachutes”) when they are fired after a raid.⁸⁰
The threat of losing rents attached to managerial jobs
is a more credible explanation. One such rent is prestige
or reputation. Another may be on-the-job leisure (slack)
due to the asymmetry of information between shareholders
and managers. To the extent that mismanagement
increases the likelihood of a takeover, the fear of losing
their rents may make managers less prone to slack.
Formulations of this idea have been put forth by Scharfstein
(1985a) and by Demski, Sappington, and Spiller (1987).⁸¹
Takeovers, however, have their limits. Costly information
must be collected about the firm's inefficiencies and
areas for improvement. Outsiders have an incentive to
collect information and spend the takeover costs only
if they can derive substantial profits from the takeover.
Grossman and Hart (1980) point out a potential free-rider
problem that may impair this incentive: In the event of
a takeover, a shareholder may not want to tender his
shares, because if he keeps them he can enjoy the increase
in the stock price brought about by the raid. On the other
hand, the raider can make a profit only if the tender price
of shares is lower than their post-raid price. Hence, he
cannot both buy the shares and make a profit on them.
There are ways out of the free-rider problem. Dilution
(a provision in the firm's constitution allowing a
successful raider to sell part of the firm’s assets to another
company owned by the raider at disadvantageous terms
for minority shareholders or to issue himself new shares)
is tantamount to a reward to the raider, and encourages
takeovers (Grossman and Hart 1980). Another possibility
is that the raid may be undertaken by a large shareholder
of the firm. Even if the other shareholders free-ride on him


(by not tendering their shares), a large shareholder at least 
enjoys the increase in the value of his own shares (Shleifer 
and Vishny 1986).
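
The free-rider argument and the two ways around it can be illustrated with a small numerical sketch. The function and its parameters are hypothetical and chosen only for illustration: the raider is assumed to buy every share he does not already own, small shareholders are assumed to tender only when the offer is at least what a retained share would be worth after the raid, and all values are per unit of the firm's equity.

```python
def raider_profit(v, q, cost, alpha=0.0, dilution=0.0):
    """Raider's profit from a successful takeover (illustrative parameterization).

    v        -- value of the firm after the raid (improved management)
    q        -- value of the firm under current management
    cost     -- information and takeover costs borne by the raider
    alpha    -- fraction of the shares the raider owns before the raid
    dilution -- value the raider is allowed to divert from minority shareholders
    """
    # A shareholder who keeps his share would get v - dilution after the raid,
    # so he tenders only if offered at least that much (and at least q).
    tender_price = max(v - dilution, q)
    gain_on_purchased_shares = (1.0 - alpha) * (v - tender_price)
    # The raider's initial stake appreciates from q to v.
    gain_on_initial_stake = alpha * (v - q)
    return gain_on_purchased_shares + gain_on_initial_stake - cost


if __name__ == "__main__":
    print("no stake, no dilution:", raider_profit(120, 100, 2))
    print("large shareholder    :", raider_profit(120, 100, 2, alpha=0.2))
    print("with dilution        :", raider_profit(120, 100, 2, dilution=5))
```

With no initial stake and no dilution the raider must pay the full post-raid value and loses his takeover cost; an initial stake or a dilution provision is what leaves him a positive profit in this sketch.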

These anti-free-rider factors are themselves limited. On
the one hand, because dilution is basically a gift to the
raider, incumbent shareholders may be reluctant to allow
it on a large scale. Furthermore, the U.S. courts have
restrained its use. On the other hand, the large shareholders
internalize only the increase in the value of their
shares. They do not take the positive externality on the
other incumbent shareholders into account, so their incentive
to monitor the firm and to undertake a takeover may
be too small.

Potential resistance by the current management to a
raid imposes a second limit on the efficacy of takeovers.
Managers can make the firm unattractive to the raider
through antitrust litigation or “poison pills.”⁸² If this
does not work, they can collude with the raider and buy
his shares of the firm (if any) at a substantial premium
over the market price in exchange for his signing a standstill
agreement, which prohibits him from owning shares
of the firm for a certain period of time. The other shareholders
may well be losers in such a “greenmail” maneuver,
because the takeover does not take place (management is
not removed) and the firm purchases the raider’s shares at
a high price (however, see Shleifer and Vishny 1984).
Finally, managerial resistance can be reduced (for instance,
by offering “golden parachutes” to removed managers),
but only at a substantial cost.

These effects may, to some extent, explain Scherer’s 
(1980, p. 38) observation (made before the recent wave 
of takeovers) that “the available evidence provides at 
best only weak support for the hypothesis that takeovers 
generate an effective disciplinary mechanism against de- 
partures from profit maximization.” 

A takeover threat may have perverse incentive effects.
First, it lowers the managers’ incentive to make long-run
investments, as they may not be around to reap the
benefits. That is, managers are led to behave myopically


80. In some instances the takeover may hurt the managers’ future careers by
conveying bad news about their abilities. The sanction is then imposed by the
market for managerial talent and is delayed. (For models of managerial careers,
see the next subsection.)


81. Demski et al. present their formulation in the related context of second
sourcing (i.e., the replacement of a supplier by another). See also Anton and Yao
1987 and Caillaud 1985.
82. For instance, they may purchase one of the raider's competitors, making the
raider’s takeover subject to antitrust laws. Or, by purchasing another firm (even
one not related to the raider’s activity), they may increase their own firm's debt
enough to scare off a cash-constrained raider. “Poison pills” are preferred stock
rights that are inactive unless they are triggered by a tender offer for a large
fraction of the firm. They are somewhat similar to an entry fee to be paid by
the raider.
(Laffont and Tirole 1987). Second, it destroys the managers’
job stability and exacerbates their career concerns,
which may lead to managerial decisions that are contrary
to the firm's interest (Hermalin 1987). Third, it
shortens the term of the relationship between managers
and workers, and may prevent the development of trust
between them (Shleifer and Summers 1987).


Managerial Incentives: The Dynamic Perspective


Another bound on the managers’ discretion is the concern
for their own careers, both within the firm and outside the
firm. The arguments presented here relate mainly to the
interaction between adverse selection (How efficient or
trustworthy is the manager?) and moral hazard (How hard
does he work?).

Within a firm, a manager who has performed poorly
may not be trusted to do well in the future. His prospects
for internal promotion may be weak. To formalize this,
some authors have focused on the repeated aspects of the


market wage), as long as the employee has performed
with high quality before. If the employee “cheats” and
turns in a low-quality performance, the firm stops offering
him the rent (for instance, does not promote him or does
not increase his salary). The threat of losing this rent puts
some discipline on the employee. Mathematically, the
formalization of this idea involves either the theory of
supergames or the theory of reputation under asymmetric
information (see chapters 2, 6, and 9). The idea is similar
to the one in chapter 2 where consumers stop patronizing
a firm that starts producing low-quality products.⁸³

Let us now turn to the possibility that the manager will
leave the firm. The threat of outside offers may discipline
the firm and force it to treat its managers fairly when the 
quality of their work is observable but not verifiable. 
Suppose that a manager turns in a high-quality job, which 
is a signal of the manager's ability as well as of his effort, 
but is not rewarded by a wage increase. If his performance 
is observed by other firms, the latter may be able to bid 
him away. Because of this threat, the firm must reward the 
manager fairly. Consider a professor at a university. Let
the quality of his work be the quality of his research and
the richness of his interactions with colleagues and students.
Now, this quality may be observable by the profession
but hard for a court to assess.⁸⁴ His threat to move
to another university tends to equalize his position within
and outside his university. This idea, pioneered by Fama
(1980), was studied further by Holmström (1982a), who
combined the distinction between observability and verifiability
with the outside pressure mechanism.

The possibility of getting good opportunities outside 
his present firm, like that of being rewarded within the 
firm, certainly gives a manager incentives to perform 
satisfactorily. When the manager's internal or external 
supervisors observe only his performance, any amount of 
shirking on his part may be mistaken for a lack of trust- 
worthiness or ability, and thus may hurt his career. 
Holmström shows that, in such circumstances, a manager
at the beginning of his career may work even harder than
is socially optimal.⁸⁵


Remark Two other mechanisms that enable a firm to 
reward its employees’ observable-but-unverifiable performance
by fiat, but fairly, have already been described.
One was based on the firm's commitment to reward a 
given percentage of its employees; the other was based 
on the firm’s reputation. The outside pressure mechanism 
may work better for top managers than for other em- 
ployees, because the former have more outside visibility 
than the latter. 


83. For elaborations of this idea in the context of internal organization or
business relations, see Telser 1980, Kreps 1984, Bull 1985, Hart and Holmström
1987, and MacLeod and Malcomson 1986. In the context of business relations,
Kreps posed the further question of why transient managers of a firm would
want to sustain the firm's reputation. Cheating on an implicit agreement yields
a current benefit, whereas some of the future losses associated with the loss of
reputation will be incurred after the managers have left the firm. Kreps notes
that if the managers own the firm, they internalize the full effects of their
decisions, because the value of the firm reflects future as well as current profits.
84. Of course, the courts might hire experts in the profession to assess the
quality (which they sometimes do). But the legal costs would generally be
incommensurate with the issue at stake.


85. Gibbons (1985) shows that it may be optimal to reduce the manager's 
outside visibility at the beginning of his career in order to reduce this effect. He 
argues that the junior members of a law firm often have such a reduced 
visibility. Wolfson (1985) demonstrates that a reputation effect alleviates the 
moral-hazard problem somewhat in oil drilling ventures. 
Supervision


Except in the discussion of takeovers, we have mainly 
taken the principal's information as given and studied the 
optimal use of this information to control the agent. We 
now turn to the internal incentive to monitor. 

Alchian and Demsetz (1972) argue that nonseparabili- 
ties or increasing returns to scale are essential to the 
understanding of organization design. The product of a 
team exceeds the sum of the products of its members in 
isolation; however, Alchian and Demsetz note, team pro- 
duction may hinder the metering of productivity and 
rewards. For instance, unless each worker is accountable 
for a particular part of production (as is the case with 
piecework), accounting data measure only the output of 
the shop. In one of their examples, Alchian and Demsetz
show that it may be hard to disentangle the relative
performances of two workers who jointly lift heavy cargo
into trucks. At a more aggregate level, a high level of
sales of a product may be due to a good design, to the
high quality of the product, or to an appropriate marketing
campaign, and there may be no clean measure of each
functional division's contribution.

Nonseparabilities create a free-rider problem among
the members of a team. Suppose, for instance, that two
employees in a team share equally every extra dollar the
team creates. Then each employee, when generating $1
for the team, receives only 50¢. This means that each
employee has too small an incentive to contribute to the
team’s production. One solution is to increase the monetary
incentives further (see the remark below). The other
solution is the introduction of a third party (a monitor or
supervisor) to measure the individual performance of
each employee. How is the monitor monitored? What are
his incentives to supervise the employees? Alchian and
Demsetz suggest that the monitor be given a title to
the net earnings of the team (net of payments to other
inputs). In the jargon of incentives theory, the supervisor
is made the residual claimant or sink. At the margin, he
captures any extra profit of the team. He thus has a
strong incentive to exert effort to measure the employees’
individual performances.
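
To see the size of the distortion in the 50¢ example, here is a small worked case. The additive output and the quadratic cost of effort are assumptions made only for illustration; neither functional form appears in the text.

```latex
\begin{align*}
&\text{Team output: } x = e_1 + e_2, \qquad \text{cost of effort: } c(e_i) = \tfrac{1}{2} e_i^2 . \\
&\text{Efficient effort maximizes } e_1 + e_2 - c(e_1) - c(e_2): \quad
  c'(e_i) = 1 \;\Rightarrow\; e_i = 1 . \\
&\text{Under equal sharing, employee } i \text{ maximizes } \tfrac{1}{2}(e_1 + e_2) - c(e_i): \quad
  c'(e_i) = \tfrac{1}{2} \;\Rightarrow\; e_i = \tfrac{1}{2} . \\
&\text{Total surplus: } 2 - 2 \cdot \tfrac{1}{2} = 1 \text{ (efficient)} \quad \text{versus} \quad
  1 - 2 \cdot \tfrac{1}{8} = \tfrac{3}{4} \text{ (equal sharing)} .
\end{align*}
```

In this example each employee supplies half the efficient effort, because he bears the full cost of his effort but keeps only half of what it produces.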


In summary: Alchian and Demsetz’s organization (or
“firm”) is a particular policing device utilized in the presence
of team production. Indeed, among the bundle of
rights associated with ownership of a firm, Alchian and
Demsetz list the right to be residual claimant and to
observe the behavior of employees.⁸⁶


Remark Holmström (1982b) offers an alternative theory
of the firm based on team production and the impossibility
of measuring individual performance. He argues that
one of a corporation's roles is to break the rule that the
benefits of an organization must be split among employees
(the way it is done in a partnership). The idea is
as follows: To face the right incentives in his production
decisions, an employee must be the “residual claimant”
for these decisions; if a decision yields one extra dollar to
the firm or unit, the employee must receive that extra
dollar. Consider a two-employee unit, and suppose that
only the unit's performance (equal to the sum of the two
employees’ performances) is observable. Then, each time
the unit’s profit grows by $1, each employee must be
given $1. The marginal distribution of profits must be $2
for each extra dollar earned by the unit. This is possible
only if there exists a “source” (shareholders? other units?)
that “breaks the budget constraint” at the margin.⁸⁷ The
Holmström and Alchian-Demsetz theories differ except
for the starting point (the impossibility of disentangling
individual contributions at an accounting level) and for
the conclusion that the budget constraint of a team must
be broken. Alchian and Demsetz rely on supervision by a
third party who is made a sink in order to give him
the incentive to monitor (at the margin, the employees
receive $0 for every $1 they create; i.e., they have a fixed
wage). In contrast, Holmström disciplines team members
through monetary incentives provided by a marginal
source.
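
The budget-breaking logic of the remark can be checked numerically. This is a minimal sketch under assumed functional forms that are not in the text: the unit's output is the sum of the members' efforts, and each member's cost of effort is e**2 / 2, so a member who keeps a fraction `share` of every extra dollar chooses effort equal to `share`. The function name and the returned keys are hypothetical.

```python
def team_outcome(share, n=2):
    """Outcome of an n-member unit when each member keeps `share` of each extra dollar.

    Illustrative assumptions: output = sum of efforts, effort cost = e**2 / 2,
    hence each member's first-order condition is share = e.
    """
    effort = share                          # individually optimal effort
    output = n * effort                     # the unit's total output
    surplus = output - n * effort ** 2 / 2  # output net of total effort costs
    marginal_payout = n * share             # dollars paid out per extra dollar of output
    source_subsidy = marginal_payout - 1.0  # what a "source" must add at the margin
    return {
        "effort": effort,
        "output": output,
        "surplus": surplus,
        "marginal_payout": marginal_payout,
        "source_subsidy": source_subsidy,
    }


if __name__ == "__main__":
    print("partnership (equal split):", team_outcome(0.5))
    print("budget breaking          :", team_outcome(1.0))
```

In the equal-split case the payouts add up to exactly the extra dollar, so no outside party is needed but effort is too low; giving each of the two members the full dollar restores efficient effort only because a source contributes the second dollar at the margin.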


Williamson (1975, p. 49) argues that the importance of
nonseparabilities in production should not be overstressed.
Because a monitor can supervise only a limited number of
employees, the theory can at best explain the organization
of small groups. In large firms, supervision must be
delegated.⁸⁸


86. Other listed rights are those to be the central party common to all contracts
with inputs, to alter the membership of the team, and to sell these rights.


87. For an argument that follows Holmström's, see exercise 4.4. Note that the
existence of a source requires that the employees be unable to collude against this
source. Because every time an employee increases output by $1 the other
employee also receives $1, the employees have an incentive to get together to
expand production beyond its noncooperative level.

This brings us to an interesting class of hierarchical
models, which were pioneered by Williamson (1967) and
by Calvo and Wellisz (1978, 1979) and developed further
by Rosen (1982) and by Keren and Levhari (1983). Suppose
that the firm is organized according to a pyramidal
structure, as shown in figure 4. Level 1 is the productive
tier of the firm (the workers). Level-1 employees are
supervised by level-2 employees, presumably because of
nonseparabilities in production. A level-2 employee does
not supervise all level-1 employees, because the quality
of supervision decreases with the span of supervision.
In turn, level-2 employees’ incentives to supervise are
provided by level-3 supervision. Level 3 is assumed to be
composed of a single agent (or unit), who is the residual
claimant for the firm’s profit net of wage and input payments.
For instance, level 3 can be the shareholders
(respectively, the executive officers) and level 2 the
executive officers (respectively, the division officers). An
attractive property of these models is that the horizontal
and vertical sizes of the firm are not fixed. The span of
control of each employee and the number of tiers are
supposed to be chosen by the top tier to maximize profits.
What may then put a lid on the firm’s size is the
deterioration of supervision as the firm grows. More
workers require more level-2 supervisors, and the supervision
of level-2 employees by the top tier worsens.
(Alternatively, an extra layer can be created between the
top tier and level 2; but this addition is costly.) Calvo and


Wellisz also derive some interesting implications for the 
wage structure within the firm from this type of model. 

In Tirole 1986b it is argued that the exercise of super- 
vision and authority in an organization is limited by the 
possibility of collusion between groups of its members. 
An employee obtains power over another when given the 
right to evaluate the other's performance (supervision) or 
to make decisions in unforeseen contingencies that affect 
him (authority). This power gives rise to the possibility of 
collusion between the two parties, which is enforced by 
side-transfers (promise of a similar counter favor, money, 
evolution of their personal relationship, etc.). Organiza- 
tional design is then partly geared toward preventing 
collusion by recognizing a supervisor's incentive to act as 
an advocate for rather than as a prosecutor of the super- 
visee, and by limiting authority through the imposition of 
bureaucratic rules.


Product-Market Competition 


As Scherer (1980, p. 38) notes, 


When forced into the trenches on the question of whether firms 
maximize profits, economists resort to the ultimate weapon in 
their arsenal: a variant of Darwin's natural selection theory. 
Over the long pull, there is one simple criterion for the survival 
of a business enterprise: Profits must be nonnegative. No matter
how strongly managers prefer to pursue other objectives, and 
no matter how difficult it is to find profit-maximizing strategies 
in a world of uncertainty and high information costs, failure to 
satisfy this criterion means ultimately that a firm will dis- 
appear from the economic scene. 


For instance, Winter (1971) suggests that a competitive 
firm that makes inefficient decisions (about techniques, for 
instance) incurs losses because it cannot simply transmit 
its extra costs to the consumers (the market price is taken 
as given). The firm is thus led to search for new and better 
decisions in order to survive. Hence, firms in a competi- 
tive environment are more hard-pressed to reduce costs, 
and end up being more efficient. This is clearly a rather 
realistic story; however, as Hart (1983) notes, “a question 
which Winter does not analyze, at least formally, is why 
firms choose inefficient techniques in the first place.” The 


88. Indeed, most of the supervisors in organizations have only limited stakes
in them. In particular, they are usually far from being the residual claimants for
the teams they supervise.
answer to this question may lie in the problems of delegation
and monitoring mentioned above.⁸⁹

The effects of the threat of bankruptcy and of competi- 
tive pressure on managerial incentives have not yet been 
formulated in a satisfactory way. However, some of the 
effects of competitive pressure on incentives (not those 
related to the survival issue) have been studied. 

One obvious effect stems from the possibility of yard- 
stick competition. The shareholders of a competitive firm 
can base managerial rewards on the competitors’ profits 
or on the market price, which would not be possible were 
the firm a monopoly. 

The effect of product-market competition on internal 
incentives becomes more subtle when outside data, such 
as competitors’ profits and the market price, are not avail- 
able to the firm. Hart (1983) has shown that the form of 
competition in the product market still has an influence on 
internal control when the owners of the firm observe only 
that firm’s performance.⁹⁰

In Hart's model, competition operates through the 
variability of the market price, which, together with a 
cost shock, determines the variability of the firm's profit 
and therefore determines the extent to which the firm's 
managers can manipulate the owners’ uncertainty in order 
to engage in slack. Roughly, a profit that is exogenously 
more variable leaves more leeway for misrepresentation 
by the managers and less opportunity for control by the 
shareholders. Hart considers a competitive industry with
two types of firms: managerial firms (in which the shareholders
delegate the decision power to managers and
are therefore exposed to the above-mentioned control
hazards) and entrepreneurial firms (which are run by the
entrepreneurs themselves).⁹¹


Intuitively, when the marginal cost of production (which 
is perfectly correlated across the firms) is low, entre- 
preneurial firms expand their output. The managers of 
the managerial firms, if they do not respond much to 
monetary incentives, take advantage of the good times to 
slack. If the proportion of entrepreneurial firms increases, 
the output is thus more sensitive to a cost decrease. This 
reduces the market price in good times, thus mitigating 
the influence of the low cost on profit. In particular, the 
managerial firms’ profits become less sensitive to outside 
uncertainty when the proportion of entrepreneurs grows. 
This makes their control by shareholders easier, which 
results in less slack. To the extent that entrepreneurs 
are a symbol of competition,⁹² more competition in the
output market leads to less slack in managerial firms. 
Unfortunately, this result is sensitive to the description 
of the managers’ utility function, as Scharfstein (1985b) 
shows. If managers do react to monetary incentives suffi- 
ciently, a larger proportion of entrepreneurs increases 
slack in managerial firms.⁹³


An Application: The M-Form Firm 


The analysis of managerial incentives may shed some 
light on the emergence of the so-called multidivisional-form
(M-form) firm.⁹⁴

Recall from section 1 that the technologically rational 
organization of a firm is the unitary form, with its special- 
ization by function. However, this type of organization 
collapsed with the horizontal expansion of firms. According
to Chandler (1966), this collapse is mainly due to
the loss of control by the top management. This might be 
explained as follows. To control the functional divisions, 


89. Another possibility, unrelated to the issue of incentives, is that firms must
try techniques before learning their efficiency. Hence, a firm may enter a
market, discover it has inherited an inefficient technique, suffer losses, and exit
the market. In the long run, only firms that have developed efficient techniques
survive. Jovanovic (1982), Lippman and Rumelt (1982), and Hopenhayn
(1986) derive some interesting conclusions from such dynamic competitive
selection models.


90. This material is more advanced than the rest of this chapter. The reader 
may wish to omit it on the first reading.


91. The division between the two types of firms is explained by a fixed cost of
becoming an entrepreneur.


92. Note, however, that no firm has market power. 


93. Readers familiar with the principal-agent literature will understand the
following explanation: Obtaining separation between the different states of
nature (i.e., obtaining variations of one's firm’s profits when the cost parameter
varies) is very costly to the shareholders if the managers respond little to
monetary incentives (because much money must then be spent to achieve
separation). In particular, as in Hart's model, when the managers are infinitely
income-risk-averse, they are best required to reach a profit target, i.e., to
bunch (reach the same level of profit in all states of nature). Bunching means
that in good states of nature the managers slack more. In contrast, when
managers do react to monetary incentives to a decent extent, the optimal
incentive scheme leads to separation. In that case, as is well known, there is no
distortion of effort in the best state of nature (at least if the disutility of effort
has a monetary representation, that is, $U = u(w - R(e))$), and there is slack in
worse states of nature. This reverses the result.


94. The following is but a naive attempt to promote incentive theory as a 
partial explanation of the M form. For a richer approach see Chandler 1966 and 
Williamson 1975. 
the top management can basically use one of two methods:
rewarding each functional division for good performance
(i.e., basing incentive schemes on output) and supervising
the divisions directly in order to assess individual contributions
(i.e., to measure inputs). The first method clearly
faces the accounting problem of separating the contributions
of the various divisions. The sales of a product or
the profits of the firm depend on the quality of each
division’s performance, which may be hard to measure.
This gives rise to an Alchian-Demsetz-type team problem.
The second method can be employed only if the firm
is small. A loss in the supervisory possibilities of top
management may have accounted for the difficulties of
the unitary-form firm in an expansionary phase.

The multidivisional form emerged in the 1920s and
became prevalent after World War II. It consists of
organizing the firm into quasi-firms resembling “scaled
down, specialized U-form structures” (Williamson 1975,
p. 136). These divisions are defined by product, brand,
or geographic lines, and are fairly autonomous. See, for
example, figure 5.

In the M-form firm, a relatively precise measure of divisional
performance is available. And, indeed, the role of
the general office (top management) is to audit and allocate
resources among the competing divisions.⁹⁵ Within a
division, by contrast, the supervisory mode is more prevalent.
And that allows some assessment of the relative
contributions of functional subdivisions.

This rough analysis leaves an important question unanswered:
If divisions are quasi-firms, why should they
be organized within a single structure? Why aren’t they
separate legal entities? Indeed, Williamson views the M-form
firm as a “miniature capital market”; however, he
argues further that the general office has better auditing
capabilities, as well as better takeover capabilities, than 
the capital market (1975, pp. 146-148). His argument is 

to incomplete contracts. 
2.3. Doubts about the Neoclassical Methodology 


Progress made since the early 1970s has shown that the
neoclassical firm is not the unrealistic, profit-maximizing
entity that organization theorists ridiculed in the 1950s
and the 1960s. Still, the neoclassical theory leaves many
questions unanswered, and this raises some doubts about
its ability to cope with certain complex organizational
phenomena. Simon 1976 and chapters 3 and 5 of Nelson
and Winter 1982 are particularly instructive in this respect.
Among the neglected topics are the following.


Figure 5
An example of the multidivisional form (general office and staff;
operating divisions A, B, and C; manufacturing, sales, finance, and
engineering functions). Source: Williamson 1975, p. 138.


Optimizing Behavior 


The neoclassical theory presumes that members of an 
organization act so as to maximize an objective function 
of a few standard variables (such as income and effort). 
This raises two questions: Does their objective include 
other variables (prestige, ego, power, number of subor- 
dinates, size of the budget, conviviality, friendship, etc.)? 
Do they even have a well-defined objective function, or 
do they use rules of thumb or “satisfice”? 

Concerning the first issue (arguments of the objective 
function), we must recognize that a number of goals other 
than income and effort seem to mobilize the members’ 
energy. Of course, it is interesting to wonder whether 
these enter directly into the member's utility function 
or whether they are intermediate objectives that help 
achieve the primitive goals. As was mentioned earlier, it 
might be the case that a manager enjoys a large number 


95. Other roles include advisory services and strategic planning. The important
thing to note is that the top management is removed from the routine operational
activities, including, to some extent, the direct supervision of divisions.
of subordinates not per se but rather because this acts as a
signal to the labor market (a manager in charge of an
important division is likely to be an able manager) or
because it allows for smoother operation and thus reduces
on-the-job pressure (effort). The economist faces a familiar
dilemma here. An increase in the number of explanatory
variables (arguments of the objective function) makes it
easier to explain real-world phenomena. At the same
time, the theory loses predictive power: By adding
enough arguments, one can always “explain” any kind of
behavior. By contrast, the restriction to a small number
of predetermined primitive variables restores some discipline.
How “inspired” the neoclassical economists have
been in their choices of primitive variables is not yet
known.

The second issue (the possibility that economic agents
do not optimize) is, of course, a matter of concern.
Indeed, members of an organization often use rules of
thumb instead of performing complex computations. However,
many behaviors that look nonoptimizing may actually
be the outcomes of optimization under constraints,
and therefore may be not at all irrational. For instance, a
member generally does not have time to sample all the
information that is relevant to decision making; he may
thus make decisions that, viewed with better information,
look irrational. However, the decisions may be rational,
given the shadow price of the time necessary to gather
the information. Similarly, the time and effort required
to compute optimal decisions in complex problems are
traded off against inefficiencies in decision making. It
remains to be seen how successful such “rational explanations”
of bounded rationality will be.⁹⁶


Communication and Knowledge


Neoclassical theory pays only lip service to the issue of
communication. Information flows between members of
an organization are limited only because of incentive
compatibility. A member keeps his information private
when he would suffer from its becoming public. Private
incentives certainly limit information flows considerably.
However, even well-intentioned members of an organization
(i.e., members who do not manipulate information to
their advantage) may have trouble communicating all the
information they possess to their relevant co-members,
because it is too time consuming or because the information
is hard to “codify” (see Arrow 1974) to make it
understandable to its receivers. Thus, decisions that would
be profit maximizing under full communication will not
be made under imperfect communication.⁹⁷

This brings us to the notion of knowledge. Several 
contributions (among the most recent Nelson and Winter 
1982, Kreps 1984, and Cremer 1986) have endowed a 
firm with a stock of knowledge that guides the organi- 
zation’s members in their decisions and coordination in a 
world of imperfect communication: 


The context of the information possessed by an individual 
member is established by the information possessed by all other 
members... To view organization memory as reducible to 
individual member memories is to overlook, or undervalue, 
the linking of those individual memories by shared experiences 
in the past, experiences that have established the extremely 
detailed and specific communication system that underlies rou- 
tine performance. (Nelson and Winter 1982, p. 105) 


Organizational culture is the pattern of basic assumptions that
a given group has invented, discovered or developed in learning
how to cope with its problems of external adaptation and
internal integration, and that have worked well enough to be
considered valid, and, therefore, to be taught to new members
as the correct way to perceive, think, and feel in relation to
these problems. (Schein 1984, p. 3)


Organizational memory helps the members to find 
relatively satisfactory decisions in the presence of com- 
plex decision making (“satisficing” or individual bounded 
rationality) and to coordinate their respective actions in 
the absence of perfect communication (collective bounded 
rationality). 


Dynamics of Organizations 


Neoclassical theory has focused on the optimal design of 
an organization at a given point of time. Little has been 
said about reorganizations. Because most reorganizations 


96. An important area where this reformulation has been fairly successful is
search theory, which explicitly formalizes the trade-off between costly acquisition
of information and (ex post) inefficient decision making.


97. Neoclassical theory has recently made some progress in the formalization
of these issues (Geanakoplos and Milgrom 1984) and their consequences (Sah
and Stiglitz 1985), but much work remains to be done.

Marschak and Radner (1972) analyze optimal decision rules that coordinate
members of an organization under imperfect communication.
are, in general, not specified in the initial organizational
[...] they are much influenced by the authority and
[...]ing relationships at the time of the reorganization.


organization theorists (e.g., Cyert and March [1963]) have
emphasized that organizational behavior is often best
predicted by the analysis of group incentives as well as
individual incentives. In contrast, neoclassical theory has
tended to focus only on individual incentives.⁹⁸


The reader will have perceived in this discussion of
the current lacunae of the neoclassical approach much
intellectual imperialism. Most of it conveys the following
[...]: “Sociologists and organization theorists are right

[...] that neoclassical theory has not tackled some [...]


Of course, this is a pure act of faith, and much of the
controversy may revolve around whether the neoclassical
approach is indeed able to cope with these aspects.


2.4 The Profit-Maximization Hypothesis and 
Industrial Organization 


As has been noted, there are many ways in which discretion
can be curbed. However, none of them is perfect, and
we should expect some possibly important deviations
from profit-maximizing behavior.⁹⁹ Does this mean that
the profit-maximizing models of this book are fundamentally
flawed? Not necessarily. Consider the familiar


where $q$ is the firm's output, $P(\cdot)$ is the inverse demand
function, $w$ is the manager's wage, and $c$ is the unit cost
(a function of the manager's effort $e$ and some random
variable $\varepsilon$). Suppose that the shareholders observe every
variable except $e$ and $\varepsilon$. In particular, they observe the
realization of the unit cost $c$. This unit cost plays the same
role that the profit variable plays in examples 1 and 2 of
subsection 2.1. It can easily be shown that the optimal
wage structure, $w(c)$, is based solely on the realization of
$c$, because $c$ conveys all the relevant information about
effort (mathematically, it is a sufficient statistic for effort).
Now, we know from subsection 2.1 that if the manager is
risk-averse, the effort $e$ induced by the optimal wage
structure differs from $e^*$, the optimal level that would be
obtained if effort were observable by the shareholders.
The firm is profit-maximizing only in a constrained sense,
and the manager engages in X-inefficiency ($e \neq e^*$) in the
sense of Leibenstein 1966.

However, given the effort $e$ induced by the optimal
wage structure, $w(\cdot)$, let $\tilde{c} = c(e, \varepsilon)$ denote the resulting
random cost. The shareholders' expected profit is

$$E\Pi = P(q)q - (E\tilde{c})q - Ew(\tilde{c}).$$

Because the choice of $q$ does not change the managerial-control
problem (recall that $c$ is a sufficient statistic), the
shareholders (or the manager) might as well pick $q$ so as
to maximize $P(q)q - (E\tilde{c})q$. Hence, for an outside observer,
the firm's behavior is observationally equivalent to
that of a firm that does not engage in X-inefficiency, but
has a cost distribution $\tilde{c}$ for the efficient (full-information)
level of effort. So never mind that the distribution of the
unit cost is biased toward high levels because of informational
asymmetries; the monopoly-pricing model developed
in chapter 1 is still valid.

This is only an example; it is not meant to convey
the impression that this “separability” between internal
organization and product-market or input-market decisions
is the rule.¹⁰⁰ Indeed, one of the most exciting
research agendas in industrial organization for years to

98. According to Nelson and Winter (1982, p. 56): “Although the business
press frequently reports the internal policy struggles of large firms in a manner
that clearly involves informal use of a coalition model, there is little scholarly
literature in economics that takes this perspective. The proposals of March
[1962] and Cyert and March (1963) have been largely ignored.”


99. Scherer (1980, p. 41) concludes: “... assuming profit maximization provides
a good first approximation in describing business behavior. Deviations, both
intended and inadvertent, undoubtedly exist in abundance, but they are kept
within more or less narrow bounds by competitive forces, the self-interest
of stock-owning management, and the threat of managerial displacement by
important outside stockholders and takeover raiders.”


100. For a counterexample, see the long-purse story in chapter 2.
come is the determination of the scope and importance of
such interactions. However, this author feels and hopes 
that many of the conclusions of the theory of industrial 
organization will remain valid (at least at a descriptive 
level) when the profit-maximization postulate is aban- 
doned for a full-fledged model of internal organization. 


3 Supplementary Section:
The Principal-Agent Relationship¹⁰¹


The purpose of this section is to introduce the moral- 
hazard issue in a more general framework than the one in 
the text. 

Suppose that shareholders are risk-neutral. Their objective
function is equal to the expected gross profit of
the firm minus the expected wage payment. For ease of
exposition, assume that there is a unique manager. This
manager makes an unobservable decision $e$ in an interval
$[\underline{e}, \bar{e}]$. This decision will be interpreted as an effort level,
but more generally it could be any discretionary or moral-hazard
variable (perquisites, care, etc.); it is not observable
by the shareholders. Given $e$, the realization of the profit
depends on the realization of a random variable $\varepsilon$: $\Pi(e, \varepsilon)$.
Presumably, $\Pi$ increases with $e$. The shareholders observe
only the profit level and reward the manager according to
a wage function of the only observable variable: $w(\Pi)$.
Thus, the shareholders' objective function is

$$E[\Pi(e, \varepsilon) - w(\Pi(e, \varepsilon))].$$

The manager's objective function is the expectation of
his utility. The latter depends on the monetary reward
and on the level of effort exerted: $U(w, e)$. Presumably,
$U$ increases with $w$ and decreases with $e$. Assume also
that $U$ is concave in $w$ (income risk aversion). Thus, the
manager's objective function is

$$E\,U(w(\Pi(e, \varepsilon)), e).$$

(Hereafter, all expectations are taken with respect to $\varepsilon$.)

In the traditional principal-agent framework, the shareholders
design a wage contract $w(\cdot)$. There exists ex ante
a competitive supply of identical managers, with some
reservation utility $U_0$ (the expected utility that they would
obtain by working somewhere else). The shareholders are
able to fill the managerial position only if the highest
possible expected utility for a manager over all potential
levels of effort exceeds $U_0$. In the jargon of incentive
theory, the manager's “individual rationality” or “participation”
constraint,


101. This supplementary section is partly inspired by part 1 of Hart and
Holmström 1987.
$$\max_e \, E\,U(w(\Pi(e, \varepsilon)), e) \ge U_0, \tag{8}$$


[...] maximizes $E\,U(w(\Pi(e, \varepsilon)), e)$ over all $e$. (9)


Now consider the shareholders' problem: Choose a wage
structure $w^*(\cdot)$ and induce a level of effort $e^*$ for the


subject to constraints 8 and 9.

Solving this problem is, in general, a complex task.¹⁰²
There are two polar cases, however, in which the solution
is straightforward.

Before exposing those cases, it is useful to derive the
solution in the case in which $e$ and $\varepsilon$ would be observable
(full information). Because the shareholders observe $e$,
they can impose any level they want consistent with the
manager's participation (by threatening a large penalty if
the manager does not conform). Thus, the only relevant
constraint is 8. Now, from the theory of optimal insurance,
for a given $e$, the risk-neutral shareholders should
give a constant wage $w$ to the manager, i.e., should give
him full insurance.¹⁰³ The choice of the level of effort is, in
general, slightly more complex. Let us simply assume that
the optimal effort under full information is not $\underline{e}$ (the
lowest possible effort for the manager).

Let us now return to our asymmetric-information framework,
in which the shareholders observe only the profit.
Obviously, because the contract is contingent on fewer
variables, the shareholders have less control and therefore
can expect, at most, their expected full-information profit.
(The allocation they induce by using a small number of
observables can always be duplicated with a larger one by
dropping the extra observables.)

The first polar case is that of an income-risk-neutral
manager. In this case,

$$U(w, e) = w - \Phi(e).$$

The term $\Phi(e)$ represents the disutility of effort expressed
in terms of money. The shareholders' objective function
can then be written as

$$E(\Pi - w) = E\Pi - E\,U(w, e) - \Phi(e) = [E\Pi(e, \varepsilon) - \Phi(e)] - U_0,$$

where use is made of the manager's risk neutrality and
of the individual rationality constraint (which can easily
be shown to be binding in this case). Let $e^*$ optimize
$E\Pi(e, \varepsilon) - \Phi(e)$ (the expected profit net of the disutility
of effort). By definition, $e^*$ is the optimal effort under
full information (because under full information only the
individual rationality constraint matters). Thus, under full
information the shareholders' net profit is

$$E\Pi(e^*, \varepsilon) - \Phi(e^*) - U_0.$$


Under asymmetric information, suppose the shareholders
offer to sell the firm to the manager at price
$p = E\Pi(e^*, \varepsilon) - \Phi(e^*) - U_0$. If the manager accepts, he becomes the
residual claimant for the firm's profit. The sale of the
firm to the manager is formally equivalent to an incentive
scheme in which the shareholders would remain the
claimants for the firm's profit but would pay a wage
$w(\Pi) = \Pi - p$. To see whether the manager accepts,
look at his objective function:

$$\max_e \, [E\Pi(e, \varepsilon) - \Phi(e) - p] = U_0.$$


So the manager accepts, and the shareholders make ex- 
actly the same expected profit as under full information. 
As is explained above, the intuition for this result is 
that the manager faces an incentive scheme that exactly 
reflects the vertical structure’s objective. Thus, he chooses 
the full-information level of effort. The potential draw- 
back is that the manager may bear all the risk. That would 
not matter, however, because the manager is risk-neutral. 
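
The residual-claimant argument can be checked numerically. The following is a minimal sketch under assumptions that are not in the text: expected profit equal to effort, a quadratic disutility of effort, a reservation utility of 0.1, and a grid search in place of calculus. All names are illustrative.

```python
# Illustrative primitives (assumptions, not from the text):
#   E Pi(e, eps) = e,  Phi(e) = e^2 / 2,  U0 = 0.1

def expected_profit(e):
    return e

def disutility(e):
    return 0.5 * e * e

U0 = 0.1
EFFORT_GRID = [i / 1000 for i in range(2001)]  # effort levels in [0, 2]

def best_effort(objective):
    """Grid-search maximizer of objective(e)."""
    return max(EFFORT_GRID, key=objective)

# Full information: e* maximizes expected profit net of the disutility of effort.
e_star = best_effort(lambda e: expected_profit(e) - disutility(e))
full_info_profit = expected_profit(e_star) - disutility(e_star) - U0

# Asymmetric information: sell the firm at price p, i.e. pay the wage w(Pi) = Pi - p.
price = full_info_profit
e_manager = best_effort(lambda e: expected_profit(e) - disutility(e) - price)
manager_utility = expected_profit(e_manager) - disutility(e_manager) - price

print(e_star, e_manager, full_info_profit, manager_utility)
```

In this sketch the manager picks the full-information effort on his own and ends up exactly at his reservation utility, so the shareholders collect the full-information profit through the sale price.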

The other polar case is that of an infinitely income-risk- 


102. The pioneers in this area were Wilson (1968, 1969), Ross (1973), Mirrlees
(1974, 1975), Harris and Raviv (1978), Holmström (1979), Shavell (1979), and
Grossman and Hart (1983). Very few general results can be obtained in the
absence of more specific assumptions (see Grossman and Hart 1983).


103. To prove this, fix $e$ and let $\tilde{\Pi} = \Pi(e, \varepsilon)$ denote the random profit. Let
$u(w) = U(w, e)$. The shareholders maximize over $w(\tilde{\Pi})$ their objective function

$$E(\tilde{\Pi} - w(\tilde{\Pi}))$$

subject to the constraint $E\,u(w(\tilde{\Pi})) \ge U_0$. Forming the Lagrangian for this
problem and taking the first derivative yields $u'(w(\tilde{\Pi})) = \text{constant}$, so the
wage is independent of profit.
averse manager. Such a manager prefers a random wage
$\tilde{w}_1$ and effort $e_1$ to a random wage $\tilde{w}_2$ and effort $e_2$ if
and only if $\min \tilde{w}_1 > \min \tilde{w}_2$, or $\min \tilde{w}_1 = \min \tilde{w}_2$ and
$e_1 < e_2$. That is, the manager is concerned primarily with
his minimum wage, but in case of a tie he prefers the
allocation with the lowest level of effort. Suppose further
that the distribution of $\Pi$ given $e$ has support equal to an
interval $[\underline{\Pi}, \bar{\Pi}]$ whatever $e$, so that the distribution of $\Pi$ but
not its support moves with $e$.¹⁰⁴ Then, whatever his effort,
the manager's lowest possible wage is effort-independent,
so he chooses $e = \underline{e}$. In this case, no incentive can be
given to the manager. He is given a constant wage, the
value of which is chosen to make the individual rationality
constraint binding given effort $\underline{e}$.

The case of a general managerial utility function $U(w, e)$
is more complex. Let us simplify the model by assuming
that the utility function is separable in income and effort:
$U(w, e) = u(w) - \Phi(e)$. Assume that $u' > 0$, $u'' < 0$,
$\Phi' \ge 0$, $\Phi'' > 0$, $\Phi'(0) = 0$, and $\Phi'(\infty) = \infty$. We will also
use the “parametrized distribution formulation of uncertainty”
(pioneered by Mirrlees [1974] and Holmström
[1979]), according to which the cumulative distribution of
$\Pi$ given $e$ is described by a cumulative distribution function
$F(\Pi; e)$ on $[\underline{\Pi}, \bar{\Pi}]$, with density $f(\Pi; e) > 0$. These
functions are assumed to be differentiable in effort. That
effort increases profit (stochastically) is formalized by the
first-order stochastic dominance relation on $(\underline{\Pi}, \bar{\Pi})$:

$$e_1 > e_2 \Rightarrow F(\Pi; e_1) < F(\Pi; e_2);$$

that is, the distribution for $e_1$ puts more weight on
the upper tail. For a differentiable cumulative distribution
function, this means that $F_e(\Pi; e) < 0$.

The manager's optimization problem for a given incentive
scheme $w(\cdot)$,

$$\max_e \left( \int_{\underline{\Pi}}^{\bar{\Pi}} u(w(\Pi)) f(\Pi; e)\, d\Pi - \Phi(e) \right),$$

yields the first-order condition

$$\int_{\underline{\Pi}}^{\bar{\Pi}} u(w(\Pi)) f_e(\Pi; e)\, d\Pi - \Phi'(e) = 0. \tag{10}$$

Of course, this condition is not sufficient for optimality of
effort; the second-order condition must also be satisfied if
a maximum is to be obtained. For the moment, ignore the
second-order condition. If the optimal solution for the
shareholders found by ignoring the second-order condition
for the manager can later be shown to satisfy the second-order
condition, then it is truly optimal.

The manager must also be willing to participate:

$$\int_{\underline{\Pi}}^{\bar{\Pi}} u(w(\Pi)) f(\Pi; e)\, d\Pi - \Phi(e) \ge U_0. \tag{11}$$

The “first-order approach” for the shareholders consists
of finding a wage structure $w(\cdot)$ and an effort level $e$
that maximize

$$L = \int_{\underline{\Pi}}^{\bar{\Pi}} \Big\{ [\Pi - w(\Pi)] f(\Pi; e) + \lambda [u(w(\Pi)) - \Phi(e) - U_0] f(\Pi; e) + \mu [u(w(\Pi)) f_e(\Pi; e) - \Phi'(e) f(\Pi; e)] \Big\}\, d\Pi,$$

where $\lambda$ and $\mu$ are positive (they are actually strictly
positive, as can easily be shown). The derivative of $L$ is
taken with respect to $e$, and that of the integrand with
respect to $w$ for all $\Pi$. We will be concerned only with the
second differentiation, which yields

$$-f(\Pi; e) + \lambda f(\Pi; e) u'(w(\Pi)) + \mu f_e(\Pi; e) u'(w(\Pi)) = 0,$$

or

$$\frac{1}{u'(w(\Pi))} = \lambda + \mu \frac{f_e(\Pi; e)}{f(\Pi; e)}. \tag{12}$$

(Equation 12 need only be true for almost every $\Pi$, i.e.,
everywhere except possibly on a set of $\Pi$ with measure
zero.) It is left to the reader to compute the derivative of
$L$ with respect to $e$ and to check that the second-order
conditions for the choice of $w(\Pi)$ and $e$ are satisfied
(conditional on the manager's second-order condition being
satisfied).

The interpretation of the result is particularly simple
when there are only two possible levels of effort, low
and high ($e_L$ and $e_H$), rather than a continuum. On the


104. The case of a moving support is uninteresting, at least if the wage
function $w(\cdot)$ is unconstrained. Suppose, for instance, that $\underline{\Pi}$ is a strictly increasing
function of $e$. Suppose further that the shareholders want to induce an
arbitrary level of effort $e_0$. Then it suffices to impose a very large penalty on
the manager if the realized profit $\Pi$ turns out to be lower than the presumed
lower bound $\underline{\Pi}(e_0)$. This deters the manager from choosing any level of effort
$e < e_0$. (Such extreme punishments are not feasible if $w(\cdot)$ is constrained, say
by limited liability.) Thus, a moving support is basically equivalent to full
information. The same phenomenon can also occur with nonmoving, infinite
support, as was shown by Mirrlees (1974).
assumption that the shareholders want to induce the
high level of effort, the incentive-compatibility constraint
becomes

$$\int_{\underline{\Pi}}^{\bar{\Pi}} u(w(\Pi)) f_H(\Pi)\, d\Pi - \Phi(e_H) \ge \int_{\underline{\Pi}}^{\bar{\Pi}} u(w(\Pi)) f_L(\Pi)\, d\Pi - \Phi(e_L), \tag{10'}$$

where $f_H(\cdot)$ and $f_L(\cdot)$ denote the densities for effort levels
$e_H$ and $e_L$. Equation 12 then becomes

$$\frac{1}{u'(w(\Pi))} = \lambda + \mu \left(1 - \frac{f_L(\Pi)}{f_H(\Pi)}\right), \qquad \lambda, \mu > 0. \tag{12'}$$

Because $u'$ is decreasing, $1/u'$ is increasing. Thus, the
higher the relative probability that effort was high when
[...]
managerial compensation increases with observed profits.
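
To see how the two-effort version of equation 12 ties the wage to the likelihood ratio, the sketch below assumes logarithmic utility, u(w) = ln w, so that 1/u'(w) = w and the wage can be read off the right-hand side directly. The multipliers and the two discrete profit distributions are made-up numbers chosen for illustration; they are not solved from the incentive and participation constraints, and probabilities over three profit levels stand in for densities.

```python
# Assumptions for illustration only: u(w) = ln(w), so 1/u'(w) = w.
LAMBDA, MU = 2.0, 1.0          # hypothetical multipliers, not solved for
PROFITS = [10.0, 20.0, 30.0]   # hypothetical profit levels
F_LOW = [0.5, 0.3, 0.2]        # probability of each profit level under low effort
F_HIGH = [0.2, 0.3, 0.5]       # probability of each profit level under high effort

def wage(f_low, f_high):
    """Wage implied by 1/u'(w) = lambda + mu * (1 - f_L / f_H) with log utility."""
    return LAMBDA + MU * (1.0 - f_low / f_high)

wages = [wage(fl, fh) for fl, fh in zip(F_LOW, F_HIGH)]
likelihood_ratios = [fh / fl for fl, fh in zip(F_LOW, F_HIGH)]

for profit, ratio, w in zip(PROFITS, likelihood_ratios, wages):
    print(f"profit={profit:5.1f}  f_H/f_L={ratio:4.2f}  wage={w:4.2f}")
```

Because the ratio of the high-effort to the low-effort probability rises with profit in this example, the implied wage rises with profit as well.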

When is the first-order approach valid? In other words,
when does the optimal solution satisfy the manager's
second-order condition? As was shown with increasing
generality by Mirrlees (1975), Grossman and Hart (1983),
and Rogerson (1985), the following conditions are sufficient
for the first-order approach to be valid (and, in
particular, for equation 12 or 12' to describe the optimal
compensation scheme):


Monotonic-likelihood-ratio property: $f_e/f$ increases with $\Pi$
(or, in the two-level-of-effort case, $f_H/f_L$ increases with $\Pi$).


Convexity of the distribution function: $F_{ee} \ge 0$, or (more
generally) for all $e_1$, $e_2$, $\Pi$, and any $\alpha$ in $[0, 1]$,

$$F(\Pi; \alpha e_1 + (1 - \alpha) e_2) \le \alpha F(\Pi; e_1) + (1 - \alpha) F(\Pi; e_2).$$

That is, the deterministic effort $\alpha e_1 + (1 - \alpha) e_2$ is stochastically
superior to effort $e_1$ with probability $\alpha$ and to
effort $e_2$ with probability $1 - \alpha$. This convexity assumption
has the flavor of decreasing returns to scale.¹⁰⁵

Equation 12 also teaches us something interesting about
the value of a signal. Suppose that the shareholders observe
not only profit $\Pi$ but also some other signal, $s$. This
signal could be the price of inputs, the performances of
other firms, or sunspots. If $G(\Pi, s; e)$ denotes the joint distribution
of $\Pi$ and $s$ for an effort $e$, with density $g(\Pi, s; e)$,
then equation 12 becomes

$$\frac{1}{u'(w(\Pi, s))} = \lambda + \mu \frac{g_e(\Pi, s; e)}{g(\Pi, s; e)}. \tag{12''}$$

Equation 12'' gives the same wage structure as equation
12 (i.e., $w$ depends only on $\Pi$) if

$$\frac{g_e(\Pi, s; e)}{g(\Pi, s; e)} = \frac{f_e(\Pi; e)}{f(\Pi; e)}. \tag{13}$$

However, equation 13, if integrated with respect to $e$, is
equivalent to the existence of two functions $m$ and $n$ such
that

$$g(\Pi, s; e) = m(\Pi, e)\, n(\Pi, s). \tag{14}$$

Equation 14 says that $\Pi$ is a sufficient statistic for $(\Pi, s)$
with respect to $e$. Thus, the optimal incentive scheme uses
the extra information $s$ if and only if $s$ is informative
about $e$ given that $\Pi$ is already available. This theorem is
proved more formally in Holmström 1979 and in Shavell
1979.
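
The sufficient-statistic condition can be illustrated with a small discrete example. This is only a sketch under assumed numbers: two profit levels, two signal values, two effort levels, and hypothetical probability tables. With two effort levels, the likelihood ratio between high and low effort plays the role of the ratio of the effort derivative of the density to the density itself.

```python
PROFITS = [0, 1]
SIGNALS = [0, 1]

# Hypothetical distribution of profit given effort: m(Pi, e).
M = {"low": {0: 0.6, 1: 0.4}, "high": {0: 0.3, 1: 0.7}}
# Hypothetical distribution of the signal given profit only: n(Pi, s).
N = {0: {0: 0.5, 1: 0.5}, 1: {0: 0.2, 1: 0.8}}
# Hypothetical distribution of a signal that depends directly on effort.
Q = {"low": {0: 0.7, 1: 0.3}, "high": {0: 0.4, 1: 0.6}}

def g_factorized(pi, s, e):
    """Joint probability of the product form m(Pi, e) * n(Pi, s)."""
    return M[e][pi] * N[pi][s]

def g_informative(pi, s, e):
    """Joint probability in which s carries extra information about e."""
    return M[e][pi] * Q[e][s]

def likelihood_ratio(joint, pi, s):
    return joint(pi, s, "high") / joint(pi, s, "low")

def profit_is_sufficient(joint, tol=1e-12):
    """True if, for each profit level, the likelihood ratio does not vary with s."""
    return all(
        abs(likelihood_ratio(joint, pi, s) - likelihood_ratio(joint, pi, SIGNALS[0])) < tol
        for pi in PROFITS
        for s in SIGNALS
    )

print(profit_is_sufficient(g_factorized), profit_is_sufficient(g_informative))
```

In the factorized case the signal drops out of the likelihood ratio, so a wage based on it would depend on profit alone; in the second case the ratio moves with the signal, which is when the signal becomes valuable for incentives.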

General results for the moral-hazard problem are rare.
For instance, a reasonable prediction would be that a $1
increase in profit results in an increase in the manager's
wage of between $0 (full insurance) and $1 (residual
claimancy). However, this need not hold. Even for a separable
utility function for the manager (U = φ(e)u(w) − Φ(e),
a special case of which, φ ≡ 1, was used above),
one can prove only that the reward function must be
increasing at some level of profit and has slope < 1 at some

+ constant, 
where the constant is independent of $e$ (because $F(\underline{\Pi}; e) = 0$ and $F(\bar{\Pi}; e) = 1$ for
all $e$). Because $w'(\Pi) > 0$ (from equation 12), $F_{ee} \ge 0$, and $\Phi'' > 0$, this objective
function is concave in $e$.
(possibly different) level of profit (Grossman and Hart
1983). Only when there are two possible profit levels, $\Pi_1$
and $\Pi_2$, does one necessarily have

$$0 < \frac{w_2 - w_1}{\Pi_2 - \Pi_1} < 1$$

(where $w_i$ corresponds to $\Pi_i$). Alternatively, one can suppose
that the likelihood ratio is monotonic and that the
distribution function is convex in the effort to apply
the first-order approach and its conclusions (such as the
monotonicity of the reward function). Getting more specific
results requires stronger assumptions on the distribution
and the utility functions.


Answers and Hints 


Exercise 1

(i) The efficient amount of investment is given by

$$\max_I \, [v(I) - I].$$

This yields $I^* = 2$.

(ii) Ex post bargaining leads to $v(I) - p = p - c$, so
$p = [v(I) + c]/2$. Ex ante, the supplier maximizes

$$\max_I \left( \frac{v(I) + c}{2} - c - I \right).$$

This yields $I = 1$ (suboptimal investment). Not investing
at all is not profitable, because c < 4.

(iii) If the buyer has the right to buy at price $p$, the
supplier either will not invest or will invest the minimum
amount so that the buyer buys: $v(I) = p$. The second
strategy yields profit $p - c - v^{-1}(p)$. If $p = v(2) = 4$, the
investment is 2 and the profit is $2 - c > 0$. If $p < 4$,
investment is suboptimal or nonexistent. If the supplier
has the right to sell at a given price, investment is nil.

(iv) Efficient investment. 
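
The answers to parts (i) and (ii) can be reproduced numerically once a value function is fixed. The exercise statement is not reproduced in this section, so the sketch below assumes a hypothetical value function, v(I) = 3I − I²/2, chosen only because it satisfies v(2) = 4 as used in part (iii); the production cost is likewise a made-up number.

```python
# Hypothetical value function (an assumption, not given in this section):
# v(I) = 3I - I^2/2, which satisfies v(2) = 4.
def v(investment):
    return 3.0 * investment - 0.5 * investment ** 2

C = 0.25  # hypothetical production cost
GRID = [i / 1000 for i in range(3001)]  # investment levels in [0, 3]

# (i) Efficient investment maximizes v(I) - I.
efficient = max(GRID, key=lambda i: v(i) - i)

# (ii) With ex post bargaining, p = [v(I) + c] / 2 and the supplier
# maximizes p - c - I.
bargaining = max(GRID, key=lambda i: (v(i) + C) / 2 - C - i)

print(efficient, bargaining)
```

Under this assumed value function the supplier invests less under ex post bargaining than is efficient, because he bears the full investment cost but captures only half of the resulting increase in value.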


Exercise 2 


(i) U(w,e) should be constant for insurance purposes. 
Thus, u(w) — ®(e) = Uy for all f. But the minimization 
of w+ f —e subject to the previous constraint yields 
w = Wand e = e* for all f if A is small (i.e., if A < (e%)). 

(ii) Let «(B) and e(B) denote the wage and the effort in 
state B. Incentive compatibility requires that 


i + Aw(p) — B) — W(e(p)) 
> W+ Alw(B) — B) — W(e(f) — (Bf — B)) 
or 
A(w(B) — w(B)) > O(e(B)) — B(e(B) — (B — pi). 


Using the convexity of ® and the condition 4 < ©'(e) for 
all e, we get 


w(B) + B— e(B) > w(B) + B — elf). 


Thus, the cost for the principal cannot be lower in state 8 
than in state B. For any wage structure, the manager's 
utility is 
$$\max_e \, [u(w(\beta - e)) - \Phi(e)].$$


(Because of the previous incentive-compatibility constraint,
the worst state of nature for the manager is f.)
These two properties imply that the contract in the text
is optimal. Because monetary incentives are too costly,
the principal contents himself with a given cost target.

(iii) The cost of having two managers under full information
is 2% + EB — e*. But this cost is reached under
the yardstick-competition scheme in the text: In state $\beta$,
manager $i$ maximizes

$$\max_e \, [w + \Phi'(e^*)(e - e^*) - \Phi(e)].$$

Hence, $e = e^*$ for all $\beta$.
Monopoly


This chapter discusses various arguments made in favor 
of and in opposition to monopoly power. We will assume 
here that the goods produced by the monopolist are 
given, and that their qualities are known by the con- 
sumers. We will also assume that the monopolist charges 
the same price per unit of good for each good produced. 
(More specifically, there is no price discrimination at a 
given point of time. We will, however, consider inter- 
temporal price discrimination.) 

The best-known monopoly distortion, that related to 
pricing strategy, will be tackled in section 1.1. In contrast 
with the behavior of a competitive firm whose product 
demand is infinitely elastic by definition (and which takes 
the price as given), a firm exercising monopoly power 
over a given market can raise its price above marginal 
cost without losing all its clients. Such behavior leads to a 
price that is too high and to a “dead-weight” welfare loss 
for society (unless the firm is able to “price-discriminate” 
perfectly, as we shall see in chapter 3). 

We shall recall the main aspects of the pricing behavior 
of a single-product monopolist. We shall then consider 
a multiproduct monopolist with interrelated production 
costs of or interrelated demands for his various products. 
Last, we shall study the intertemporal pricing behavior of 
a durable-good monopolist. 

Other distortions may also exist. On the one hand, 
both theory and practice suggest that it is more difficult 
for the owners of a firm to keep control over its costs 
when the firm has monopoly power on the product mar- 
ket. Thus, a monopolist may produce given outputs at a 
higher cost than a competitive firm (section 1.2). On the 
other hand, the monopoly rent may give rise to a contest 
among several firms to obtain or secure it. This contest 
may involve socially wasteful expenditures, which partly 
dissipate the monopoly rent. Therefore, monopoly profit 
should not always be taken into account in the expression 
of welfare (section 1.3). 

Naturally, the conclusions would hold as well for monopsony
power (i.e., monopoly power in the input markets).


1.1 Pricing Behavior 


The best-known monopoly distortion results from the
monopolist's pricing behavior. To focus on this distortion,
we assume that the monopolist's products are given
and that their existence and quality are known to consumers.
We start by reviewing the distortionary markup
by a monopoly producer of a single good. We then study
the multiproduct monopolist. Last, we consider the issue
of intertemporal pricing by a durable-good monopoly.


1.1.1 A Single-Product Monopolist 


1.1.1.1 The Inverse Elasticity Rule 


Let $q = D(p)$ be the demand for the good produced by
the monopoly, with inverse demand function $p = P(q)$.
Let $C(q)$ be the cost of producing $q$ units of this good.
Assume that demand is differentiable and decreasing with
the price (i.e., $D'(p) < 0$),¹ and that cost is differentiable
and increasing with the output. A profit-maximizing monopolist
chooses the monopoly price $p^m$ so as to

$$\max_p \, [p D(p) - C(D(p))].$$

The first-order condition for this problem is

$$p^m - C'(D(p^m)) = -\frac{D(p^m)}{D'(p^m)},$$

or, dividing by $p^m$,

$$\frac{p^m - C'}{p^m} = \frac{1}{\varepsilon}, \tag{1.1}$$

where $\varepsilon = -D'p^m/D$ denotes the demand elasticity at
the monopoly price $p^m$. Letting $q^m = D(p^m)$ denote the
monopoly output, one can rewrite the first-order condition
as the equality between marginal revenue and
marginal cost:

$$MR(q^m) = P(q^m) + P'(q^m) q^m = C'(q^m).$$

For now, we ignore the second-order condition of the
maximization problem. Equation 1.1 indicates that the
relative “markup,” i.e., the ratio between the profit margin
(price minus marginal cost) and the price, also called the
Lerner index, is inversely proportional to the demand
elasticity. The monopoly sells at a price greater than the 
socially optimal price, which is its marginal cost. The 
price distortion is larger when consumers, facing a price 
increase, reduce their demand only slightly. The intuition, 
of course, is that the monopolist is more wary of the 
perverse effect of a high price on consumption when 
consumers react to a price increase by greatly reducing 
their demand. 

If the elasticity of demand is independent of price (the
demand function is $q = kp^{-\varepsilon}$, where $k$ is a positive constant),
the Lerner index is constant. The monopolist
adjusts his price to shocks on the marginal cost by using 
a constant (relative) markup rule. For instance, if his tech- 
nology exhibits constant returns to scale so that the mar- 
ginal cost is equal to the average cost or unit cost and 
if the elasticity of demand is 2, the monopolist system- 
atically charges twice the unit cost. Thus, if we observe 
a monopolist using such a “rule of thumb,” we should 
not necessarily conclude that this monopolist’s pricing 
behavior is not (privately) optimal. 
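
The constant-markup rule can be verified numerically. The sketch below is an illustration under assumed numbers (unit cost 3, elasticity 2, demand scale k = 1) and uses a simple ternary search, which is valid here because profit is single-peaked in price when the elasticity exceeds 1; the search bracket of 50 times the unit cost is an arbitrary choice that contains the optimum for these values.

```python
def profit(price, unit_cost, elasticity, k=1.0):
    """Profit (p - c) * k * p**(-elasticity) under demand q = k * p**(-elasticity)."""
    return (price - unit_cost) * k * price ** (-elasticity)

def monopoly_price(unit_cost, elasticity, iterations=200):
    """Ternary search for the profit-maximizing price (profit is single-peaked)."""
    lo, hi = unit_cost, 50.0 * unit_cost  # assumed bracket for the optimum
    for _ in range(iterations):
        left = lo + (hi - lo) / 3.0
        right = hi - (hi - lo) / 3.0
        if profit(left, unit_cost, elasticity) < profit(right, unit_cost, elasticity):
            lo = left
        else:
            hi = right
    return 0.5 * (lo + hi)

unit_cost, elasticity = 3.0, 2.0
p_m = monopoly_price(unit_cost, elasticity)
lerner_index = (p_m - unit_cost) / p_m
print(p_m, lerner_index)
```

With an elasticity of 2, the search settles at a price of about twice the unit cost and a Lerner index of about one-half, in line with the inverse elasticity rule.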

More generally, observe that a monopoly always oper- 
ates in a price region such that the elasticity of demand 
(from equation 1.1) exceeds 1. Where the elasticity is 
lower than 1, the monopolist’s revenue—and, a fortiori, 
his profit—are decreasing in quantity (i.e., increasing in 
price). 

It is a simple yet very general property of monopoly
pricing that the monopoly price is a nondecreasing function
of marginal cost. To see this, consider two alternative
cost functions for the monopolist: $C_1(\cdot)$ and $C_2(\cdot)$. Assume
that these cost functions are differentiable, and that
$C_2'(q) > C_1'(q)$ for all $q > 0$. No other assumption on
these cost functions is required. Let $p_1^m$ and $q_1^m$ denote
the monopoly price and quantity when the cost function
is $C_1(\cdot)$; $p_2^m$ and $q_2^m$ are defined similarly. When the cost
function is $C_1(\cdot)$, the monopolist prefers charging $p_1^m$
rather than any other price. In particular, he could charge
price $p_2^m$ and sell quantity $q_2^m$. Thus,

$$p_1^m q_1^m - C_1(q_1^m) \ge p_2^m q_2^m - C_1(q_2^m). \tag{1.2}$$

Similarly, the monopolist prefers to charge $p_2^m$ rather than
$p_1^m$ when his cost function is $C_2(\cdot)$:

1. See the introduction.


2. See the introduction.
$p_2^m q_2^m - C_2(q_2^m) \geq p_1^m q_1^m - C_2(q_1^m)$. (1.3) 

Adding equations 1.2 and 1.3 yields 

$[C_2(q_1^m) - C_2(q_2^m)] - [C_1(q_1^m) - C_1(q_2^m)] \geq 0$, (1.4) 

or 

$\int_{q_2^m}^{q_1^m} [C_2'(x) - C_1'(x)]\,dx \geq 0$. (1.5) 


Because $C_2'(x) > C_1'(x)$ for all x, equation 1.5 implies that 
$q_1^m \geq q_2^m$. In other words, the monopoly price is a 
nondecreasing function of marginal cost.³ 


1.1.1.2 The Dead-Weight Loss 


Equation 1.1 provides a quantification of price distortion, 
but from a normative viewpoint the appropriate measure 
of distortion is the loss of social welfare. To measure the 
latter, we compare the total surplus at the monopoly price 
with that at the competitive (marginal-cost) price. The 
total surplus is equal to the sum of the consumer surplus 
and the producer surplus (or profit), or to the difference 
between total consumer utility and production costs.⁴ In 
figure 1.1 this surplus is represented by the area DGAD 
under marginal-cost pricing and by the area DEFAD 
under monopoly pricing. 

The net consumer surplus under monopoly is the area 
of the “triangle” CDE in figure 1.1. The monopolist's 
profit is equal to the total revenue, $p^m q^m$, minus the integral 
of the marginal cost, i.e., equal to the area of the 
“trapezoid” ACEF. Thus, the “dead-weight” welfare loss 
is equal to the area of the “triangle” EFG. (These are a 
proper triangle and trapezoid only if the demand and 
marginal-cost curves are linear.) 
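
For the linear case mentioned in the parenthesis, the three areas can be computed directly. The sketch below is illustrative only: the inverse demand P(q) = a - bq, the constant marginal cost c, the function name, and the numbers are all assumptions, not taken from figure 1.1.

```python
def linear_monopoly_surplus(a: float, b: float, c: float) -> dict:
    """Surplus split for inverse demand P(q) = a - b*q and constant marginal cost c."""
    q_m = (a - c) / (2 * b)          # marginal revenue a - 2*b*q equals c
    p_m = a - b * q_m                # monopoly price
    q_c = (a - c) / b                # quantity at which price equals marginal cost

    consumer_surplus = 0.5 * (a - p_m) * q_m   # triangle under demand, above p_m
    profit = (p_m - c) * q_m                   # rectangle between p_m and c
    total_competitive = 0.5 * (a - c) * q_c    # total surplus under marginal-cost pricing
    dead_weight_loss = total_competitive - consumer_surplus - profit

    return {
        "p_m": p_m,
        "q_m": q_m,
        "consumer_surplus": consumer_surplus,
        "profit": profit,
        "dead_weight_loss": dead_weight_loss,
    }


# Hypothetical parameters.
result = linear_monopoly_surplus(a=10.0, b=1.0, c=2.0)
print(result)
```

In this linear, constant-cost case the dead-weight loss equals the net consumer surplus and is half of the monopoly profit; that ratio is specific to the assumed functional forms.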

The welfare loss does not necessarily decrease with the 
elasticity of demand, even though the relative markup 
does (from equation 1.1). The monopoly situations for 
which we observe strong price distortions correspond to 
those in which demand elasticity is low, so that con- 
sumers decrease their quantity demanded only slightly 
in response to a unit price increase. Consequently, in 
precisely these situations, price changes do not affect 
quantity consumed very much; rather, they elicit a large 


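[Figure 1.1: monopoly pricing. Curves shown: demand P(q), marginal revenue, and marginal cost C'(q). Marked regions: net consumer surplus and welfare loss. The monopoly quantity q^m is marked on the quantity axis.] 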


monetary transfer from consumers to the firm. Hence, we 
cannot conclude that the welfare loss is monotonic in the 
elasticity of demand. 


Exercise 1.1** In a monopolized industry, the demand 
function has a constant elasticity: $q = D(p) = p^{-\varepsilon}$, where 
$\varepsilon > 1$ is the elasticity of demand. Marginal cost is 
constant and equal to c. 

(i) Show that a social planner (or a competitive industry) 
would yield a total welfare of 


$W^c = c^{1-\varepsilon}/(\varepsilon - 1)$. 


(ii) Compute the welfare loss, WL, under monopoly. 

(iii) Show that the ratio $WL/W^c$ (relative dead-weight 
loss) increases with ε, that WL is nonmonotonic in ε, and 
that the fraction $\Pi^m/W^c$ of potential consumer surplus 
that can be captured by the monopolist increases with ε. 
Discuss the result. (Note that the “size” of the market 
changes with ε.) 


Exercise 1.2* Suppose that all consumers have unit demand. 
They buy 0 or 1 unit of the good produced by the 
monopolist. They are identical, and they have willingness 
to pay (valuation) 5 for the good. Show that monopoly 
pricing does not create a welfare loss. 


3. This style of proof is familiar from the literature on incentives. Though less 
familiar in industrial organization, it will be used occasionally in this book. 

The monopoly price may be nonunique owing to nonconcavities in the 
profit function. It is then a correspondence rather than a function. The result 
then says that any optimal price for cost function $C_2(\cdot)$ (weakly) exceeds any 
optimal price for cost function $C_1(\cdot)$. 


4. This criterion ignores problems of income distribution; see the introduction. 

Of course, the dead-weight welfare loss represents only 
what can be gained from moving from a monopoly situa- 
tion to an ideal situation. It thus yields an upper bound on 
the efficiency gain to be realized by correcting monopoly 
pricing. The actual efficiency gain must be computed for 
any policy intervention that does not yield marginal-cost 
pricing. To put it another way, a high total distortion 
associated with the monopolization of an industry is a 
signal that some public intervention might be desirable, 
but it does not suggest a course of action. An analyst 
or a government should begin analyzing the causes of 
monopolization (see chapter 8), as well as the set of 
potential interventions. The latter will depend crucially 
on the information available to the social planner 
concerning industry conditions (cost structure, demand). 


Remark The welfare loss can be measured empirically 
by estimating the demand curve and the marginal-cost 
curve. For a discussion of methodology and results see 
Scherer 1980, p. 461. Harberger’s (1954) estimate of a 
total welfare loss not exceeding 0.1 percent of the gross 
national product implied that economists were wasting 
their time focusing on the monopoly-pricing problem and 
gave rise to much controversy about both the data and the 
methodology.⁵ Industrial-organization economists are, in 
general, mainly interested in industries which are 
monopolized to at least some extent. An economy-average 
number understates the typical distortion in those indus- 
tries because it includes many fairly competitive indus- 
tries in the sample. The dead-weight loss is only one of 
the harmful effects of monopoly, as we shall see below. 
(Including some of the other distortions, such as those 
associated with rent seeking, some researchers found wel- 
fare losses of up to 7 percent of GNP. See Cowling and 
Mueller 1978 and Jenny and Weber 1983; see Scherer 
1980 for a skeptical view of these high estimates.) 


1.1.1.3 The Effect of Commodity Taxation 


Consider one possible policy prescription for restoring 
the social optimum in the presence of monopoly. Suppose 
that the government taxes monopoly output at the rate t. 
Then the monopolist chooses p to 


$\max_p \; [\,p D(p + t) - C(D(p + t))\,]$, 


from which it follows that 

$D(p + t) + D'(p + t)(p - C') = 0$ 

or 

$[D(p + t) - tD'(p + t)] + D'(p + t)(p + t - C') = 0$. 


To restore the social optimum, marginal cost C' must 
coincide with the price faced by the consumers (p + t) 
and thus with the marginal utility in terms of money to 
the consumers. Therefore, we must set 


$t = D(p^c)/D'(p^c) < 0$ 


(i.e., $t/p^c = -1/\varepsilon$), where $p^c$ is the competitive price 
(determined by the intersection of the demand and 
marginal-cost curves in figure 1.1). Since t < 0, we must 
subsidize the output of the monopolist. We can explain 
this rather paradoxical result as follows: The problem 
with monopoly pricing is that it induces consumers to 
consume too little of the good. In order to achieve an 
efficient allocation of resources, we induce them to con- 
sume more by subsidizing the good. 
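
The subsidy formula can be verified on a simple case. The sketch below assumes a linear demand D(P) = a - bP in the consumer price P = p + t and a constant marginal cost c; these functional forms, the function names, and the numbers are illustrative assumptions. With them, the monopolist's first-order condition gives the producer price in closed form, and the tax t = D(p^c)/D'(p^c) evaluated at the competitive price p^c = c should bring the consumer price down to c.

```python
def producer_price(t: float, a: float, b: float, c: float) -> float:
    """Monopolist's optimal producer price p when consumers pay p + t.

    Solves D(p + t) + D'(p + t) * (p - c) = 0 for D(P) = a - b*P.
    """
    return (a - b * t + b * c) / (2 * b)


def restoring_tax(a: float, b: float, c: float) -> float:
    """t = D(p_c) / D'(p_c) at the competitive price p_c = c (negative: a subsidy)."""
    demand_at_cost = a - b * c
    slope = -b
    return demand_at_cost / slope


# Hypothetical parameters.
a, b, c = 10.0, 1.0, 2.0
t = restoring_tax(a, b, c)
consumer_price = producer_price(t, a, b, c) + t
elasticity_at_cost = b * c / (a - b * c)   # -D'(p) p / D(p) at p = c
print(t, consumer_price, t / c, -1 / elasticity_at_cost)
```

The computed t is negative, the consumer price equals marginal cost, and t/p^c matches -1/ε at the competitive price, as the formula in the text states.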


Exercise 1.3* A monopolist’s marginal cost of supplying 
a good to consumers is $\tilde{c} = c + t$ (where t is a unit 
commodity tax). Let $p^m(\tilde{c})$ denote the corresponding 
monopoly price. 

(i) Compute $dp^m/d\tilde{c}$ for the following demand functions: 
$p = \alpha - \beta q$, $p = a - b \ln q$. 

(ii) Sumner (1981) uses an ingenious approach to estimate 
the elasticity of demand, and thus the degree of 
monopoly power, in the American cigarette industry. 
He notes that in the United States, commodity taxes, 
and therefore the generalized cost $\tilde{c}$, vary across states. 
Although data on c are hard to obtain, data on t are 
readily available. Sumner uses varying levels of taxation 
across states to estimate the elasticity of demand. Bulow 
and Pfleiderer (1983) argue that the method has limited 
applicability. What do you think? 


5. In particular, Harberger assumed unit demand elasticities, which creates a 
downward bias in the estimation of welfare losses. Furthermore, Bergson (1973) 
showed that Harberger’s partial-equilibrium approach can be a major source of 
bias on either side. Another drawback of Harberger’s approach is the identification 
of the competitive profit rate with the mean (cross-sectional) profit rate, 
which incorporates monopoly profits. 
Despite the simplicity of the result, the subsidy solution 
has only a few advocates. Its critics point out that the 
concept of total surplus accords equal weight to consumer 
surplus and to the monopoly profit of the firm's shareholders, 
so that a pure transfer from consumers to shareholders 
has no reported social cost. The implementation 
of such a policy raises further problems. It is difficult for 
the government to estimate demand elasticity and to 
determine the marginal cost of the monopolist. Of course, 
it is in the firm's self-interest for the state to err in granting 
too large a subsidy.⁶ Faced with this situation, the 
firm will seek to “inflate” the subsidy by its actions and in 
its dealings with the government. To use such a subsidy 
policy in a discriminating way, the government will most 
likely need to obtain some information about demand and 
cost directly, and not through the monopolist. Demand 
information can be obtained through sampling, although 
this technique is potentially expensive and may be hard 
to implement if the monopolist supplies only a few large 
customers. Cost information is even harder to extract, 
because the monopolist is, for obvious reasons, reluctant 
to release accurate estimates of its cost structure.⁷ 
Alternatively, the government can offer the monopolist 
incentives to reveal its cost structure. For instance, it can 
reward (in a lump-sum way) the monopolist when the latter 
charges low prices. The government may thus induce the 
monopolist to charge a low price when he has a low 
marginal cost. This type of policy tends to reduce the 
dead-weight loss. 

By considering “sophisticated” incentive schemes, we 
are moving away from industrial organization proper into 
regulation, a realm in which a subsidy policy is not 
optimal any longer, as there exist alternative regulatory 
schemes that yield lower welfare losses.⁸ Why stay away 
from regulation? First, it is a large field that can hardly be 
treated in a concise manner; second, its theoretical 
foundations require some familiarity with the theory of 
incentives, which would require further developments. The 
point here is simply that the government's incomplete 
information about market conditions creates difficulties 
for intervention. For a correct treatment of the matter, 
informational asymmetries should be explicitly introduced 
into the model; then the efficiency of various types of 
intervention (including commodity taxation) should be 
analyzed. 


1.1.1.4 Second-Order Conditions 


Let us return briefly to second-order conditions, which 
require concavity or quasi-concavity of the objective func- 
tion. It happens that the profit function of the monopolist 
is not always concave even if his cost function is convex. 
The problem is that the revenue function may not be 
concave—that is, marginal revenue may not be decreas- 
ing everywhere. The second derivative of the revenue 
function R(p) = p D(p) is 


R"(p) = 2D'(p) + pD"(p). 


Our assumption that demand is downward sloping en- 
sures that the first term in R"(p) is negative. The second 
term is nonpositive if demand is linear or, more generally, 
concave. If demand is convex, the revenue function (and 
thus the profit function) may not be concave.⁹ 
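
As a worked illustration (not part of the original text), take the constant-elasticity demand $D(p) = p^{-\varepsilon}$ with $\varepsilon > 1$, which is convex. Substituting into the expression for R''(p) suggests that revenue, viewed as a function of price, is then convex rather than concave:

```latex
\begin{align*}
D'(p) &= -\varepsilon\, p^{-\varepsilon-1}, \qquad
D''(p) = \varepsilon(\varepsilon+1)\, p^{-\varepsilon-2}, \\
R''(p) &= 2D'(p) + pD''(p)
        = -2\varepsilon\, p^{-\varepsilon-1} + \varepsilon(\varepsilon+1)\, p^{-\varepsilon-1}
        = \varepsilon(\varepsilon-1)\, p^{-\varepsilon-1} > 0 .
\end{align*}
```

In such a case the second-order condition cannot be established through concavity of R(p); a weaker property, such as quasi-concavity of the profit function, has to be used instead.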


Exercise 1.4* Assume that demand has constant 
elasticity ε: 

$q = D(p) = p^{-\varepsilon}$. 


Suppose that the cost function is convex. Show that the 
monopolist’s profit function is quasi-concave if ε > 1. 


1.1.2 Multiproduct Monopoly 


Consider now the case of a multiproduct firm which has 
monopoly power over all the goods it manufactures. It 


6. The “envelope theorem” supports this. Where Π denotes the monopolist's 
profit, $d\Pi/dt = (p - C')D' = -D$, using the first-order condition. 


7. The monopolist may not know the exact cost structure himself, of course. 
What matters for the argument, however, is simply that the monopolist has 
private information about the technology. 


8. For an analysis of optimal price regulation under asymmetric information 
about the technology, see the pioneering papers by Baron and Myerson (1982) 
and Sappington (1982) on the single-product monopolist and that by Sapping- 
ton (1983) on the multiproduct monopolist. For an analysis of optimal price and 
cost regulation under asymmetric information about the technology and under 
moral hazard, see Laffont and Tirole 1986. For surveys of this line of research 
and further topics (ex ante and ex post competition, dynamics, etc.), see Baron 
1986, Besanko and Sappington 1987, Caillaud et al. 1988, and Sappington and 
Stiglitz 1987. 


9. If the objective function is not concave, the achievement of the social 
optimum by a subsidy policy becomes still more difficult. (The monopolist’s 
reaction function, i.e., the determination of the price, p, depending on the tax, t, 
imposed, is discontinuous. See Guesnerie and Laffont 1978 for a discussion of 
this point.) 
produces goods $i = 1, \ldots, n$, charges prices $p = (p_1, \ldots, p_n)$, 
and sells quantities $q = (q_1, \ldots, q_n)$, where $q_i = D_i(p)$ 
is the demand for good i. The cost of producing the 
output vector is $C(q_1, \ldots, q_n)$. 

In subsection 1.1.1 we analyzed the case of a single-product 
monopoly or, equivalently, that of a multiproduct 
monopoly for which demands are independent: 
$q_i = D_i(p_i)$ (the demand for good i depends only on the 
price of good i) and total cost can be decomposed in 
subcosts: 


$C(q_1, \ldots, q_n) = \sum_{i=1}^{n} C_i(q_i)$ 


(cost separability). The pricing problem can then be 
decomposed into n subsidiary pricing problems. Equation 
1.1 tells us that the monopolist imposes a higher markup 
on those goods with a lower elasticity of demand. We 
will derive a straightforward implication of this result in 
chapter 3, where we will reinterpret a manufacturer selling 
the same good in several distinct markets as a multiproduct 
monopolist. This result represents the simplest 
form of “Ramsey pricing,” which depicts how markups 
should vary with the elasticities of demand.¹⁰ 

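More generally, the multiproduct monopolist maximizes 


$\sum_{i=1}^{n} p_i D_i(p) - C(D_1(p), \ldots, D_n(p))$. 


This results in the following formula, which generalizes 
the equality between marginal revenue and marginal cost: 


$\left(D_i + p_i \frac{\partial D_i}{\partial p_i}\right) + \sum_{j \neq i} p_j \frac{\partial D_j}{\partial p_i} = \sum_{j} \frac{\partial C}{\partial q_j} \frac{\partial D_j}{\partial p_i}$ for all i. (1.6) 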


To analyze this formula, we will consider the two polar 
cases. (The second-order conditions for equation 1.6 will 
not be discussed here.) We will phrase the results in terms 
of the biases that would result if the firm were operated 
by n independent divisions, each producing one good and 
maximizing profit on that good. 


1.1.2.1 Dependent Demands, Separable Costs 


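Let us assume that the total cost can be split into n costs: 


$C(q_1, \ldots, q_n) = \sum_{i=1}^{n} C_i(q_i)$. 


Then, after some algebraic manipulation, equation 1.6 
becomes 


$\frac{p_i - C_i'}{p_i} = \frac{1}{\varepsilon_{ii}} - \sum_{j \neq i} \frac{(p_j - C_j')\, D_j\, \varepsilon_{ij}}{R_i\, \varepsilon_{ii}}$, 


where $\varepsilon_{ii} \equiv -(\partial D_i/\partial p_i)(p_i/D_i)$ is the own elasticity of 
demand (which we will assume to be positive), 
$\varepsilon_{ij} \equiv -(\partial D_j/\partial p_i)(p_i/D_j)$ is the cross-elasticity of demand for 
good j with respect to the price of good i, and $R_i \equiv p_i D_i$ 
is the revenue associated with good i. 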

First, consider the case of goods that are substitutes, i.e., 
for all j different from i, $\partial D_j/\partial p_i > 0$ or $\varepsilon_{ij} < 0$. In this 
case, the Lerner index for each good i exceeds the inverse 
of the own elasticity of demand. This can be explained 
simply: An increase in the price of good i raises the 
demand for good j. So, if the firm is decomposed into n 
divisions, each producing and marketing its own good 
and maximizing its own profit ($R_i - C_i$), each division 
charges too low a price from the point of view of the 
aggregate firm. The divisions are de facto competitors 
because of the substitutability between their goods. Hence, 
they must be given incentives to raise their own price 
(eliminate the externalities between them). 

Second, for complements ($\partial D_j/\partial p_i < 0$ for all j different 
from i), the inverse of the own elasticity of demand 
exceeds the Lerner index for each good. This can easily be 
understood: A decrease in the price of good i raises the 
demand for good j. An interesting phenomenon that may 
arise with complements is that one or several of the 
goods may be sold below marginal cost (so their Lerner 
index may be negative), so as to raise the demand for 
other goods sufficiently. This possibility will be demon- 
strated in chapter 3. 
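
The comparison between the integrated firm and independent divisions can be sketched with a symmetric two-good example. Everything in the block below is an assumption made for illustration: the linear demands D_i = a - b p_i + g p_j (g > 0 for substitutes, g < 0 for complements, with |g| < b), the common constant unit cost c, the function names, and the numbers. The symmetric prices follow from the first-order conditions of joint profit and of each division's own profit, respectively.

```python
def joint_price(a: float, b: float, g: float, c: float) -> float:
    """Symmetric price maximizing the sum of both goods' profits."""
    return (a + (b - g) * c) / (2 * (b - g))


def divisional_price(a: float, b: float, g: float, c: float) -> float:
    """Symmetric price when each division maximizes only its own profit."""
    return (a + b * c) / (2 * b - g)


def lerner_and_inverse_elasticity(p: float, a: float, b: float, g: float, c: float):
    """Lerner index and inverse own elasticity at a symmetric price p."""
    demand = a - (b - g) * p
    own_elasticity = b * p / demand
    return (p - c) / p, 1 / own_elasticity


# Hypothetical parameters.
a, b, c = 10.0, 2.0, 1.0
for g in (1.0, -1.0):                      # substitutes, then complements
    p_joint = joint_price(a, b, g, c)
    p_div = divisional_price(a, b, g, c)
    lerner, inv_eps = lerner_and_inverse_elasticity(p_joint, a, b, g, c)
    print(g, round(p_joint, 4), round(p_div, 4), round(lerner, 4), round(inv_eps, 4))
```

Under these assumptions, the joint price lies above the divisional price and the Lerner index exceeds the inverse own elasticity when g > 0; both comparisons reverse when g < 0.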


10. See Ramsey 1927 and Robinson 1933. The result is due to Robinson; the 
link with Ramsey's contribution was made later. The traditional Ramsey context 
is that of a multiproduct firm whose objective is the maximization of social 
welfare rather than profit. Boiteux (1956) constructed a general-equilibrium 
model in which the social planner, having authority over some public firms, 
maximizes a social-welfare function subject to the constraint that these firms 
make non-negative profits. The general Ramsey formula naturally depends on 
the cross-elasticities of demand and on the elasticities of supply. On this topic 
see also Baumol and Bradford 1970, Sheshinski 1986, and Brown and Sibley 
1986. 

These models contain no endogenous explanation of the budget constraint 
for the public sector. 
Exercise 1.5* A firm has monopoly power on the production 
of nuts (good 1) and bolts (good 2). Nuts and bolts 
are perfect complements. Thus, demand depends only on 
the total price: $D_i(p_1, p_2) = D(p_1 + p_2)$ for all i. Show 
that equation 1.6 boils down to the monopoly-pricing 
formula in a single composite market. 


An Application: Intertemporal Pricing and Goodwill 


Consider a monopoly producer of a single good. This 
good is sold in two consecutive periods: t = 1, 2. At date 
1 the demand is $q_1 = D_1(p_1)$ and the production cost is 
$C_1(q_1)$; at date 2 the demand is $q_2 = D_2(p_2, p_1)$ and the 
production cost is $C_2(q_2)$. There is a goodwill effect, in 
that a lower first-period price raises first-period demand 
and raises second-period demand as well: $\partial D_2/\partial p_1 < 0$.¹¹ 
The monopolist’s profit is thus 


$p_1 D_1(p_1) - C_1(D_1(p_1)) + \delta\,(p_2 D_2(p_2, p_1) - C_2(D_2(p_2, p_1)))$, 


where δ is the discount factor. Letting $\tilde{D}_2 \equiv \delta D_2$ and 
$\tilde{C}_2 \equiv \delta C_2$, we can rewrite this profit as that of a 
multiproduct monopoly with interdependent demands. The 
two economic goods are the single good at the two 
different dates. From our previous analysis, we can 
conclude the following: As $\partial D_1/\partial p_2 = 0$, the monopolist 
charges the monopoly price in the second period 
conditional on the goodwill accumulated in the first (in other 
words, the second-period Lerner index is equal to the 
inverse of the second-period elasticity of demand). In the 
first period, however, the monopolist charges a price 
under the static monopoly price, i.e., under the price that 
maximizes $p_1 D_1(p_1) - C_1(D_1(p_1))$. This is very natural, 
because the monopolist realizes that a lower price today 
raises demand tomorrow. He then takes a dynamic per- 
spective by sacrificing some short-run profits to raise 
future profits. 


1.1.2.2 Independent Demands, Dependent Costs 


Let us now assume that the demand for good i depends 
on its price only: $q_i = D_i(p_i)$. Designing a taxonomy for 
dependent costs is a bit more complex than designing 
one for dependent demands. Indeed, although in the 
dependent-demand case one can easily envision a set of 
divisions, each in charge of one product, it may be rather 
unnatural to separate total cost into several components. 
Yet there are some cases in which such a decomposition 
may be reasonable. The application below, also drawn 
from an intertemporal problem, illustrates this. Before 
turning to the application, however, the reader would do 
well to tackle the following exercise. 


Exercise 1.6** A power plant (or a hotel, or an airline) 
faces two types of demand: off-peak ($q_1 = D_1(p_1)$) and 
peak ($q_2 = D_2(p_2)$), where $D_1(p) = \lambda D_2(p)$ with λ < 1. 
(For simplicity, the demands are independent.) The 
marginal cost of production is c (as long as capacity is not 
satiated). The marginal cost of investing one unit of 
capacity is γ. The same capacity serves peak and off-peak 
demands.¹² 

(i) Show that if off-peak demand is small relative to peak 
demand (where “small” is to be defined), the monopolist 
equates marginal revenues to c and (c + γ) respectively. 

(ii) Treat the case in which off-peak demand is not 
small. Solve the case in which demands have constant 
elasticity. 


Application: Learning by Doing 


In some industries, cost reductions are achieved over time 
simply because of learning. Through repetition of its 
activity, the firm gains proficiency. Learning by doing is 
especially apparent in industrial activity. For example, in 
the 1920s the commander of Wright-Patterson Air Force 
Base noted that the number of direct labor hours required 
to assemble a plane decreased as the total number of 


11. The “reduced-form” demand function is not quite satisfactory. The rational 
foundations of goodwill must be based on the analysis of consumer behavior; 
see chapter 2. 

The astute reader may have noticed that the formalization of the demand 
curve in period 1 implicitly assumes that the monopolist chooses the two prices 
sequentially, i.e., does not commit himself about $p_2$ in period 1. Otherwise, with 
rational consumers living for two periods, the announcement of a low $p_2$ in 
period 1 encourages the consumers to try the good, because they will enjoy a 
high surplus if they like the good. In this case, $D_1$ decreases with $p_2$. 

The reader may also feel that calling $D_1(p_1)$ a “first-period demand function” 
is a bit misleading, because rational consumers consider the possibility of repeat 
purchase when deciding whether to buy in the first period. Maybe the best way 
of thinking about this model at the current stage is as follows: One can envision 
two different groups of consumers at the two dates. The goodwill effect stems 
from word of mouth between the two generations. The more consumers there 
are at date 1, the more the generation-2 consumers learn about the characteristics 
or the existence of the product. 


12. Optimal pricing by a firm producing several goods from the same capacity 
was first studied by Boiteux (1949). 
aircraft assembled increased. More recently, learning by 
doing has been observed in the manufacture of 
semiconductors and computers.¹³ 

Consider a single-good monopolist producing at dates 
t = 1, 2. At date t, the demand is $q_t = D_t(p_t)$ (demand can 
be time dependent). The total cost is $C_1(q_1)$ at date 1 and 
$C_2(q_2, q_1)$ at date 2, where $\partial C_2/\partial q_1 < 0$. We are thus 
assuming that a higher production at the beginning lowers 
the production cost later, i.e., that “practice makes 
perfect.” The monopolist’s profit is then 


$p_1 D_1(p_1) - C_1(D_1(p_1)) + \delta\,(p_2 D_2(p_2) - C_2(D_2(p_2), D_1(p_1)))$. 

The maximization of this profit with respect to $p_1$ and $p_2$ 
(i.e., equation 1.6) leads to equality between the marginal 
revenue and the marginal cost (with respect to current 
output) in the second period. However, in the first period 
the marginal revenue is lower than the marginal cost. 
Thus, the monopolist charges less than the one-period 
(myopic) monopoly price (the price that maximizes 
$p_1 D_1(p_1) - C_1(D_1(p_1))$) in the first period; this policy enables him 
to sell more, which increases production and learning.¹⁴ 
Put differently, the firm would underproduce in the first 
period if it were run by two consecutive managers 
maximizing short-term profit. Exercise 1.7 shows in a slightly 
more general model that one can obtain a further result if 
the demand is stationary and costs decrease with 
experience: that the firm’s output grows over time. This result 
would be very natural if the firm behaved myopically: 
the decrease in marginal cost due to learning by doing 
leads to an output expansion. However, a nonmyopic 
firm also desires to produce much in the first period in 
order to learn. The result shows that the second effect is 
dominated by the first. 
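
A two-period numerical sketch can make the comparison with the myopic price concrete. The specification below is an assumption made for illustration and is not the model of the text or of exercise 1.7: inverse demand is p = a - q in both periods, the first-period unit cost is c1, and the second-period unit cost is c1 - k*q1 (a hypothetical linear learning effect). The function names and numbers are likewise illustrative.

```python
def second_period_profit(q1: float, a: float, c1: float, k: float) -> float:
    """Best second-period profit given first-period output q1."""
    c2 = c1 - k * q1               # assumed learning effect on unit cost
    q2 = (a - c2) / 2              # marginal revenue a - 2*q2 equals c2
    return (a - q2 - c2) * q2


def total_profit(q1: float, a: float, c1: float, k: float, delta: float) -> float:
    """First-period profit plus discounted optimal second-period profit."""
    return (a - q1 - c1) * q1 + delta * second_period_profit(q1, a, c1, k)


def myopic_q1(a: float, c1: float) -> float:
    """Output of a manager who maximizes first-period profit only."""
    return (a - c1) / 2


def farsighted_q1(a: float, c1: float, k: float, delta: float) -> float:
    """First-order condition of total_profit in q1 (requires delta * k**2 < 4)."""
    return (a - c1) * (1 + delta * k / 2) / (2 - delta * k ** 2 / 2)


# Hypothetical parameters.
a, c1, k, delta = 10.0, 4.0, 0.2, 0.9
q1_myopic = myopic_q1(a, c1)
q1_far = farsighted_q1(a, c1, k, delta)
marginal_revenue_1 = a - 2 * q1_far
q2_far = (a - (c1 - k * q1_far)) / 2
print(q1_myopic, q1_far, marginal_revenue_1, q2_far)
```

With these numbers the farsighted first-period output exceeds the myopic one, first-period marginal revenue falls below the current unit cost c1, and second-period output is higher than first-period output.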


Exercise 1.7***¹⁵ The monopoly producer of a single 
good has a constant unit cost $c(\omega(t))$ at time t, where $\omega(t)$ 
is the firm’s “experience” at that date. (Assume $c > 0$, 
$c' < 0$, and $\lim_{\omega \to \infty} c(\omega) > 0$.) Time is continuous and 
runs from zero to infinity. Experience accumulates with 
production: $d\omega(t)/dt = q(t)$, where q(t) is production at 
date t. (Those who have done the empirical work have 
assumed, as we do, that production exhibits constant 
instantaneous returns to scale and that the appropriate 
measure of experience is cumulative output.) Let R(q) 
denote the revenue function as a function of quantity 
(supposing demand is invariant). Assume R' > 0 and R'' < 0. 
Let r denote the interest rate. The monopolist’s objective 
function is 


$\int_0^{\infty} [R(q(t)) - c(\omega(t))\,q(t)]\,e^{-rt}\,dt$. 


(i) Show that at each instant the monopolist sets 
marginal revenue equal to the average (discounted) unit cost 
in the future: 


$A(t) \equiv \int_t^{\infty} c(\omega(s))\,r e^{-r(s-t)}\,ds$. 


Hint: Consider the current cost and the future savings 
from changing q(t) slightly. 
(ii) Show that output increases over time. 


1.1.3 A Durable-Good Monopolist 


As was noted above, in the case of a product that gives 
rise to goodwill the firm ought to take a dynamic per- 
spective and sacrifice some current profits to enhance 
future profits. Repeat purchases (which will be studied 
more generally in chapter 2) are an instance of a dynamic 
link between the periods: Customers are more likely to 
buy tomorrow if they do so today. Here we investigate 
another kind of intertemporal link on the demand side— 
one that is associated with the durability of goods. We 
now assume that the lifetime of the good exceeds the 
basic “period” (i.e., length of time between price 
revisions). In contrast with the goodwill paradigm for 
nondurable goods, a customer who buys a durable good 
today is unlikely to buy the same good tomorrow. Thus, 
the goods offered by the monopolist at two different 
dates are substitutes rather than complements. (Intertemporal 
pricing by a durable-good monopolist is studied in 
much detail in the supplementary section; only the broad 
issues will be mentioned here.) 


13. One of the very first theoretical analyses of this phenomenon is Arrow 
1961. 


14. Learning by doing can also be viewed, to some extent, as a form of 
dynamic increasing returns to scale (see Scherer 1980, chapter 4). In particular, 
it is easily seen that a competitive equilibrium cannot exist under learning by 
doing if instantaneous costs exhibit constant returns to scale (see Fudenberg 
and Tirole 1983). For an existence theorem with convex instantaneous production 
costs, see Rasmiussen 1986. 


15. This exercise is drawn from Fudenberg and Tirole 1983. 

As we have seen, a durable-good monopolist creates 
his own competition. By selling today, he reduces de- 
mand tomorrow. As we will see, to sell to the residual 
demand, the monopolist lowers the price tomorrow. But 
consumers ought to expect a price decrease and hold back 
on their purchases today. These rational expectations hurt 
the monopolist. 

Suppose that there are seven consumers. These con- 
sumers have “willingnesses to pay” or “valuations” v = 1, 
2,...,7, respectively; v represents the present discounted 
value of the flow of services from the date of purchase on. 
Each consumer can derive utility from only one unit of 
the durable good. Assume further that there is no cost to 
produce the good and that the good is infinitely durable. 
Time is discrete: t = 1, 2, .... The discount factor 
between periods is δ. 

Assume first that the monopolist makes a once-and- 
for-all offer in the first period. (This thought experiment 
is meant to describe what happens in the absence of 
intertemporal effects.) The monopolist then charges the 
monopoly price, $p^m = 4$, and sells to consumers with 
valuation 4 to 7. (The monopoly profit is equal to 16.) 
Now consider the multiperiod model. Suppose that the 
monopolist charges 4 in period 1, and that consumers 
with valuations of at least 4 accept. At the beginning of 
period 2, the monopolist is left with a residual demand, 
composed of the consumers with valuations 1 through 3. 
The monopolist is then tempted to charge a lower second- 
period price. For instance, if the second period is the last 
period at which the monopolist sells,¹⁶ he charges the 
monopoly price corresponding to the residual demand, 
i.e., 2. Now, consider what happens when the consumers 
realize in period 1 that the monopolist will have, ex post, 
an incentive to lower the price in period 2. Consumers 
with high valuations may still accept paying 4 because 
they are eager to get the good.¹⁷ However, the consumer 
with valuation 4, for instance, does not buy, because he 
would get a zero surplus whereas by waiting he could get 
a positive surplus. Thus, the expectation of future price 
cuts reduces the demand in period 1. 
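
The arithmetic of the seven-consumer example can be reproduced in a few lines. The sketch below is illustrative: the function names are hypothetical, and the discount factor 0.5 is an assumed value. It only checks each consumer's incentive to wait while holding the prices 4 and 2 fixed; it does not compute the equilibrium price sequence, which is the subject of the supplementary section.

```python
def best_price(valuations):
    """Revenue-maximizing uniform price for unit-demand consumers and zero cost."""
    def revenue(p):
        return p * sum(1 for v in valuations if v >= p)
    return max(sorted(set(valuations)), key=revenue)


def buys_early(v, p1, p2, delta):
    """True if buying now at p1 beats waiting one period for p2."""
    return v - p1 >= delta * (v - p2)


consumers = list(range(1, 8))                  # valuations 1, 2, ..., 7

# Once-and-for-all offer.
p_once = best_price(consumers)
profit_once = p_once * sum(1 for v in consumers if v >= p_once)

# Residual demand if everyone with valuation >= p_once bought in period 1.
residual = [v for v in consumers if v < p_once]
p_second = best_price(residual)

# Who would still buy at p_once if a second-period price p_second is anticipated?
delta = 0.5                                    # assumed discount factor
early_buyers = [v for v in consumers if buys_early(v, p_once, p_second, delta)]
print(p_once, profit_once, p_second, early_buyers)
```

With a discount factor of 0.5, only the consumers with valuations 6 and 7 still accept the price of 4, so first-period demand falls from four buyers to two.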

To solve for the equilibrium, one must find a sequence 
of prices and consumers’ expectations such that the ex- 
pectations are rational given the firm's behavior and such 
that the firm’s behavior is optimal given the consumers’ 
expectations. The supplementary section explains how to 
do this. The equilibrium takes the form of a decreasing 
price sequence. Thus, the monopolist price-discriminates 
over time: He first charges a high price and sells only to 
the consumers who are most eager to buy the good. He 
then cuts his price to reach a slightly less eager clientele, 
and so on. This type of intertemporal discrimination 
behavior is often encountered in practice. For instance, 
books are often introduced in hardcover and then pub- 
lished in paperback form a few months or years later. 
It is well known that the production-cost difference 
between a hardcover and a paperback is fairly small. 
Thus, most of the price differential can be explained by 
the intertemporal-discrimination model. Another example 
is the first-run movie feature that is shown later on tele- 
vision, as a home video, on airlines, or at second-run 
moviehouses. 

The flexibility that the monopolist has to adjust his 
price over time actually hurts him. Indeed, it can be shown 
that he would be better off if he could ex ante commit 
himself not to haggle, i.e., not to lower the price once 
high-valuation consumers have bought. (“No haggling” is 
actually optimal for the monopolist when he can commit. 
The fixed price is then, of course, the monopoly price.) 
This is explained by the fact that consumers wait for the 
day when the monopolist will cut his price. Here, price 
discrimination is involuntary—the firm would, ex ante, 
prefer not to be able to discriminate. Further, it can be 
shown that the profit loss for the monopolist under non- 
commitment becomes very high when his price adjust- 
ments are frequent; in fact, a conjecture due to Coase (and 
proved by other researchers; see supplementary section) 
states that when price adjustments become more and 
more frequent the monopolist’s profit converges to zero. 
All trade takes place almost instantaneously, at prices 


16. This may occur if the monopolist has “outside opportunities” or a fixed 
cost of production and/or marketing, which induce him to leave the market. 


17. In order for the consumer to accept, v must satisfy $v - 4 \geq \delta(v - 2)$ or 
$v \geq (4 - 2\delta)/(1 - \delta)$. Such v's exist if the discount factor is not too close to 
unity, i.e., if consumers are impatient. 
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close to marginal cost. This result may be extreme, but it 
illustrates the issue well. 

The supplementary section describes these points in a 
more formal way. It also discusses credibility of commit- 
ment and how, in practice, the monopolist can escape the 
Coase problem to some extent. That section is preceded 
by an example in which the monopolist produces a good 
that is recycled. The example is constructed in such a 
way that the modeling of the buyers’ expectations is 
irrelevant. Thus, it forms a simple introduction to the 
durable-good problem. It also serves as a background for
a brief discussion of monopoly power in the aluminum 
market. 


1.1.4 Learning the Demand Curve 


Throughout this chapter—and most of the book—we 
assume that the monopolist knows his demand curve 
perfectly. One way of justifying this is to assume that the 
monopolist conducts market surveys. But such surveys 
are costly and imperfect, and they always leave some 
residue of uncertainty about the demand curve. A com- 
plementary way of learning demand is to experiment by 
changing prices over time, which usually allows a better 
estimation of the demand curve than keeping one’s price 
constant. 

There is a small literature on optimal intertemporal 
pricing by a monopolist in a Bayesian setting.¹⁸ There are
few general conclusions about the price path to be fol- 
lowed by the monopolist, which obviously need not be 
monotonically increasing or decreasing over time. One 
thing is certain: When setting his price at a given date, the 
monopolist should not maximize expected current profit 
given his current (posterior) beliefs about the demand 
curve. Rather, he should also take into account the value 
of information thus obtained for future pricing. Aghion, 
Bolton, and Jullien (1988) and Lazear (1986) have studied 
models of a stable (nonstochastic) demand curve. Aghion 
et al. ask whether the monopolist eventually learns his
demand curve and therefore charges the full-information
monopoly price in the long run. The answer is intuitive.
Suppose that it is initially known that the profit function
is concave and continuous, but that its exact shape is
unknown. Then the monopolist will not stop experimenting
before reaching the monopoly price. To see why, assume that
he keeps his price constant from some period on. By 
charging a price slightly different from this price, he 
learns the slope of the profit function at this price, and he 
does not affect his expected current profit much. But 
learning the slope is very valuable for the future, and is 
therefore desirable. The trick is that, by altering his price 
by an arbitrarily small amount, the monopolist can make 
his experimentation costs arbitrarily small and still learn 
very useful information about the gradient of his objec- 
tive function. Aghion et al. show that nonconcavities or 
discontinuities in the profit function may prevent the 
monopolist from learning his true monopoly price even if 
the demand curve is deterministic. (See Rothschild 1974 
and McLennan 1984 for further results of finite experi- 
mentation in models that allow stochastic demands.) For 
instance, in the case of a nonconcave profit function the 
previous local-experimentation reasoning shows that the 
monopolist will eventually reach a local maximum of the 
profit function. To reach a global maximum, however, 
would require nonlocal experimentation (large changes in 
price), which may prove too costly if the discount factor 
is not sufficiently high. Thus, the monopolist may well 
settle for incomplete learning, even in the long run. Lazear 
(1986) looks at a simple case of learning and obtains a few 
interesting comparative-statics results. For instance, he 
shows how thin markets (such as that for a mansion) are 
likely to exhibit a fairly rigid intertemporal pricing pattern,
whereas thicker markets (such as that for a very
ordinary condominium) will yield larger price changes, 
the idea being that in a thick market the seller learns more 
about the demand curve from observing current demand. 
Similarly, markets with a very diffuse prior probability 
distribution on the demand curve will also exhibit large 
price changes. 
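
The local-experimentation argument can be illustrated with a small numerical sketch. It is not the model of Aghion et al.: the seller here is myopic, the profit functions are hypothetical, and the names `experiment_locally`, `concave_profit`, and `two_peaked_profit` are introduced only for this illustration. The seller observes profit at two nearby prices, infers the slope, and moves in the direction of higher profit.

```python
from typing import Callable


def experiment_locally(profit: Callable[[float], float], price: float,
                       step: float = 0.5, probe: float = 1e-4,
                       max_periods: int = 1000) -> float:
    """Move the price in the direction of the locally measured profit slope."""
    for _ in range(max_periods):
        # Two nearby price trials reveal the slope of the unknown profit function.
        slope = (profit(price + probe) - profit(price - probe)) / (2.0 * probe)
        if abs(slope) < 1e-6:
            break  # no local improvement left, so experimentation stops
        price += step * slope
    return price


def concave_profit(p: float) -> float:
    # Assumed demand D(p) = 1 - p with zero cost: a single peak at p = 0.5.
    return p * (1.0 - p)


def two_peaked_profit(p: float) -> float:
    # Purely illustrative nonconcave profit: a low peak near p = 0.23
    # and a higher peak near p = 0.82.
    return -((p - 0.2) ** 2) * ((p - 0.8) ** 2) + 0.02 * p


print(experiment_locally(concave_profit, 0.2))     # ends at the unique peak
print(experiment_locally(two_peaked_profit, 0.1))  # settles on the low peak
print(experiment_locally(two_peaked_profit, 0.6))  # settles on the high peak
```

With the concave function, small price changes lead the seller to the monopoly price. With the two-peaked function, a seller who starts at a low price stops at the lower peak, which is the incomplete-learning outcome described above. Leaving that peak would require a large, costly price change.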


18. The papers usually abstract from other intertemporal pricing considerations
mentioned in this chapter (intertemporal substitution by consumers for durable
goods, inventories, goodwill, learning by doing) to focus on the learning
aspect.


1.1.5 Inventories


It is assumed throughout most of the book that, in each
period, sales originate from current production. In practice,
inventories may allow firms to separate production
from sales. A sizable and interesting literature treats the
dynamics of quantity and price adjustments, when a firm
faces shocks and can smooth its price path and its pro- 
duction path through inventory holdings. For instance, 
Blinder (1982) analyzes how a monopolist’s production, 
inventories, and price adjust to demand shocks depending 
on whether these shocks are transitory or permanent. 
He assumes that in each period the marginal cost of pro- 
duction is increasing with output. Because of the cost 
convexity, the monopolist prefers a deterministic produc- 
tion to a random one with the same mean. In an inter- 
temporal context, this means that he prefers a stable pro- 
duction to a fluctuating one. Thus, he would like to smooth 
demand shocks over time; this is exactly what inventories 
allow him to do. Consider first a transitory (single-period) 
upward shock in demand. In Blinder’s model, in the 
absence of inventories, the price and the output adjust 
upward. They still do so in the presence of inventories, 
but to a lesser extent. The firm can reduce its inventory 
temporarily and replenish it later. The effect of a single- 
period increase in demand can thus be spread at the pro- 
duction stage over several periods. A permanent shock in 
demand cannot be smoothed as much. A high demand 
today implies a high demand in the future. That is, the 
marginal cost of production will be high tomorrow as 
well. Thus, production (as well as price) reacts more to a 
permanent shock than to a transitory shock.¹⁹
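
The smoothing motive can be checked with simple arithmetic. The sketch below is not Blinder's model: it assumes a quadratic production cost, holds sales fixed (ignoring the price response), and ignores discounting and storage costs. All names and numbers are hypothetical.

```python
from typing import List


def production_cost(q: float) -> float:
    # Assumed quadratic cost, so marginal cost (equal to q) rises with output.
    return 0.5 * q * q


def total_cost(plan: List[float]) -> float:
    return sum(production_cost(q) for q in plan)


def smoothed(plan: List[float]) -> List[float]:
    # Same cumulative output, spread evenly; inventories absorb the timing gap.
    mean = sum(plan) / len(plan)
    return [mean] * len(plan)


def smoothing_gain(sales: List[float]) -> float:
    """Cost saved by producing the average instead of tracking sales."""
    return total_cost(sales) - total_cost(smoothed(sales))


base, shock, horizon = 10.0, 4.0, 4
transitory = [base + shock] + [base] * (horizon - 1)  # one-period demand shock
permanent = [base + shock] * horizon                  # shock that persists

print(smoothing_gain(transitory))  # positive: spreading production saves cost
print(smoothing_gain(permanent))   # zero: there is nothing to smooth
```

Under these assumptions a transitory shock leaves room for a cost saving through inventories, whereas a permanent shock does not, which is why production must respond more fully to the latter.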

Another common theme in the literature on inventory 
behavior is the asymmetric price response to upward and 
downward shocks. In particular, Reagan (1982; see also 
Reagan and Weitzman 1982 for the competitive case) 
assumes that the monopolist can sell only from existing 
inventories. That is, there is a lag between the use of 
inputs and the availability of outputs. Current inventories 
act as a capacity constraint on sales in each period. When 
demand is high, output is determined entirely by inven- 
tories and the firm’s price adjusts so as to clear the market 
(i.e., to satisfy demand). In contrast, when demand is low,
the inventory constraint is not binding (sales are lower 
than inventories). The firm reacts both by choosing a low 
price and by reducing production. Because of this pos- 
sibility of quantity adjustment for low demand, but not 
for high demand, the monopolist’s price tends to react
more to upward shocks in demand than to downward
shocks, as Reagan showed.²⁰
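
A stylized one-period calculation conveys the asymmetry. It is only a sketch and not Reagan's model: demand is assumed linear, q = intercept - p, and an unsold unit is assumed to be worth a constant `unit_value` to the firm (a stand-in for the production cost saved later). Both assumptions and all names are hypothetical.

```python
def price_with_inventory(intercept: float, inventory: float,
                         unit_value: float) -> float:
    """Price for demand q = intercept - p when sales cannot exceed inventory."""
    # Price the seller would choose if the stock were not a constraint:
    # it maximizes (p - unit_value) * (intercept - p).
    unconstrained = 0.5 * (intercept + unit_value)
    # Price at which demand exactly absorbs the whole stock.
    market_clearing = intercept - inventory
    # The stock binds exactly when the market-clearing price is the higher one.
    return max(unconstrained, market_clearing)


inventory, unit_value = 4.0, 2.0
normal = price_with_inventory(10.0, inventory, unit_value)
boom = price_with_inventory(12.0, inventory, unit_value)
slump = price_with_inventory(8.0, inventory, unit_value)

print(boom - normal)   # price rise after an upward shock of 2
print(normal - slump)  # price fall after a downward shock of 2
```

In this example the upward shock is passed fully into the price because sales are capped by the stock, while the downward shock is split between a lower price and lower sales.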


1.2 Cost Distortions 


In section 1.1 the emphasis was on the distortion on the 
demand side associated with a monopolist’s pricing be- 
havior. Monopoly power can also have perverse effects 
on the supply side. In particular, for given goods pro- 
duced by the monopolist and given quantities of those 
goods to be supplied to the consumers, a monopolist may 
produce at a higher cost than would a competitive firm. 
In particular, it has often been suggested that firms in a 
monopoly situation tend to pay little attention to cost- 
cutting strategies, engage in slack, and so forth. Hicks 
(1935), for instance, noted that “the best of all monopoly 
profits is a quiet life.” Machlup (1967) suggested that 
managerial slack can exist only if product markets are not 
perfectly competitive. These ideas may seem paradoxical; 
after all, the monopoly power is on the output side, and 
it is not easy to figure out why output distortions should 
have any effect on the cost of producing a given amount 
of a good. 

To investigate this question, we must go back to the 
concept of cost function—more precisely, to the dele- 
gation problem. As was discussed in the chapter on 
the theory of the firm, a firm's shareholders, who wish 
to maximize profits, may have a hard time monitoring 
and controlling the activities of the firm’s employees 
(executives, workers). The latter naturally seek objec- 
tives other than profit maximization, and unless the share- 
holders perfectly observe the technological environ- 
ment and the employees’ behavior (which is highly 
unrealistic) the firm is likely to engage in “X-inefficiency” 
(Leibenstein 1966). Indeed, we know from the chapter 
on the theory of the firm that, whatever the market struc- 
ture, the firm will generally be able to engage in such 
inefficiency (i.e., that Machlup's suggestion holds only
in very special cases). The question here is how this 
inefficiency is affected by market power on the product 
market.


19. See also the discussion of Blinder in Schutte 1983.
20. For other references on this topic, see Phlips 1980, 1983 and Amihud and
Mendelson 1983.


As we saw in that preliminary chapter, shareholders
can use the performances of firms with related technologies
(or demands) as a yardstick to control the performance
of their firm. For instance, the shareholders may be
suspicious of their firm's claim that it is facing adverse 
exogenous conditions when other firms known to face 
similar supply or demand conditions are doing well. In 
such a case, the managers’ use of the excuse that “times 
are hard” to conceal slack and justify low profits is not as 
credible as when there is no other firm with which to 
compare their firm. This “tournament” idea—basing the 
incentive structure of one’s firm on a comparison with the 
performance of related firms—does not rely on the exis- 
tence of competition in the product market; one could, a 
priori, compare the performances of two power plants 
generating power in two independent regions. But the 
same type of argument can be made when firms compete 
on the product market. Thus, it seems natural to base the 
rewards of Ford managers on the performance of General 
Motors. It can even be argued that, because the exoge- 
nous conditions facing two firms are more likely to be 
correlated when these two firms are in the same product 
market, yardstick competition will in practice be more use- 
ful in industries with several competitors than in product- 
market monopolies. 

This yardstick competition, when it is applicable, may 
explain why a competitive firm's managers are better 
controlled by the shareholders than a monopoly’s managers.²¹
However, Hicks’ statement is only half-true: Although
a monopoly’s managers may engage in more slack
(the “quiet life”), they may not benefit from it, because the
slack is anticipated. In other words, their “participation”
constraint may nevertheless be binding. A lower effort,
say, is then offset by lower rewards.




1.3 Rent-Seeking Behavior 


Section 1.1 described how monopoly pricing lowers consumer
surplus and raises a firm's profit relative to competitive
behavior. The decrease in surplus exceeds the
increase in profit by an amount equal to the dead-weight
loss. Section 1.2 discussed how, for a given output, a 
monopoly position may inflate costs. These extra costs 
add to the dead-weight loss. This section discusses a 
third distortion associated with monopoly: the wasteful 
expenses incurred to secure or maintain a monopoly 
position. 

Consider the rent associated with monopoly pricing. 
Abstracting from the control problem discussed in section 
1.2 (so that the cost function can be defined independent 
of the monitoring technology), one can see that this rent 
is equal to the monopoly profit represented by the trape- 
zoid CEFAC in figure 1.1. It is clear that the existence of 
this potential rent may lead to rent-seeking behavior. 
Firms will tend to spend money and exert effort to ac- 
quire the monopoly position; once installed in that posi- 
tion, they will tend to keep on spending money and 
exerting effort to maintain it. 

A firm may incur both strategic and administrative ex- 
penses to obtain or keep a monopoly position. An exam- 
ple of a strategic expense is the research-and-development 
cost of obtaining a patent, which secures a monopoly 
position for the patented product (see chapter 10). Other 
examples are the accumulating of various forms of capital 
and the erecting of barriers to entry (chapter 8). Among 
the administrative expenses are the costs of lobbying and 
advertising campaigns aimed at influencing the public and 
its elected representatives (“Our firm is at the service of 
the consumer”) and of legal defense against charges of 
antitrust violations. 

Posner (1975) analyzes an extreme case of rent-seeking 
behavior in a contest between firms to become a monop- 
olist and concludes that all monopoly rents should be 
counted in the costs of monopoly. In other words, the 
actual dead-weight loss is represented in figure 1.1 by the 
area CEGFAC. The two main axioms leading to this con- 
clusion are the following. 


(1) rent dissipation: The total expenditure by firms to
obtain the rent is equal to the amount of the rent.

(2) socially wasteful dissipation: This expenditure has no
socially valuable by-products.²²


21. These reflections also imply that the public sector may be more inefficient
than the private sector and, at the same time, not introduce any unnecessary
inefficiency. The reason is that in many countries the public sector encompasses
many industries that are “natural monopolies.” Because of the existence of large
fixed costs, say, an industry cannot be competitive and is nationalized or
regulated (e.g., railroads, the postal service, electricity companies, the
telecommunications industry). Hence, the public sector forms a biased sample in
terms of product-market power, which naturally leads to more slack. However,
the public sector may not be more inefficient per se, because many of its firms
would have engaged in X-inefficiency anyway had they remained private.


Axiom 1 is the zero-profit free-entry condition. The 
idea is that entry (or increases in rent-seeking expenses) 
occurs until the expected rent—i.e., the probability of 
obtaining the rent times the amount of the rent—equals 
the rent-seeking cost for each firm. In equilibrium, for 
instance, ten firms may spend $1 each to have a 10 per- 
cent chance of obtaining $10 in rent, in which case the 
total cost is equal to the rent. 
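
The arithmetic of the free-entry condition can be sketched as follows. This is only an illustration under the assumptions of the example (identical contestants, equal chances of winning, a fixed outlay per contestant); the function names are hypothetical.

```python
def expected_net_gain(rent: float, outlay: float, n_firms: int) -> float:
    """Expected rent minus the outlay for one of n identical contestants."""
    return rent / n_firms - outlay


def free_entry_firms(rent: float, outlay: float) -> int:
    """Number of contestants once a further entrant would expect a loss."""
    n = 1
    while expected_net_gain(rent, outlay, n + 1) >= 0:
        n += 1
    return n


rent, outlay = 10.0, 1.0
n = free_entry_firms(rent, outlay)
print(n)           # number of contestants in equilibrium
print(n * outlay)  # total rent-seeking expenditure
```

With a rent of $10 and an outlay of $1, entry stops at ten contestants, and their combined outlay equals the rent, as axiom 1 requires.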

The plausibility of axiom 1 depends on the way the 
contest is organized. One cannot a priori measure rent 
dissipation without going into the microfoundations of 
the particular situation.²³ Axiom 1 may not be satisfied
for many reasons (see Fisher 1985). First, monopolies can 
be obtained through luck rather than through foresight. 
An extreme and somewhat contrived case is that of the 
patenting of a fortuitous invention. Second, and more 
important, the contenders may not begin on equal footing;
one firm may already have patents, access to particular
mineral resources, private information about technology
or demand, or incumbency advantages,²⁴ which
will make it the most powerful candidate for the monop- 
oly position. Because a firm's competitors may be less 
willing to spend money to obtain the monopoly position, 
it may be able to keep some of the rent. Consider the case 
in which firms must bid for the privilege of becoming a 
franchise monopoly. If all the firms are symmetric, the 
highest bid equals the (common) monopoly rent. With 
asymmetric bidders, however, the firm with the highest 
potential rent is able to keep some of the surplus. Third, 
even with symmetric or almost-symmetric firms, the rent 
need not be dissipated.²⁵
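
The franchise-bidding case can be made concrete with a short sketch. It assumes an ascending auction in which each firm's potential rent is known, so that the firm with the highest rent wins at a bid equal to the second-highest rent. The text does not specify the auction format, so this format and the function name are assumptions.

```python
from typing import List, Tuple


def franchise_auction(rents: List[float]) -> Tuple[float, float]:
    """Return (winning bid, rent kept by the winner) for the assumed auction."""
    ranked = sorted(rents, reverse=True)
    winning_bid = ranked[1]            # rivals drop out at their own rent
    retained = ranked[0] - winning_bid
    return winning_bid, retained


print(franchise_auction([10.0, 10.0, 10.0]))  # symmetric firms
print(franchise_auction([10.0, 7.0, 4.0]))    # asymmetric firms
```

With symmetric firms the winning bid exhausts the rent. With asymmetric firms the winner keeps the difference between its rent and that of its strongest rival.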

Axiom 2 says that the expenses are socially wasteful.
This may be the case when a regulated monopoly position
(e.g., the allocation of import franchises) is allocated
on the basis of lobbying influence.²⁶ However, if the
same monopoly allocation is allocated through an auc- 
tion, the expenses are received by the government and 
thus are not wasteful (in the symmetric case, axiom 1 is 
satisfied but axiom 2 is violated). There are also inter- 
mediate cases in which the expenses are somewhat waste- 
ful. For instance, when air-travel prices and entry on routes 
were regulated in the United States, airlines competed for 
customers (the “rent”) by offering lavish services. This 
type of rent-seeking behavior was not entirely waste- 
ful, because customers enjoyed the services. However, 
the same consumers would have happily traded some of 
these services for price reductions corresponding to the 
cuts in services. 

An interesting case is that of monopoly rents that are 
partially transferred to input suppliers. For example, a 
monopoly rent of 10 may be split into 5 for the owners 
of the firm and 5 for the workers if the union's bargaining 
power enables it to appropriate half of the pie. If this represents
a simple transfer from the owners to the workers
(the labor supply is not altered by the redistribution), the 
“dissipation” of monopoly profit involves no social loss; 
the recorded profit (equal to 5) simply underestimates the 
monopoly rent (equal to 10). However, if the labor supply 
is affected by the redistribution (for instance, if the exist- 
ing workers respond to a higher wage by increasing the 
labor supply), some distortion in allocation is introduced 
as well. 

The bottom line is that rent-seeking behaviors cer- 
tainly waste some of the monopoly profit. That the monop- 
oly profit may be part of the welfare loss associated with 
monopoly is a well-taken point. However, we should 
refrain from drawing any general conclusion about which 
fraction of the monopoly profit should be counted as a 
welfare loss. Only a careful description of the rent-seeking 
game can allow us to give an order of magnitude for this
fraction. As the rent-seeking games vary considerably in
practice, we are obliged to analyze the issue case by case.


22. A further assumption is that the inputs used to obtain the rent cannot be
bid up (their supply is perfectly elastic). An example where they might be bid
up is the case considered below of firms vying for favors from civil servants to
obtain a regulated monopoly position. The rent, instead of being dissipated,
may then be transferred to the civil servants (through bribes in extreme cases).
But, as Krueger (1974) notes, becoming a civil servant in charge of the attribution
of these rents may lead to rent-seeking behavior at that stage. A seminal
paper on rent-seeking activities is Tullock 1967. A useful discussion can be
found in Varian 1987.


23. See the discussion of patent races in chapter 10 of this book.
24. On incumbency advantages and Posner's approach, see Rogerson 1982.
25. See the discussion of preemption games in chapter 8.


26. The analysis here is very vague. What is needed is an equilibrium model
in which lobbying activities have influence. Incomplete information ought to
be the key to building such a model that would explain why lobbying occurs
(information, collusion with decision makers, and so on) and whether lobbying
expenses are socially wasteful.


1.4 Concluding Remarks 


Monopoly power results in high prices and a dead-weight 
welfare loss. There may also exist other, more subtle 
distortions, such as X-inefficiency and dissipation of the 
monopoly profit. (The next chapter considers a further 
distortion associated with product selection.) 

Although pricing distortions are relatively well under- 
stood, cost distortions and rent-seeking behaviors have 
not yet been mastered by economists. To extend sections 
1.2 and 1.3 at a theoretical level and to develop empirical 
methodologies to measure such distortions are two chal- 
lenges posed by this chapter. 

Some mitigating factors balance these perverse effects 
of monopoly power to an extent. 

First, under increasing returns to scale, production by 
a single firm is technologically more efficient. Indeed, 
one of the most often-heard arguments in defense of the 
monopolization of an industry is that it prevents a wasteful
duplication of fixed costs. Williamson (1968) questions
the refusal of the U.S. courts to recognize a defense of
economies of scale in horizontal-merger cases under the
Clayton Act.²⁷ He argues that, under reasonable assumptions
about the elasticity of demand, only a small reduction
in fixed costs is necessary to offset the dead-weight
loss created by the price increase in the case of a merger. 
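
The order of magnitude involved can be gauged with the usual triangle approximation. The sketch below is not Williamson's own calculation. It assumes that the premerger price equals marginal cost and that demand is approximately linear over the relevant range, so that the dead-weight loss is about one-half times the elasticity times the squared proportional price increase, expressed as a share of premerger revenue. The function name is hypothetical.

```python
def deadweight_share(elasticity: float, price_increase: float) -> float:
    """Approximate dead-weight loss as a share of premerger revenue."""
    # Triangle rule: 0.5 * (price change) * (quantity change), with the
    # quantity change approximated by elasticity * price_increase * quantity.
    return 0.5 * elasticity * price_increase ** 2


for elasticity in (0.5, 1.0, 2.0):
    # A 10 percent price increase at various demand elasticities.
    print(elasticity, deadweight_share(elasticity, 0.10))
```

Under these assumptions, a 10 percent price increase with unit elasticity costs about one-half of 1 percent of premerger revenue, so a cost saving of that size would suffice to offset it. If the premerger price already exceeds marginal cost, the loss is larger than this triangle.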

Second, as Joseph Schumpeter suggested, monopoly 
may be a necessary condition for a decent amount of 
research and development. In particular, innovation may 
require the assignment of monopoly property rights 
(patents).²⁸

One cannot express a view on the merits of monopoly 
without considering its alternatives (e.g., competition, 
regulated monopoly) and the ways in which these alter- 
natives may be fostered or obstructed (e.g., subsidies, 
antitrust proceedings, regulation). The relevance of the 
various arguments for and against monopoly eventually 
depends on the relative efficiency of all arrangements²⁹
and on the information possessed by antitrust, regulatory,
and other governmental authorities who promote them. 
This chapter, like most of this book, is more satisfactory 
at the positive level (How do firms behave on the pro- 
duct market?) than at the normative one (How should 
the government correct distortions?). Another challenge 
offered by this chapter is to develop the normative side. 


27. Note, however, that economies of scale are considered in merger cases 
under the present Department of Justice guidelines. 


28. We shall return to this argument in chapter 10. 


29. For instance, chapter 6 is mainly concerned with the question of whether
the pricing distortion is eliminated by competition.


1.5 Supplementary Section: 
Durable Goods and Limits on Monopoly Power 


In this section, we examine how a durable-good monop- 
olist creates his own future competition. The central 
theme is that his monopoly power can be eroded by the 
existence of this nurtured competition. We start with a 
case of a good that has a short lifetime, after which it can 
be recycled by a competitive industry. The purchasers of 
the good dispose of it at the end of the lifetime, and this 
allows us to ignore the purchasers’ expectations about 
future prices. Although extreme, this case offers a simple 
and instructive introduction to the subject. In the second 
example (an “intertemporal price-discrimination problem”),
we consider a good that does not depreciate, and we
focus on the role of consumers’ expectations. This example
shows how consumers who anticipate a price decrease
restrict their purchases.


1.5.1 Recycling 


Consider the case of a monopolist producing a good 
that is recycled by a competitive industry. As a motiva- 
tion for this case, recall the famous 1945 U.S. Supreme 
Court case concerning the Aluminum Company of Amer- 
ica (Alcoa). Alcoa had about 90 percent of the primary 
aluminum market. It was considered a monopoly, and was 
prohibited from expanding (in that the court ordered that 
the aluminum plants built by the government during the
war not be sold to Alcoa), which led rapidly to a more
competitive market in primary aluminum.³⁰ Some economists
opposed the court's decision on the grounds that
there already existed an approximately competitive indus- 
try, independent from Alcoa, that recycled the aluminum 
Alcoa produced. If this secondary market for aluminum 
was taken into account, Alcoa’s market share was only 
64 percent. In fact, the price charged by Alcoa seemed 
moderate for a monopolist. Some even suggested that 
Alcoa’s price was close to its marginal cost. Let us examine
this argument using a simple model.³¹


Consider discrete time periods labeled $t = 1, 2, \ldots$.
Suppose that there is a demand function in each period:
$q_t = D(p_t)$. This demand corresponds to the consumption
demand for aluminum (primary or secondary). Let
$p_t = P(q_t)$ be the inverse demand function. The aluminum
consumed in period $t$ is either lost or recycled by a
competitive industry. Let $x_{t+1} \in [0,1]$ be the fraction of
the aluminum that is recycled. The recycling cost is
$C(x_{t+1})$, where $C$ is a convex, increasing function (i.e., the
recycling technology exhibits decreasing returns). Moreover,
assume that $C(0) = 0$, that $C'(0) = 0$, and that
$C(1) = +\infty$ (it is impossible to recoup the entire input). If
$p_{t+1}$ is the price of aluminum (primary and secondary) in
period $t + 1$, the recycled fraction $x_{t+1}$ is given by

$$p_{t+1} = C'(x_{t+1})$$

(the competitive recycling industry recycles until its marginal
cost equals the price of aluminum). We can then
write $x_{t+1}$ as an increasing function of $p_{t+1}$:

$$x_{t+1} = x(p_{t+1}).$$
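
As an illustration, consider one cost function that satisfies the stated assumptions. The functional form C(x) = -ln(1 - x) - x is an assumption chosen for convenience and is not taken from the text. It is increasing and convex, with C(0) = 0 and C'(0) = 0, and it grows without bound as x approaches 1. Its marginal cost is x/(1 - x), so the recycling rule can be inverted in closed form.

```python
import math


def recycling_cost(x: float) -> float:
    # Hypothetical cost function C(x) = -ln(1 - x) - x on [0, 1).
    if not 0.0 <= x < 1.0:
        raise ValueError("the recycled fraction must lie in [0, 1)")
    return -math.log(1.0 - x) - x


def marginal_recycling_cost(x: float) -> float:
    # C'(x) = 1/(1 - x) - 1 = x / (1 - x)
    return x / (1.0 - x)


def recycled_fraction(price: float) -> float:
    # Solves price = C'(x) for x, which gives x(p) = p / (1 + p).
    return price / (1.0 + price)


for price in (0.0, 0.5, 1.0, 3.0):
    print(price, recycled_fraction(price))
```

For this example the recycled fraction rises with the price of aluminum but never reaches 1, in line with the assumption that the entire input cannot be recouped.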


Remark We are implicitly assuming that the profits from 
recycling (which are positive because the recycling cost 
function is convex) accrue to the recycling industry. In 
other words, the buyers of aluminum at date t dispose of 
their used aluminum at date t + 1. This assumption allows
us to write a per-period demand function $p_t = P(q_t)$.
As will be seen below, if the consumers are able to reuse
the good or resell it, their demand at date t depends on
the price they expect at date t + 1, say. The anticipations
about future prices must then be modeled. One way
of justifying this assumption is to envision a recycling
industry composed of a large number of recycling firms
so that none of them has any power on the (primary plus
secondary) aluminum market (i.e., they are price takers).
Each of these firms, however, has a local monopoly power
in its geographically delineated input market. Thus, they
can charge the monopoly price to obtain the scrapped
aluminum; i.e., if the aluminum cannot be used without
being recycled, it is obtained for free by the recycling
firms. Thus, the recycling firms are competitive only at
the output level.


30. The Supreme Court did not actually hear the Alcoa case. Too many of the
justices had conflicts: because the case had taken so long to get through the
court system, a majority of the justices had served in the Justice Department
while the case was in progress. A special three-judge Appeals Court panel was
established to make the final resolution of the case.


31. The following discussion is based on Martin 1982. See also Gaskins 1974
and Swan 1980.


Suppose that primary aluminum is produced by a monopolist
at a constant unit cost $c$. The monopoly profit in
period $t$ is

$$\pi_t = [P(q_t) - c]\,(q_t - x_t q_{t-1}).$$

(Note that $q_0 = 0$.) $q_t$ is the total production of aluminum
(new plus recycled), and $q_t - x_t q_{t-1}$ is the new aluminum
provided by the monopolist.

Assume that the monopolist maximizes the present
discounted value of his profits:

$$\Pi = \sum_{t=1}^{\infty} \delta^{t} \pi_t.$$


It is left as an exercise for the reader to show that, in a 
stationary state (in which, by definition, quantities and 
prices are constant over time), 


$$(p - c)(1 - \delta x - x'P'q) = -P'(1 - x)q. \qquad (1.7)$$


Furthermore, it is assumed that the second-order condi- 
tions are satisfied. 
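
One way to carry out the exercise is sketched below, assuming an interior optimum and writing $x_t = x(P(q_t))$ for the fraction recycled in period $t$. Output $q_t$ enters only the profits of periods $t$ and $t + 1$.

```latex
% Effect of q_t on current profit (own price and current recycling change)
\frac{\partial \pi_t}{\partial q_t}
  = P'(q_t)\,(q_t - x_t q_{t-1})
  + [P(q_t) - c]\,[1 - x'(p_t)\,P'(q_t)\,q_{t-1}]

% Effect of q_t on next period's profit (more scrap to be recycled)
\frac{\partial \pi_{t+1}}{\partial q_t}
  = -[P(q_{t+1}) - c]\,x_{t+1}

% First-order condition, with next period's effect discounted by \delta
\frac{\partial \pi_t}{\partial q_t}
  + \delta\,\frac{\partial \pi_{t+1}}{\partial q_t} = 0

% Stationary state: q_t = q, p_t = p, x_t = x for all t
P'(1 - x)q + (p - c)(1 - x'P'q) - \delta (p - c)x = 0
\;\Longrightarrow\;
(p - c)(1 - \delta x - x'P'q) = -P'(1 - x)q
```

The last line is the stationary condition. The term $\delta x$ reflects the fact that each unit sold today returns, in part, as competing recycled supply tomorrow.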

Since $x' > 0$ and $P' < 0$, equation 1.7 indicates that
$p > c$. In fact, the long-run price for aluminum can be close
to the competitive price only if the fraction recycled is close 
to 1. This does not seem to have been the situation in 
the Alcoa case (in addition, one must take into account 
the fact that during the period under consideration 
the demand was increasing greatly, so that even if x 
had been high Alcoa's market share would have been
maintained).
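
Let

$$\varepsilon = -\frac{D'p}{D} = -\frac{P}{P'q}$$

be the elasticity of demand. Equation 1.7 can be rewritten
as

$$\frac{p - c}{p} = \frac{1}{\varepsilon}\left(\frac{1 - x}{1 - \delta x - x'P'q}\right).$$

Since $\delta < 1$, $x < 1$, $x' > 0$, $P' < 0$, and $(p - c) > 0$,

$$\frac{p - c}{p} < \frac{1}{\varepsilon}.$$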


Therefore, in this situation, the relative profit margin 
(Lerner index) is lower than the one chosen by a monopolist in 
an industry without recycling ($1/\varepsilon$). In fact, in the long run,
consumers benefit from the existence of recycling. The 
following can also be shown: 


• An improvement in the recycling technology diminishes
the monopoly rent.³² Indeed, when the recycling
technology is very inefficient, almost none of the alumi- 
num is recycled, and the monopoly realizes its static 
monopoly profit in every period. 

• Although consumers benefit from the existence of recycling
in the long run, they take a loss when it is first introduced.
Suppose that until period 2 there is no recycling. Compare
market prices in period 1 depending on whether or
not there will be a recycling industry starting in period 2.
If the monopolist anticipates the future existence of recycling,
he reduces his output relative to that of the static
optimum (given by $(p - c)/p = 1/\varepsilon$) so that he will not
“nurture” future competition. Therefore, prices are higher 
during period 1 for consumers. In the longer run, recy- 
cling increases the supply of the product, and—in spite 
of the initial reduction in monopoly output—the price 
decreases. 

• Growth in the aluminum market increases the monopolist’s
profit margin. (Here, we assumed that the market
was stationary.) The basic idea is that during a period of 
market growth, recycled aluminum—which depends on 
the lower past demand but not on the higher present 
demand—obtains a lower market share. 
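
The stationary condition (equation 1.7) can also be solved numerically for a particular case. The sketch below assumes the linear inverse demand P(q) = 1 - q and the recycling rule x(p) = p/(1 + p). Both are illustrative choices, not taken from the text, and the function name is hypothetical.

```python
def stationary_price(c: float, delta: float, tol: float = 1e-12) -> float:
    """Solve equation 1.7 for p under the assumed P(q) = 1 - q, x(p) = p/(1+p)."""

    def gap(p: float) -> float:
        q = 1.0 - p                      # stationary quantity, D(p) = 1 - p
        x = p / (1.0 + p)                # fraction recycled
        dx = 1.0 / (1.0 + p) ** 2        # x'(p)
        slope = -1.0                     # P'(q)
        # Left side minus right side of equation 1.7.
        return (p - c) * (1.0 - delta * x - dx * slope * q) + slope * (1.0 - x) * q

    lo, hi = c, 1.0                      # gap(lo) < 0 < gap(hi) when c < 1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


c, delta = 0.2, 0.9
p = stationary_price(c, delta)
lerner_index = (p - c) / p
inverse_elasticity = (1.0 - p) / p       # 1/epsilon for D(p) = 1 - p
static_monopoly_price = 0.5 * (1.0 + c)  # price without recycling

print(p, static_monopoly_price)
print(lerner_index, inverse_elasticity)
```

In this example the stationary price lies between marginal cost and the static monopoly price, and the Lerner index falls short of the inverse elasticity, as the inequality above requires.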


For an analysis of the decomposition of the technology 
of recycling of raw materials and a study of recycling 
proper and of the effects of vertical integration, see Martin
1982.


32. The monopolist may benefit from an improvement in the recycling technology
if its production cost c is high and the consumers are able to obtain a
substantial part of the rent associated with recycling (in contrast with the
assumption made above). Recycling then makes the good more desirable to the
consumers. See, however, the discussion of intertemporal price discrimination
below.


1.5.2 Durable Goods and Intertemporal Price
Discrimination


Assume now that the consumers can enjoy a certain durable
good during several periods. The price at which
consumers are willing to buy the good today then depends
on their expectation of the price at which they will
be able to buy it tomorrow, because today’s purchases are 
an (imperfect) substitute for tomorrow's. We first set up a 
simple two-period model, which illustrates the main ideas. 
We show that, whenever feasible, the manufacturer of the 
durable good prefers to lease it rather than to sell it. 
We then consider the general issue of intertemporal price 
discrimination and its most extreme form, the so-called 
Coase conjecture. This conjecture (now a result) asserts 
that the producer of an infinitely durable good loses all 
his monopoly power when the period between his price 
adjustments converges to zero. This result must be qual- 
ified by the fact that, in many situations, the monopolist 
may be able to recoup some of his monopoly power, as 
we will see. We then consider the implications of inter- 
temporal price discrimination for the monopolist’s choice 
of the durability of the good. 


1.5.2.1 Leasing versus selling 


When a good (e.g., a computer or a photocopying ma- 
chine) is durable, its producer has the option to lease or to 
sell.³³ We will consider here the idea that a monopolistic
producer of a durable good prefers leasing to avoid the 
problems of intertemporal credibility associated with sell- 
ing. This idea can be illustrated with the help of a very 
simple model. There are two periods: t = 1, 2. The good 
produced and used during period 1 may be used again in 
period 2, with no depreciation. For simplicity, assume that 
after period 2 the good becomes obsolete (is replaced by 
a new product), and hence there is no demand for it. The 
model and the ideas can easily be generalized to the case 
in which the good does not become obsolete. To simplify 
the calculation, assume that the cost of producing this 
good is zero, so that the monopolist can produce as much 
as he wishes in each period without incurring any cost. 
The monopolist and the consumers have discount factor
$\delta = 1/(1 + r)$, where $r$ is the interest rate. The consumption
(utilization) demand for this good in each period is
$D(p) = 1 - p$.


The monopolist has two options: (1) to lease the good 
in each period and (2) to sell in each period. In the latter 
case, a resale market exists in which the good bought 
during the first period may change hands in the second 
period. In each period, the owners of the good may lease 
it to other consumers if they wish. 

Compare the two options: 

(1) Suppose that the monopolist decides to lease. His
price in each period $t$ maximizes $p_t D(p_t)$. The monopolist
charges $p_1 = p_2 = 1/2$. Then he produces $q_1 = 1/2$ during
period 1 and $q_2 = 0$ during period 2 (since there is no
depreciation). The present discounted value of his intertemporal
profit is


$$\Pi^{l} = \frac{1}{4} + \frac{\delta}{4} = \frac{1}{4}(1 + \delta).$$


(2) Suppose that the monopolist decides to sell. The
quantity sold during period 1 is “re-offered”³⁴ on the
market during period 2. Having sold quantity $q_1$ in period
1, the monopolist chooses to sell quantity $q_2$ (which maximizes
his profit) during period 2. The accompanying
price, $p_2$, is that for which the total quantity offered
($q_1 + q_2$) is equal to the quantity demanded; that is,
$p_2 = 1 - q_1 - q_2$. Therefore, to maximize his profit, the
monopolist chooses $q_2$ in order to solve


$$\max_{q_2} \; q_2 (1 - q_1 - q_2).$$


From this, we determine $q_2 = (1 - q_1)/2$. The profit in
period 2 is then $(1 - q_1)^2/4$.

Now let us examine the first period. The price that the
buyers of the durable good are willing to pay (whether
they will use it themselves or lease it) depends on their
expectation of the market price during period 2. Let $p_2^a$ be
this expected price. The consumers are willing to pay
$(1 - q_1) + \delta p_2^a$, since the current rental price is $1 - q_1$.
Therefore, we have


$$p_1 = (1 - q_1) + \delta p_2^a.$$


To complete the model, we assume that the consumers
anticipate correctly the price charged in period 2: $p_2^a = p_2$.
From this, knowing $q_1$, they expect the producer to supply
$q_2 = (1 - q_1)/2$ during period 2 with a corresponding
price:


$$p_2^a = 1 - q_1 - q_2 = \frac{1 - q_1}{2}.$$


Therefore, we derive


$$p_1 = (1 - q_1) + \delta\,\frac{1 - q_1}{2} = (1 - q_1)\left(1 + \frac{\delta}{2}\right).$$


33. In certain cases, the leasing option is less attractive or not relevant at all.
For example, the leasing market for cars is rather small relative to the sales
market because of both moral hazard (consumers take little care of the car) and
potential adverse selection problems (the market would tend to attract the less
careful drivers).

34. To say that the good is reoffered does not imply that it will necessarily
change hands. A consumer and owner of the good during period 1 is prepared
to lease it to another consumer during period 2 when the market price exceeds
the value he himself attaches to the good.


Note, in particular, that the quantity demanded at price $p_1$
is lower than in the case where the monopolist would
commit himself not to produce during period 2 (in this
case, $p_1 = (1 - q_1)(1 + \delta)$). Note also that the first-period
price necessarily exceeds the second-period one. Then the
monopolist chooses $q_1$ to maximize his intertemporal profit:


$$\Pi^{s} = \max_{q_1}\left[\, q_1 (1 - q_1)\left(1 + \frac{\delta}{2}\right) + \delta\,\frac{(1 - q_1)^2}{4} \right].$$


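It is left to the reader to check that

$$q_1 = \frac{2}{4 + \delta},$$

that

$$p_1 = \frac{(2 + \delta)^2}{2(4 + \delta)} < \frac{1 + \delta}{2},$$

and, more important, that the profit from selling falls short of
the profit from leasing: $\Pi^{s} < \Pi^{l}$. Because of this last
inequality, the monopolist prefers leasing.³⁵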
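
The check left to the reader can also be done by machine. The following is only a minimal sketch, assuming the two-period model above (demand $1 - p$, zero cost); the function names and the three illustrative discount factors are choices made here, not part of the text. It uses exact rational arithmetic so that the closed forms can be compared without rounding.

```python
from fractions import Fraction


def lease_profit(delta):
    """Leasing: the static monopoly profit 1/4 is earned in both periods."""
    return Fraction(1, 4) * (1 + delta)


def sale_profit(q1, delta):
    """Seller's discounted profit when q1 units are sold in period 1.

    Period 2 is then played optimally (q2 = (1 - q1)/2) and buyers
    anticipate the resulting second-period price correctly.
    """
    q2 = (1 - q1) / 2                 # second-period best response
    p2 = 1 - q1 - q2                  # second-period market price
    p1 = (1 - q1) + delta * p2        # first-period price paid by buyers
    return q1 * p1 + delta * q2 * p2


def sale_outcome(delta):
    """Closed forms quoted in the text for the selling case."""
    q1 = Fraction(2) / (4 + delta)
    p1 = (2 + delta) ** 2 / (2 * (4 + delta))
    return q1, p1, sale_profit(q1, delta)


# Illustrative discount factors (hypothetical values chosen for the demo).
for delta in (Fraction(1, 2), Fraction(9, 10), Fraction(1)):
    q1, p1, profit_sale = sale_outcome(delta)
    step = Fraction(1, 1000)
    # The objective is concave in q1, so beating both neighbours
    # confirms that q1 = 2/(4 + delta) is the maximizer.
    assert sale_profit(q1, delta) > sale_profit(q1 + step, delta)
    assert sale_profit(q1, delta) > sale_profit(q1 - step, delta)
    print(f"delta={delta}: q1={q1}, p1={p1}, "
          f"sale={profit_sale}, lease={lease_profit(delta)}")
```

For every discount factor tried, the selling profit stays below the leasing profit, as the inequality in the text asserts.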


1.5.2.2 The Coase Problem 


Why does selling create a problem for the monopolist?
The reason is that the consumers (or investors) are not
ready to pay a high price for the good during period 1,
knowing that the monopolist can “flood the market” in
period 2, which, in effect, he does. (Equivalently, the
monopolist could make the good bought in period 1
obsolete by introducing a new model.) To simplify, consider
the case in which the linear demand curve stems
from a continuum of consumers with unit demands and
per-period willingness to pay in $[0, 1]$. Suppose that the
monopolist charges the monopoly price $(1 + \delta)/2$ in
the first period and that, naively, those consumers
whose per-period willingness to pay exceeds $1/2$ buy. In the
second period, the monopolist faces a residual demand
$D(p) = 1/2 - p$ from those consumers whose per-period
willingness to pay is lower than $1/2$. This tempts the
monopolist to lower his price (to $1/4$ in this case). Thus,
in retrospect, some consumers with willingness to pay
near $1/2$ would have wanted to refrain from buying in
the first period. For instance, a consumer with willingness
to pay equal to $1/2 + \varepsilon$, where $\varepsilon$ is small and positive,
enjoys a surplus of $\varepsilon(1 + \delta)$ if he buys naively and
a surplus of $\delta(1/4 + \varepsilon) > \varepsilon(1 + \delta)$ if he waits. Therefore,
the prospect of a price adjustment tomorrow changes
the monopolist’s demand curve today. Facing a lower
demand under perfect foresight than under naive expectations,
the monopolist is forced to charge a lower first-period
price. Here we have the phenomenon of intertemporal
price discrimination. In equilibrium, only the high-valuation
consumers buy in the first period at the high
price; their high surplus for the good leads them to purchase
then rather than to wait for a low price. Consumers
with an intermediate valuation buy at the low price in the
second period. Those with a low valuation do not buy
at all.
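
The comparison between buying naively and waiting can be reproduced with a few lines of arithmetic. This is a sketch only: the values $\delta = 9/10$ and $\varepsilon = 1/100$ are illustrative assumptions, and the helper names are hypothetical.

```python
from fractions import Fraction


def surplus_buy_now(v, delta, p1):
    """Buy in period 1: two periods of use, the second one discounted."""
    return v * (1 + delta) - p1


def surplus_wait(v, delta, p2):
    """Wait and buy in period 2: one period of use, discounted."""
    return delta * (v - p2)


def residual_monopoly_price(cutoff):
    """Price maximizing p * (cutoff - p): residual demand, zero cost."""
    return cutoff / 2


delta = Fraction(9, 10)            # illustrative discount factor (assumption)
eps = Fraction(1, 100)             # illustrative "small" epsilon (assumption)
p1 = (1 + delta) / 2               # first-period monopoly price
p2 = residual_monopoly_price(Fraction(1, 2))   # ex post price cut
v = Fraction(1, 2) + eps           # consumer just above the naive cutoff

naive = surplus_buy_now(v, delta, p1)     # equals eps * (1 + delta)
patient = surplus_wait(v, delta, p2)      # equals delta * (1/4 + eps)
print(naive, patient)
```

In this naive scenario the two surpluses are equal at $\varepsilon = \delta/4$, so every consumer with per-period valuation below $1/2 + \delta/4$ would rather have waited for the price cut.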

The monopolist suffers from the consumers’ rational
belief that he will flood the market. This problem takes an
extreme form in the following setting: Suppose that both
the monopolist and the consumers are infinitely lived and
that the good is infinitely durable. The consumers have
unit demands. Each consumer's valuation now represents
the present discounted value of the services brought by
the good from the date of purchase on. Assume that the
consumers’ valuations are distributed on $[c, +\infty)$ according
to some smooth density function, where $c$ is the unit
production cost of the good. (Consumers with valuations
under $c$ are irrelevant because the monopolist will never
charge a price under $c$, as can easily be shown.) Let
$\delta = e^{-r\Delta}$, where $r$ is the rate of interest and $\Delta$ is the length
of time between the price adjustments. The Coase conjecture
(Coase 1972), proved formally by Bulow (1982) and
Stokey (1981) for particular demand functions and equilibria
and by Gul, Sonnenschein, and Wilson (1986) for


more general demand structures,³⁶ states that when $\Delta$
tends to zero the intertemporal profit tends to zero. In
other words, a monopolist who can change his price very
quickly (as would be expected) loses his monopoly power
completely. In equilibrium, consumers expect him to
charge prices close to the competitive price $c$ at any future
instant and, as they can wait for the next offer without
much delay cost, they cannot be induced to accept higher
prices. Thus, the monopolist ends up charging prices
close to the competitive price, vindicating the consumers’
belief.


35. This result is contingent on the absence of threat of entry. Bucovetsky and
Chilton (1986) and Bulow (1986) show that a monopolist would sell some if he
tried to deter a rival's entry. A sale lowers the potential demand for the entrant,
whereas leasing does not commit customers to stay with the incumbent firm in
case of entry. This idea is somewhat related to the market-foreclosure issue
discussed in the supplementary section of chapter 4.

The appendix to this chapter gives a heuristic proof of 
the Coase conjecture; exercise 1.8 explores the mechanics 
of the reasoning in a simple case. 


Exercise 1.8**³⁷ Both the monopolist and the consumers
are infinite-lived. The unit production cost is 0. The consumers’
valuations, $v$, are uniformly distributed on
$[0, 1/(1 - \delta)]$ (which amounts to saying that the per-period
valuation is uniformly distributed on $[0, 1]$). A consumer
with valuation $v$ has utility $\delta^t (v - p_t)$ if he buys at date
$t$ at price $p_t$, where $\delta$ is the discount factor. The monopolist's
intertemporal profit is


$$\sum_{t} \delta^t p_t q_t,$$


where $q_t$ is the quantity sold (the number of consumers
who buy) at time $t$. Look for a linear and stationary
equilibrium: When facing at some date a price $p$, any
consumers with valuations exceeding $w(p) = \lambda p$ buy
and consumers with lower valuations do not buy, where
$\lambda > 1$. Conversely, if at some date consumers with
valuations exceeding $v$ have bought and the others
have not, the monopolist charges $p(v) = \mu v$, where
$\mu < 1$.

(i) Compute the monopolist’s intertemporal payoff from
date $t$ on when only consumers with valuations below $v$
remain and the monopolist charges $p_t, p_{t+1}, \ldots$ and the
consumers follow their linear rule.


(ii) Show that the monopolist’s optimization over $p_t$
leads to a linear rule, where $\mu$ is given (implicitly) as a
function of $\lambda$ by


$$1 - 2\lambda\mu + \delta\lambda^2\mu^2 = 0.$$


(iii) Write consumer $w(p)$’s indifference equation to
obtain


$$(\lambda - 1) = \delta\lambda(1 - \mu).$$


(iv) Conclude that when $\delta$ tends to 1, the monopolist’s
profit tends to zero.
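
A numerical sketch may help in checking an answer to this exercise. It is not the book's solution: the closed forms $\lambda = 1/\sqrt{1-\delta}$ and $\mu = \sqrt{1-\delta}/(1+\sqrt{1-\delta})$ used below were obtained here by solving the two conditions in (ii) and (iii), and the code assumes a unit mass of consumers. Profit is reported relative to the profit under commitment to the constant price $1/[2(1-\delta)]$, which removes the dependence on how valuations are scaled.

```python
import math


def linear_equilibrium(delta):
    """Candidate (lam, mu) solving conditions (ii) and (iii), for 0 < delta < 1."""
    s = math.sqrt(1.0 - delta)
    return 1.0 / s, s / (1.0 + s)


def residuals(delta):
    """Left-hand side minus right-hand side of conditions (ii) and (iii)."""
    lam, mu = linear_equilibrium(delta)
    foc = 1.0 - 2.0 * lam * mu + delta * lam ** 2 * mu ** 2
    indifference = (lam - 1.0) - delta * lam * (1.0 - mu)
    return foc, indifference


def simulated_profit(delta, periods=10_000):
    """Discounted profit obtained by playing the linear rules period by period."""
    lam, mu = linear_equilibrium(delta)
    density = 1.0 - delta              # unit mass on [0, 1/(1 - delta)] (assumption)
    v = 1.0 / (1.0 - delta)            # highest valuation still in the market
    total, discount = 0.0, 1.0
    for _ in range(periods):
        p = mu * v                     # monopolist's price rule
        cutoff = lam * p               # consumers above the cutoff buy
        total += discount * p * (v - cutoff) * density
        v = cutoff
        discount *= delta
    return total


def commitment_profit(delta):
    """Profit from committing to the constant price 1/[2(1 - delta)]."""
    return 1.0 / (4.0 * (1.0 - delta))


for delta in (0.5, 0.9, 0.99, 0.9999):
    ratio = simulated_profit(delta) / commitment_profit(delta)
    print(f"delta={delta}: profit relative to commitment = {ratio:.4f}")
```

The printed ratio falls toward zero as $\delta$ approaches 1, which is the sense in which the monopolist's profit vanishes in part (iv).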


1.5.2.3 Evading the Coase Problem 


The credibility issue drives the durable-good monopolist’s
profit to zero in the Coase conjecture. Although this
problem is a serious one (for the monopolist, not the
consumer), there exist many reasons why a durable-good
monopolist can in practice make a profit. We now investigate
these reasons.


+ As we have seen, leasing (or renting) allows the monopolist
to keep clear of the Coase problem. The intuition
is that the good is implicitly returned to the manufacturer.
When flooding the market, the manufacturer puts pressure
on the price of his own good but not on the quantity
owned by the consumers (as in the selling case). The
monopolist thus realizes the static monopoly profit in
each period. It is interesting in this respect that the U.S.
government has required the dominant firm(s) in some
industries (computers, copiers, and shoe machinery) to sell
rather than rent.

Leasing, however, may create some serious hazards 
that were not formalized in the model. If the consumers’ 
consumption mode (maintenance, care, etc.) matters, the 
monopolist must be able to monitor at the end of each 
period the exact condition of the good. Such a monitoring 
technology, however, may be extremely costly, and leas- 
ing may lose its virtues. This is one reason why auto- 
mobiles are more commonly sold than rented. 


36. See also exercise 1.8 (which is inspired by Sobel and Takahashi’s [1983]
treatment of exponential demand curves). See Fudenberg et al. 1985 for the
derivation of the Coase conjecture when the buyers’ valuations are bounded
away from the monopolist’s production cost (in this case, the Coase conjecture
says that the monopolist sells at a price close to the buyers’ lowest valuation).
Gul et al. (1986) consider the (often more realistic) case in which the buyers’
lowest possible valuation is lower than the production cost. Their derivation of
the Coase conjecture assumes that the buyers’ decision on whether to accept a
price depends only on this price and not on the history of the market. Ausubel
and Deneckere (1986) challenge this assumption and show that many outcomes
(including, possibly, outcomes near the monopoly one) may be obtained once
it is dropped. See Ausubel and Deneckere 1987 and Gul 1987 for discussions
of the durable-good problem in the context of oligopolies.


37. This exercise follows Sobel and Takahashi 1983.

Leasing may face other hazards when the customers
are not anonymous (and anonymity cannot be restored
through the transfer of the good between customers).
Then, within a given period, the monopolist can discriminate
between customers on the basis of their past
consumption. Customers who have leased in the past
have signaled a high willingness to pay for the good, and
therefore should be charged high rental rates. This reduces
the demand for the good considerably in the early
periods of the relationship between a customer and the
seller. Indeed, it can be shown that in the absence of
buyer anonymity the monopolist does even worse when
leasing than when selling: $\hat{\Pi}^{l} < \Pi^{s}$ (where the sale profit
$\Pi^{s}$ is not affected by whether the customer is anonymous,
because once the good is sold no discrimination is feasible
any more).³⁸ In such a situation, it seems that the seller
would have an incentive to at least mimic a sale contract
by offering a long-term lease at guaranteed prices. This
would protect the customer against future price discrimination
based on his current consumption, and would
raise current demand. In this respect, Hart and Tirole
(1987) prove that if the two parties can sign a long-term
contract (which is enforced if either of the parties wants it
to be enforced, but can be renegotiated if both parties find
it advantageous to do so),³⁹ the market organization with
long-term leasing contracts is the same as that for the sale
of a durable good without commitment. That is, everything
is as if the good were sold to the customer. The
price and consumption dynamics are as described in the
subsections on “leasing versus selling” and “the Coase
problem,” and the monopolist’s profit is equal to $\Pi^{s}$.


Now assume that leasing is impossible (say, because of 
moral hazard on the consumer side). We will see that in a 
number of circumstances the monopolist can evade the 
Coase problem, at least partially. 


+ Notice first that the monopolist can achieve the same
(optimal) profit as under leasing if, at date 1, he can
commit himself credibly to a sequence of prices. Assume in
our two-period example that he announces $p_1 = (1 + \delta)/2$
and $p_2 \geq 1/2$; that is, he quotes the intertemporal monopoly
price in the first period and commits himself not
to lower the price in the second. The quantities bought
are $q_1 = 1/2$ and $q_2 = 0$, and consumers are indeed willing
to pay $(1 + \delta)/2$ in the first period. Hence, the monopolist’s
profit is


$$\Pi^{m} = (1 + \delta)/4 = \Pi^{l}.$$


An important lesson of the previous analysis is that commitment
must be credible. In the second period the
residual demand facing the monopolist is $q_2 = 1/2 - p_2$.
Thus, as before, the monopolist would ex post wish to
reduce the price under $1/2$. But, if this were possible, the
consumers would then refrain from buying in the first
period. Thus, ex ante the monopolist would actually be
hurt by his ex post flexibility. This result is actually much
more general: An economic agent can always do at least as well when
he can commit himself as when he cannot. This is because,
under commitment, he can always duplicate what he does
under no commitment. For instance, in this case he could
announce at date 1 the two prices that would prevail
under noncommitment. The consumers’ behavior would
be unaffected, because we assumed that under noncommitment
they have rational expectations. This simple
paradox (that one generally gains by imposing self-constraints)
is an important phenomenon in industrial
economics. We will encounter it again, for instance, in
chapter 8.
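
Both claims in this passage (commitment attains the leasing profit, and the monopolist can always duplicate his no-commitment prices) can be illustrated numerically. The sketch below assumes the unit-demand interpretation used earlier, with per-period valuations uniform on $[0, 1]$; the grid search, the grid size, the value $\delta = 0.9$, and all function names are illustrative choices made here.

```python
def quantities_under_commitment(p1, p2, delta):
    """Purchases when the price pair (p1, p2) is announced and believed.

    A consumer with per-period valuation v buys in period 1 only if
    v*(1 + delta) - p1 is at least max(delta*(v - p2), 0).
    """
    v_now_vs_wait = p1 - delta * p2       # buying today beats waiting
    v_now_vs_never = p1 / (1 + delta)     # buying today beats never buying
    v1 = min(1.0, max(v_now_vs_wait, v_now_vs_never))
    q1 = 1.0 - v1
    q2 = max(0.0, v1 - p2)                # remaining consumers with v >= p2
    return q1, q2


def profit_under_commitment(p1, p2, delta):
    q1, q2 = quantities_under_commitment(p1, p2, delta)
    return p1 * q1 + delta * p2 * q2


def best_commitment(delta, steps=200):
    """Coarse grid search over announced price pairs (illustrative only)."""
    best = (-1.0, None, None)
    for i in range(steps + 1):
        p1 = (1 + delta) * i / steps
        for j in range(steps + 1):
            p2 = j / steps
            profit = profit_under_commitment(p1, p2, delta)
            if profit > best[0] + 1e-12:  # keep the first maximizer found
                best = (profit, p1, p2)
    return best


def no_commitment_prices(delta):
    """Equilibrium prices of the selling case derived earlier in the text."""
    q1 = 2 / (4 + delta)
    return (1 - q1) * (1 + delta / 2), (1 - q1) / 2


delta = 0.9                                # illustrative value (assumption)
print(best_commitment(delta))              # close to ((1 + delta)/4, (1 + delta)/2, 1/2)
print(profit_under_commitment(*no_commitment_prices(delta), delta))
```

Announcing the no-commitment prices reproduces the selling profit exactly, so the commitment optimum, which is higher, can never be worse.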


Remark The fact that in the optimal commitment price 
pattern the price decreases over time is an artifact of our 
assumption that the good becomes obsolete after two 
periods. Actually, one can see that the price per period of 
utilization (where the second period is discounted) is con- 
stant and equal to the static (per period) monopoly price. 
This result is very general. Exercise 1.9 demonstrates the 
result for a good that never becomes obsolete. 


Exercise 1.9** Consider the framework of exercise 1.8
(infinite-lived consumers and monopolist and an infinitely
durable good). In contrast with exercise 1.8, assume that
the monopolist commits himself to a sequence of prices


$$(p_1, p_2, p_3, \ldots).$$


(i) Show that, when searching for the optimal price
policy, the monopolist restricts himself to sequences satisfying
$p_1 \geq p_2 \geq p_3 \geq \cdots$.

(ii) Set up the monopolist’s optimization problem and
derive the first-order conditions.

(iii) Show that an optimal sequence is
$p_1 = p_2 = p_3 = \cdots = p^m = 1/[2(1 - \delta)]$.


38. See Hart and Tirole 1987. Another result is that when the relationship is
sufficiently long and the discount factor “not too small” (but not necessarily
close to unity) the monopolist loses all power to discriminate; with two potential
per-period valuations for the customer, the monopolist keeps charging the
lower of the valuations until close to the end of the horizon.


39. The concept of mutually advantageous renegotiation was introduced by
Dewatripont (1985).

In practice, there are several ways for the monopolist 
to commit himself: 


+ A possibility rarely practiced is for the monopolist to 
put into escrow with a third party (an “arbitrator”) a suffi- 
cient amount of money, with the clause that, if in the 
future he produces more than the specified quantity or 
charges less than the specified price, he will lose this 
money to the third party.⁴⁰

+ The monopolist’s reputation can enter into consideration 
when the relationship between the producer and the consumers
is long-lived. For example, De Beers, the diamond
monopoly, has the reputation of refusing to allow prices 
to fall. 

+ To commit himself not to add to the existing stock of 
the good tomorrow, the monopolist may destroy his fac- 
tory after today’s production (if he is incapable of rebuild- 
ing another one at little expense). For instance, an artist 
can destroy the limestone used to create a lithograph. 
(Another strategy would be to number the lithographs 
and indicate the total number produced.) 


+ In a less extreme way, the existence of increasing marginal
production costs (decreasing returns to scale) prevents
the monopolist from flooding the market too fast (Kahn
1986); thus, increasing costs allow the monopolist to
commit himself not to cut prices in a Coasian fashion.⁴¹

+ The monopolist can offer a money-back guarantee (sometimes
called the “most-favored-nation” clause) if he lets
the price of the good fall. In the above model, he can
charge $p_1 = (1 + \delta)p^m$ in the first period (where $p^m = 1/2$)
and commit himself to reimburse $p^m - p_2$ to his first-period
customers if the second-period price $p_2$ falls under
$p^m$. Formally, everything is as if consumers paid


$$(1 + \delta)p^m - \delta(p^m - p_2) = p^m + \delta p_2,$$


that is, as if they were charged the one-period monopoly
price in the first period and $p_2$ in the second period.
Hence, the monopolist has an incentive to charge $p_2 = p^m$
in the second period, that is, not to produce in the
second period if the good is perfectly durable. Thus, the
monopolist is, de facto, able to commit himself not to
lower the market price, and is thus able to realize the
monopoly profit. The intuition for this result is that the
monopolist reimburses the consumers for any capital loss
his opportunistic second-period behavior would inflict
relative to the promised (commitment) behavior. Thus he
totally internalizes the consumers’ concern about price
reductions.

Such price protection clauses may be hard or costly to 
enforce. The seller must be unable to give secret price 
discounts to new customers. (In May 1963, when General 
Electric announced that if it were to lower the prices of its 
turbine generators it would grant retroactively reductions 
to customers who had purchased them in the preceding 
six months, it also hired a public accounting firm to audit 
its compliance with the price-protection policy upon the 
request of any customer.) Furthermore, the seller must be 
unable to affect quality. Thus, price-protection policies are
difficult to enforce in industries where goods are tailored
to the customer. In such an industry, a price cut can be
disguised as a quality improvement. That is, the good
must be a well-specified and standard one to allow price
protection.⁴²

+ The durable-good monopolist may have an opportunity
cost of staying in this market. Suppose that, for some
reason,⁴³ the monopolist produces the good at the date
of supply (i.e., cannot produce everything at date 0), and
that production at each date involves a fixed cost independent
of the scale of production. (Alternatively, one
could think of a fixed marketing cost, or of the opportunity
cost of the managers’ time, i.e., the profit that
could be realized if the firm were to make another product
instead.) The monopolist keeps on producing the good as
long as the current profit associated with it exceeds the
fixed cost. This implies that the Coase reasoning breaks
down: If either the price or the quantity bought were
to tend to zero at some point in time, the monopolist
would exit the market. This, in turn, puts pressure on
the buyers to buy before the good disappears from the
market.⁴⁴


40. This scheme would not necessarily “work,” because ex post (in the second
period, say) the third party and the monopolist would have an incentive to
renegotiate their contract. The third party, knowing that he would not receive
the money in escrow if he did not renegotiate, would be willing to accept a
small bribe to renegotiate and let the monopolist lower his market price.


41. See Moorthy 1980 for a model in which the monopolist’s capacity constraint
is unknown to consumers.


42. This is hardly the case for turbine generators. However, General Electric
also published a price book containing fixed and simplified pricing formulas.
The price of a turbine generator was to be computed according to the book
(depending on the specifications of the various parts) and then multiplied by a
uniform multiplier. The price protection then applied to a single price: the
multiplier. For more on pricing in the electrical industry see Sultan 1975.


43. For instance, the monopolist could be capacity constrained, or storage
might be expensive.


+ The buyers may not be informed about the exact mar- 
ginal production cost of the durable-good monopolist. 
Even under the conditions of the Coase conjecture, this 
allows the monopolist to realize some profits when he has 
a low production cost—he can always duplicate the pric- 
ing strategy that he would use if he were to have a high 
production cost, and earn at least the corresponding cost 
savings on the units sold. 

An interesting possibility in this respect, analyzed by 
Vincent (1986) and Evans (1987), is that the producer 
may signal quality through delays. To be more precise, 
suppose that the producer is either a high-quality pro- 
ducer or a low-quality one. Buyers value quality, but they 
do not know it before purchasing. A high-quality seller 
faces a higher marginal cost of production than a low- 
quality one, which implies that a low-quality seller is 
more impatient to sell (his margin is higher at any given 
price). This suggests that in equilibrium the seller of a 
high-quality good delays trade in order to “demonstrate” 
that he is a high-quality seller. Vincent and Evans confirm 
this intuition and show that a non-negligible delay arises 
even for rapid price offers. (The reader will find it easier 


to understand this result after studying Akerlof’s lemons 
model in chapter 2.⁴⁵)


+ Last, there may be a constant inflow of new customers. 
The inflow of new customers at each instant raises the 
demand curve (normalized by the number of customers) 
because the consumers who wait longer to buy are those 
with the lowest valuations. Thus, “on average,” the old 
customers in the market (those who were around, but 
have not bought yet) have lower valuations than the new 
customers. In a sense, the existence of new customers 
reduces the monopolist's propensity to cut his price. Conlisk,
Gerstner, and Sobel (1984) show that the constant
inflow of new customers leads to “price cycles”: The
monopolist, from time to time, offers a sale to cater to the
existing stock of low-valuation buyers. For a few periods,
he charges high prices so as to extract the surplus of high-valuation
buyers, until the proportion of low-valuation
buyers in the sample of unserved customers becomes so
high that he cannot resist selling to these customers at
a lower price. This sale temporarily reduces the proportion
of low-valuation buyers among potential customers,
and the monopolist starts charging high prices again.⁴⁶


Which of these many factors mitigating the Coase re- 
sult play a role in practice depends on the industry under 
consideration. However, in general, the above analysis 
suggests that, although the intertemporal credibility of 
the pricing behavior is a serious problem facing durable- 
good monopolists, one should not expect that it forces 
them to charge the competitive price and make no profit. 
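
Of the devices listed above, the money-back guarantee lends itself to a quick numerical check in the two-period example. The sketch below is illustrative only: it assumes that $q_1 = 1/2$ units were sold in the first period at $(1 + \delta)p^m$ with $p^m = 1/2$, zero production cost, and a refund of $p^m - p_2$ per first-period unit; the function names and the price grid are hypothetical.

```python
def second_period_profit(p2, q1=0.5, p_m=0.5, rebate=True):
    """Period-2 profit given first-period sales q1 and zero production cost.

    New buyers come from the residual demand (1 - q1) - p2. With the
    guarantee, every first-period buyer is refunded p_m - p2 when p2 < p_m.
    """
    new_sales = max(0.0, (1.0 - q1) - p2)
    refund = q1 * max(0.0, p_m - p2) if rebate else 0.0
    return p2 * new_sales - refund


def best_second_period_price(rebate, steps=100):
    """Grid search for the ex post optimal price (first maximizer on ties)."""
    grid = [j / steps for j in range(steps + 1)]
    return max(grid, key=lambda p2: second_period_profit(p2, rebate=rebate))


print(best_second_period_price(rebate=False))  # the seller is tempted to cut the price
print(best_second_period_price(rebate=True))   # the guarantee removes the temptation
```

With the refund, the second-period payoff equals $-(1/2 - p_2)^2$ for any $p_2 < 1/2$, so cutting the price below $p^m$ is never profitable ex post.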


1.5.2.4 Monopoly and Planned Obsolescence 


From the preceding discussion, we can easily derive a
theory of planned obsolescence. Suppose that the durability
of a good now becomes a decision variable for the
monopolist and that the consumers are informed of the
product's durability.


44. Fudenberg et al. (1987), who argue that the existence of the fixed cost
commits the monopolist not to charge low prices, show how a finite-horizon
timing can be endogenized from the existence of outside opportunities. (There
are some differences, however, with the case of an exogenously fixed horizon.)


45. To get a very rough idea of this now, suppose that a high-quality good is
valued 5 by the buyer and costs 4 to the seller and that a low-quality good is
valued 1 by the buyer and costs nothing to produce. The two qualities are
equally likely a priori. Suppose that, as in the Coase conjecture, all socially
desirable trade takes place almost instantaneously. Here this means that the two
parties trade immediately with probability 1. Because trade occurs almost
instantaneously, it must take place at a single price (more precisely, at prices
very close to each other). Furthermore, this price must exceed 4 in order for a
high-quality seller to be willing to trade. The buyer's expected surplus is then
at most


$$\tfrac{1}{2}(5) + \tfrac{1}{2}(1) - 4 = -1 < 0,$$


which is impossible, as he can refuse to trade. Hence, non-negligible delay
(inefficiency in the trading process) must occur. In chapter 2 we will see that
asymmetries of information in static markets (no sequence of trading prices)
generally imply that gains from trade are not realized (which can be viewed as
an infinite delay in trading).


46. Another way of introducing a steady-state demand for the durable good is
to assume that the good depreciates over time. (In the extreme case of full
depreciation, the good is nondurable and the firm enjoys full monopoly power
over the demand curve, i.e., is not handicapped by price commitment.) Bond
and Samuelson (1984) and Suslow (1986) analyze the problem of a durable-good
monopoly in conjunction with depreciation.


First, assume that the monopolist can commit himself 
to a sequence of prices, or that (equivalently) he can lease 
the good. The leasing interpretation actually leads to a 
straightforward derivation of the optimal durability for 
the monopolist. On this interpretation, the monopolist 
is the owner of the stock of the durable good at each 
instant. For any plan he formulates as to the intertem- 
poral evolution of the stock of the durable good, he has 
an incentive to choose durability so as to minimize the 
intertemporal production cost. Thus, conditional on this 
plan, the monopolist chooses the socially optimal (cost- 
minimizing) durability. In this sense, there is no “planned 
obsolescence”; the monopolist does not produce a good 
with an uneconomically short life so that customers have 
to make repeat purchases. Thus, we obtain Swan's optimal- 
durability result in our context (see Swan 1972 and the 
discussion of durability in chapter 2 below). 

The picture changes dramatically in the sale (non- 
commitment) case. Then, by decreasing durability, the 
monopolist reduces the quantity of the good carried over 
to the next period, increases the next period’s residual 
demand, and therefore increases the next period’s price. 
Thus, reducing durability is a way of committing oneself 
not to lower the price tomorrow, which induces con- 
sumers to buy today. (When the monopolist can commit 
himself directly, this effect is irrelevant. Hence, there is no 
point to distorting the durability decision.) We see that 
the monopolist has an incentive to choose a durability
below that which minimizes his intertemporal cost of production,
i.e., to plan obsolescence. A typical example of
an “obsolescence-planning monopolist” is the textbook 
publisher who frequently introduces revised editions. 
Doing so kills off the secondary (used-book) market and 
is thus tantamount to producing a low-durability good. 
(A slight difference with our model is that the durability 
of the good is anticipated rather than observed at the date 
of purchase.) 

The following exercise (based on Bulow 1986) makes
these ideas slightly more formal.⁴⁷


Exercise 1.10** Consider the previous two-period model
(in which the good becomes obsolete after two periods).
Introduce a constant unit-production cost of $c_1(x)$ in the
first period and $c_2$ in the second period. $x$ is the probability
that the first-period good is still usable in the
second period. Thus, if $q_1$ is the first-period production,
$x q_1$ is the number of units of the durable good still around
at date 2 before further production by the monopolist.
(Assume $c_1' > 0$, $c_1'' > 0$; and $c_1'(0)$ “small” and $c_1'(1)$ “large,”
so as to obtain an interior solution.)

(i) Show that if the monopolist can commit, he chooses
the cost-minimizing durability: $c_1'(x) = \delta c_2$.

(ii) Show that if the monopolist cannot commit, durability
is suboptimal:

$$c_1'(x) = \delta\left[c_2 + x q_1\, p_2'(x q_1)\right] < \delta c_2,$$

where $p_2(x q_1)$ is the second-period price chosen by the
monopolist given that the residual demand is equal to
demand minus existing stock ($x q_1$). Interpret.


Thus, the choice of durability is biased by the absence 
of commitment. Another bias in technological choices, 
also noted by Bulow, concerns investments. Suppose, for 
instance, that the durable-good monopolist chooses be- 
tween not investing today and facing a high marginal 
cost in all future periods, and investing so as to face a 
lower marginal cost. The monopolist may choose not to 
invest in the absence of commitment, whereas he would 
invest if he could commit. Choosing a high-marginal-cost 
technology makes possible a certain commitment not to 
flood the market in the future and may thus be profitable. 


47. See Bulow 1986 for somewhat more general conditions under which 
planned obsolescence can be obtained and for an analysis of the oligopoly 
context. For further developments on the issue of durability, see Schmalensee
1974, 1979 and Liebowitz 1982.

Answers and Hints
Exercise 1.1 


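(i) $W^c$ is equal to consumer surplus plus profit:
$$W^c = \max_p \left( \int_p^{\infty} x^{-\varepsilon}\,dx + (p - c)\,p^{-\varepsilon} \right).$$
The maximum is, of course, obtained through marginal-
cost pricing: $p = c$. This yields
$$W^c = \int_c^{\infty} x^{-\varepsilon}\,dx = \frac{c^{1-\varepsilon}}{\varepsilon - 1} \quad \text{for } \varepsilon > 1.$$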


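(ii) Welfare under monopoly, $W^m$, corresponds to price
$p^m = c/(1 - 1/\varepsilon)$. Some computations yield
$$WL = W^c - W^m = \frac{c^{1-\varepsilon}}{\varepsilon - 1}\left[1 - \frac{2\varepsilon - 1}{\varepsilon - 1}\left(\frac{\varepsilon}{\varepsilon - 1}\right)^{-\varepsilon}\right].$$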


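(iii) Tedious computations show that $WL$ is nonmono-
tonic in $\varepsilon$. Such a result is not very instructive, as the size
of the market decreases with $\varepsilon$. Next, note that
$$\frac{WL}{W^c} = 1 - K(\varepsilon),$$
where $\ln K(\varepsilon) = \ln(2\varepsilon - 1) - \ln(\varepsilon - 1) - \varepsilon\ln\varepsilon + \varepsilon\ln(\varepsilon - 1)$.
Therefore,
$$\frac{K'(\varepsilon)}{K(\varepsilon)} = \frac{2}{2\varepsilon - 1} - \ln\varepsilon + \ln(\varepsilon - 1),$$
or
$$\frac{K'(\varepsilon)}{K(\varepsilon)} = \frac{2}{2\varepsilon - 1} + \ln\left(\frac{\varepsilon - 1}{\varepsilon}\right).$$
It is easy to show that the right-hand side of this last
equation is increasing in $\varepsilon$, and therefore that it approaches its
supremum, equal to zero, as $\varepsilon \to +\infty$. Thus, $K'(\varepsilon) < 0$,
and $WL/W^c$ is an increasing function of $\varepsilon$.
Finally,
$$\Pi^m = \frac{(p^m)^{1-\varepsilon}}{\varepsilon},$$
so that
$$\frac{\Pi^m}{W^c} = \left(\frac{\varepsilon - 1}{\varepsilon}\right)^{\varepsilon}.$$
The right-hand side of this last equation is increasing in $\varepsilon$.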
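The monotonicity claims in this answer can be checked numerically. The sketch below is only an illustration: the function names and the grid of elasticities are assumptions, and the closed forms are the ones implied by the demand $q = p^{-\varepsilon}$ and the definitions of $W^c$, $W^m$, and $\Pi^m$.

```python
def K(eps):
    """Ratio W^m / W^c for the demand q = p**(-eps), with eps > 1."""
    return (2 * eps - 1) / (eps - 1) * (eps / (eps - 1)) ** (-eps)


def relative_welfare_loss(eps):
    """WL / W^c = 1 - K(eps)."""
    return 1.0 - K(eps)


def profit_share(eps):
    """Pi^m / W^c = ((eps - 1) / eps) ** eps."""
    return ((eps - 1) / eps) ** eps


# Illustrative grid of elasticities between 1.1 and 11 (an assumption).
grid = [1.1 + 0.1 * i for i in range(100)]
losses = [relative_welfare_loss(e) for e in grid]
shares = [profit_share(e) for e in grid]
```

On this grid both lists are increasing, which is what parts (ii) and (iii) assert; a grid check is of course not a proof.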
Exercise 1.2 


The monopolist charges $p^m = \bar{s}$. The quantity consumed
is equal to the number of consumers, and is the same as
under marginal-cost pricing. Monopoly pricing simply
transfers the surplus $(\bar{s} - c)$ per consumer from the con-
sumers to the firm.


Exercise 1.3 


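(i) For $p = q^{-1/\varepsilon}$,
$$\frac{dp^m}{dc} = \frac{1}{1 - 1/\varepsilon}.$$
For $p = \alpha - \beta q^{\delta}$,
$$\frac{dp^m}{dc} = \frac{1}{1 + \delta}.$$
For $p = a - b\ln q$,
$$\frac{dp^m}{dc} = 1.$$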

(The sensitivity of price to marginal cost is the same as in 
a competitive market here, in spite of the fact that the 
elasticity of demand is not equal to infinity.) 
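The three pass-through rates in part (i) can be reproduced numerically. The sketch below maximizes profit over output by ternary search and differentiates the resulting monopoly price by central differences. The parameter values, the search bounds, and all function names are illustrative assumptions; the search relies on profit being concave in output, which holds for these three demand families at the chosen parameters.

```python
import math


def monopoly_price(inverse_demand, c, lo=1e-9, hi=1e3, iters=300):
    """Price at the output maximizing (P(q) - c) * q, found by ternary search."""
    def profit(q):
        return (inverse_demand(q) - c) * q

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if profit(m1) < profit(m2):
            lo = m1
        else:
            hi = m2
    return inverse_demand((lo + hi) / 2)


def pass_through(inverse_demand, c, h=1e-2):
    """Central-difference estimate of dp^m/dc."""
    up = monopoly_price(inverse_demand, c + h)
    down = monopoly_price(inverse_demand, c - h)
    return (up - down) / (2 * h)


# Illustrative parameter values (assumptions, not taken from the text).
EPS, ALPHA, BETA, DELTA, A, B = 2.0, 10.0, 1.0, 2.0, 5.0, 1.0


def constant_elasticity(q):
    return q ** (-1 / EPS)


def power_demand(q):
    return ALPHA - BETA * q ** DELTA


def log_demand(q):
    return A - B * math.log(q)
```

With these values, `pass_through(constant_elasticity, 1.0)` should be close to $1/(1 - 1/\varepsilon) = 2$, `pass_through(power_demand, 1.0)` close to $1/(1 + \delta) = 1/3$, and `pass_through(log_demand, 1.0)` close to 1.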

(ii) Only when the elasticity of demand is constant is
the sensitivity of price to marginal cost unambiguously
related to the elasticity of demand. This can be seen from
the monopoly-pricing equation:
$$p^m = \frac{c + t}{1 - 1/\varepsilon}.$$
For general demand functions, the elasticity of demand
varies with the “size” of the market (which depends not
only on market conditions but also on the level of the
tax). Thus, even with a constant marginal cost, the supply
equation (i.e., the above first-order condition) cannot
be estimated separately from the demand equation (the
identification problem) unless one is confident that the
demand function belongs to the constant-elasticity-of-
substitution class.
Exercise 1.4

Let $R(p) = pD(p) = p^{1-\varepsilon}$. The first-order condition is
$$R'(p) - C'(D(p))D'(p) = 0.$$
The second-order condition is
$$R''(p) - C''(D(p))[D'(p)]^2 - C'(D(p))D''(p) < 0.$$
Quasi-concavity is obtained if the second-order condition
holds for any $p$ satisfying the first-order condition. That
is, substituting the first-order condition into the second-
order condition,
$$R''(p) - C''(D(p))[D'(p)]^2 - \frac{R'(p)D''(p)}{D'(p)} < 0.$$
The second term of this last inequality is negative. Hence
(using the fact that $R'(p) < 0$ from the first-order condi-
tion), it suffices that
$$\frac{R''(p)}{R'(p)} > \frac{D''(p)}{D'(p)}.$$
This is easily checked using logarithmic derivatives.

If $\varepsilon < 1$, then $R'(p) > 0$. Thus, the first-order condition
has no solution (the optimal price and the monopoly
profit are infinite).


Exercise 1.5 


$q_1 = q_2 = D(p_1 + p_2)$. Equation 1.6 can be written, for
all $i$, as


@ _(.aG) aD _ 
p+ =(522)3 , where p =p, + po. 


7 ip 
Letting $C = C_1 + C_2$ and adding the two equations yields
p42 2000 

Pop aq Op 


Exercise 1.6 


(i) The capacity constraint is not binding for small off-
peak demand. Hence, the marginal cost of production is $c$,
which leads to $MR_1(p_1^*) = c$. For peak demand, the mar-
ginal cost is $c + \gamma$. Hence, $MR_2(p_2^*) = c + \gamma$. This holds
as long as the constraint $D_1(p_1^*) \le D_2(p_2^*)$ is not binding,
i.e., as long as off-peak demand (i.e., $\lambda$) is small.

(ii) For large off-peak demand, the constraint $D_1(p_1)
\le D_2(p_2)$ is binding. The distinction between off-peak
and peak is somewhat blurred, because both types of
demand are capacity constrained. At the optimum, a small
increase in capacity is matched by a decrease in both $p_1$
and $p_2$ so that the new capacity can be used in both states
of demand. The monopolist maximizes
$$R_1(p_1) + R_2(p_2) - (c + \gamma/2)[D_1(p_1) + D_2(p_2)]$$
subject to the constraint $D_1(p_1) = D_2(p_2)$. Using the
Kuhn-Tucker conditions (or, more simply, maximizing with
respect to output instead of prices), one obtains
$$p_1\left(1 - \frac{1}{\varepsilon_1}\right) + p_2\left(1 - \frac{1}{\varepsilon_2}\right) = 2c + \gamma,$$
where $\varepsilon_1$ and $\varepsilon_2$ are the elasticities of demand. This equa-
tion together with $D_1(p_1) = D_2(p_2)$ gives $p_1$ and $p_2$ (note
that $p_1 < p_2$). For instance, for $D_1(p_1) = \alpha_1 p_1^{-\varepsilon}$ and
$D_2(p_2) = \alpha_2 p_2^{-\varepsilon}$ (where $\lambda = \alpha_1/\alpha_2 < 1$),
$$p_2 = \frac{2c + \gamma}{(1 - 1/\varepsilon)\left[1 + (\alpha_1/\alpha_2)^{1/\varepsilon}\right]}.$$
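The constant-elasticity example can be verified with a few lines of code. The sketch below computes both prices in the shared-capacity regime and checks whether that regime applies. The numerical values and function names are illustrative assumptions.

```python
def demand(alpha, p, eps):
    """Constant-elasticity demand alpha * p**(-eps)."""
    return alpha * p ** (-eps)


def shared_capacity_prices(alpha1, alpha2, eps, c, gamma):
    """Prices (p1, p2) when capacity is fully used in both periods."""
    ratio = (alpha1 / alpha2) ** (1 / eps)  # p1 / p2 implied by D1(p1) = D2(p2)
    p2 = (2 * c + gamma) / ((1 - 1 / eps) * (1 + ratio))
    return ratio * p2, p2


def off_peak_constraint_binds(alpha1, alpha2, eps, c, gamma):
    """True if the prices from part (i) would make off-peak demand exceed peak demand."""
    p1_star = c / (1 - 1 / eps)
    p2_star = (c + gamma) / (1 - 1 / eps)
    return demand(alpha1, p1_star, eps) > demand(alpha2, p2_star, eps)


# Illustrative values: alpha1/alpha2 = 0.8, eps = 2, c = 1, gamma = 0.5.
p1, p2 = shared_capacity_prices(0.8, 1.0, 2.0, 1.0, 0.5)
```

For these values the constraint binds, the two demands coincide at `(p1, p2)`, and `p1 < p2`, as noted in the answer.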


Exercise 1.7 


(i) For a formal proof, use the Pontryagin theory of 
optimal control (see, e.g., Arrow and Kurz 1970; Dixit 
1976; Intriligator 1971; Kamien and Schwartz 1981). The 
intuition behind the optimality condition is as follows: 
Raising $q(t)$ by $dq$ at time $t$ yields $MR(q(t))\,dq$ in terms of
revenue. It raises current cost by $c(\omega(t))\,dq$. It also raises
experience at all times $s > t$ by $dq$, which reduces unit
cost by $-c'(\omega(s))\,dq$. The number of units affected is $q(s)$
(one can ignore the change in $q(s)$ when using the equiv-
alent of the envelope theorem). Marginal revenue must
equal current cost minus future cost savings:
$$MR(q(t)) = c(\omega(t)) + \int_t^{\infty} c'(\omega(s))\,q(s)\,e^{-r(s-t)}\,ds = A(t).$$
(Integrate by parts, remembering that $d\omega(s)/ds = q(s)$, to obtain
$A(t) = r\int_t^{\infty} c(\omega(s))\,e^{-r(s-t)}\,ds$.)

(ii) $dA(t)/dt = r[-c(\omega(t)) + A(t)] < 0$, because current
cost is higher than average future cost. Thus, $d(MR)/dt < 0$,
or $dq/dt > 0$.


Exercise 1.8 


(i) Because the buyers’ decision function, summarized
by $w(p) = \lambda p$, is stationary, the monopolist cannot gain
by charging a price that is accepted by no buyer (he will
face the same decision problem in the next period and will
have lost one period). From the buyers’ decision function,
we know that at any given point of time $t$ there exists a
valuation $v$ such that buyers with a valuation above $v$
have already bought and other buyers have not. (This
property is actually very general; see Fudenberg et al.
1985 and Gul et al. 1986. It is just an “incentive com-
patibility constraint” for this Bayesian game.) The mo-
nopolist's intertemporal profit from date $t$ on is
$$V(v) = \max_{p_1, p_2, \ldots}\left\{(v - \lambda p_1)p_1 + \delta(\lambda p_1 - \lambda p_2)p_2 + \cdots\right\},$$
where $v \ge \lambda p_1 \ge \lambda p_2 \ge \cdots$.

(ii) Differentiating with respect to $p_1$ gives
$$v - 2\lambda p_1 + \delta\lambda p_2 = 0.$$
But $p_1 = \mu v$ and $p_2 = \mu(\lambda p_1) = \lambda\mu^2 v$. We thus obtain
$$1 - 2\lambda\mu + \delta\lambda^2\mu^2 = 0.$$
(Check the second-order conditions!)

(iii) A buyer with valuation $\lambda p$ is indifferent between
buying now and waiting one more period, so
$$\lambda p - p = \delta(\lambda p - \mu\lambda p).$$

(iv) Solving for $\lambda$ and $\mu$ gives
$$\lambda\mu = \frac{1 - \sqrt{1 - \delta}}{\delta},$$
$$\lambda = \frac{1}{\sqrt{1 - \delta}},$$
$$\mu = \frac{\sqrt{1 - \delta} - (1 - \delta)}{\delta}.$$
Notice that $\lim_{\delta \to 1} \mu = 0$.
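The closed forms in part (iv) can be checked against the two equilibrium conditions from parts (ii) and (iii). In the sketch below, `lam` is the slope of the buyers' cutoff rule and `mu` the slope of the seller's pricing rule; the function name and the values of the discount factor are illustrative assumptions.

```python
import math


def linear_equilibrium(delta):
    """Return (lam, mu) for a discount factor 0 < delta < 1."""
    root = math.sqrt(1 - delta)
    lam = 1 / root
    mu = (root - (1 - delta)) / delta
    return lam, mu


def seller_condition(lam, mu, delta):
    """Residual of 1 - 2*lam*mu + delta*(lam*mu)**2 = 0, from part (ii)."""
    return 1 - 2 * lam * mu + delta * (lam * mu) ** 2


def buyer_condition(lam, mu, delta):
    """Residual of lam - 1 = delta*lam*(1 - mu), from part (iii) after dividing by p."""
    return (lam - 1) - delta * lam * (1 - mu)


# Illustrative discount factors, approaching 1.
results = {d: linear_equilibrium(d) for d in (0.1, 0.5, 0.9, 0.99, 0.9999)}
```

Both residuals vanish (up to rounding) at every discount factor tried, and `mu` shrinks toward zero as the discount factor approaches 1.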
Exercise 1.9 

(i) A price $p_t > p_{t-k}$ for some positive $k$ will be ac-
cepted by no consumer, because a consumer could buy
at the lower price $p_{t-k}$ and enjoy the good earlier. Thus,
choosing $p_t = \min_{k > 0}(p_{t-k})$ leads to the same outcome.


(ii) A consumer with valuation $v$ is indifferent between
accepting $p_t$ and waiting for $p_{t+1}$ if and only if


$$v - p_t = \delta(v - p_{t+1}).$$


Thus, the monopolist’s optimization problem can be writ- 
ten as 


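$$\max_{\{p_t\}} \sum_{t \ge 1} \delta^{t-1} p_t \left[\frac{p_{t-1} - \delta p_t}{1 - \delta} - \frac{p_t - \delta p_{t+1}}{1 - \delta}\right],$$
where, for $t = 1$, the first fraction in brackets is replaced by $1/(1 - \delta)$.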


The first-order conditions are
$$p_1 = \tfrac{1}{2} + \delta p_2$$
and, for $t \ge 2$,
$$p_t = \frac{p_{t-1} + \delta p_{t+1}}{1 + \delta}.$$


(iii) The sequence with constant price $1/[2(1 - \delta)]$ is ob-
viously a solution to the first-order conditions (check the
concavity of the objective function). This is actually the
unique solution, as the sequence $p_t - p_{t+1} = (p_{t-1} - p_t)/\delta$
leads to negative prices if $p_{t-1} - p_t > 0$ for some time $t$.

The no-haggling result is actually more general and 
holds for any distribution of valuations, not only for the 
uniform one. 

There is a more elegant way to prove the no-haggling 
result using the theory of mechanism design. Since this 
theory is beyond the scope of this book, we preferred to 
use a more “mechanical” approach. 

For a more general model of monopoly pricing of a 
durable good under commitment, see Stokey 1979. 


Exercise 1.10 


(i) Write the firm's intertemporal profit and maximize
with respect to $p_1$, $p_2$, and $x$.

(ii) The second-period profit is
$$\Pi_2(xq_1) = \max_{p_2}\{(p_2 - c_2)(D(p_2) - xq_1)\}.$$
The envelope theorem implies that
$$\frac{\partial \Pi_2}{\partial(xq_1)} = -[p_2(xq_1) - c_2].$$
The equation in the statement of the exercise is obtained
by maximizing
$$q_1[P(q_1) + \delta p_2(xq_1)x - c_1(x)] + \delta\Pi_2(xq_1)$$
with respect to $x$ (to obtain the complete solution, max-
imize this expression with respect to $q_1$ as well).
Appendix: A Heuristic Proof of the Coase Conjecture 


The following heuristic proof of the Coase conjecture, 
which is for the advanced reader, is inspired by Wilson 
1985. 

Consider an infinite horizon, $t = 1, 2, \ldots$. It can easily
be shown that for any sequence of past prices charged by
the monopolist, $\{p_1, \ldots, p_{t-1}\}$, the seller's posterior beliefs
at the beginning of period $t$ are that the buyers with
valuations $b$ in $(0, b_t]$ have not yet purchased the good
whereas the buyers with valuations $b$ in $(b_t, +\infty)$ have
already purchased it for some $b_t$. That is, the monopolist's
posterior beliefs necessarily coincide with a truncation of
his prior beliefs (this results from the fact that a high-
valuation buyer is more impatient to buy). Following Gul
et al. 1986, assume that the buyers follow a simple, “sta-
tionary” strategy. When charged price $p_t$ at date $t$, buyers
with valuations exceeding $\beta(p_t)$ buy (if they have not yet
done so) whereas those with valuation under $\beta(p_t)$ do
not, where $\beta(\cdot)$ is an increasing function and $\beta(p_t) > p_t$
for all $p_t > 0$. (The importance of this stationarity as-
sumption was proved by Ausubel and Deneckere [1986].)
For notational simplicity, let $c = 0$ denote the monopo-
list's marginal cost and let $F(b)$ denote the cumulative
distribution of the buyers’ valuations on $[0, +\infty)$ (with
$F(0) = 0$, $F(b) > 0$ for $b > 0$, and $F(+\infty) = 1$). Last,
$\delta = \exp(-r\Delta)$ denotes the discount factor, where $\Delta$ is
the length of real time between the periods. We will be
interested in what happens when $\Delta$ tends to 0.

Because of the stationarity of the buyers’ strategy, the
monopolist’s present discounted value of profits from a
given date $t$ on depends only on the distribution of
remaining buyers, which is summarized by the “cut-off
valuation” $b_t$. Let $V(b_t)$ denote this present discounted
profit. Note that $V(\cdot)$ is a nondecreasing function, and let
$F_t = F(b_t)$ denote the proportion of buyers who have not
purchased before date $t$.

Fix a real time $\varepsilon > 0$, and let $\Delta$ converge to 0. For any
$\eta > 0$, there exist a $\Delta$ sufficiently small and a $t$ (such that
$(t + 2)\Delta < \varepsilon$) satisfying
$$F_t - F_{t+2} = F(b_t) - F(b_{t+2}) < \eta. \tag{A1}$$
That is, as the number of periods between 0 and $\varepsilon$ tends
to infinity, one can always find two consecutive periods
such that the total quantity sold during these two periods
is bounded by a given number.




Coase’s intuition was that if the profit $V(b_{\varepsilon/\Delta})$ from real
time $\varepsilon$ on were not negligible, the monopolist would have
an incentive to speed up the process by cutting price
faster. For instance, between dates $t$ and $t + 2$ he does
not sell very much, and he does not gain much from dis-
crimination either, because the real time between these
two periods is short (so the price cannot decrease much
between these periods; otherwise the buyers would wait
and no one would buy between these two periods). Thus,
in a sense, the gain from discrimination is of the second
order, whereas by making at date $t$ the offer he plans to
make at date $t + 1$ the monopolist would speed the pro-
cess by one period and obtain a first-order gain (in $\Delta$) if
$V(b_{\varepsilon/\Delta})$ were not small. To formalize this intuition, let us
write the condition under which the monopolist prefers to
charge $p_t$ at date $t$ and $p_{t+1}$ at date $t + 1$ rather than $p_{t+1}$
at date $t$ directly:
$$p_t(F_t - F_{t+1}) + \delta p_{t+1}(F_{t+1} - F_{t+2}) + \delta^2 V(b_{t+2}) \ge p_{t+1}(F_t - F_{t+2}) + \delta V(b_{t+2}). \tag{A2}$$
This is equivalent to
$$(p_t - p_{t+1})F_t - (p_t - \delta p_{t+1})F_{t+1} + (1 - \delta)p_{t+1}F_{t+2} \ge \delta(1 - \delta)V(b_{t+2}). \tag{A3}$$
By definition of $b_{t+1}$, the buyer with valuation $b_{t+1}$ is
indifferent between accepting $p_t$ and accepting $p_{t+1}$; thus,
$$b_{t+1} - p_t = \delta(b_{t+1} - p_{t+1}), \tag{A4}$$
which implies that
$$p_t - \delta p_{t+1} = (1 - \delta)b_{t+1} \tag{A5}$$
and
$$p_t - p_{t+1} = (1 - \delta)(b_{t+1} - p_{t+1}). \tag{A6}$$
Substituting equations A5 and A6 into equation A3 and
dividing by $(1 - \delta)$, we obtain
$$(b_{t+1} - p_{t+1})F_t - b_{t+1}F_{t+1} + p_{t+1}F_{t+2} \ge \delta V(b_{t+2}); \tag{A7}$$
that is,
$$b_{t+1}(F_t - F_{t+1}) - p_{t+1}(F_t - F_{t+2}) \ge \delta V(b_{t+2}). \tag{A8}$$
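The algebra linking conditions A2, A3, and A8 can be spot-checked on random inputs. In the sketch below the $V(b_{t+2})$ terms are left out, so only the price-and-quantity sides of the inequalities are compared. The function names, the sampling ranges, and the seed are assumptions made for the illustration.

```python
import random


def a2_gap(p_t, p_next, F_t, F_1, F_2, delta):
    """Left side minus right side of (A2), without the V(b_{t+2}) terms."""
    return p_t * (F_t - F_1) + delta * p_next * (F_1 - F_2) - p_next * (F_t - F_2)


def a3_lhs(p_t, p_next, F_t, F_1, F_2, delta):
    """Left side of (A3)."""
    return (p_t - p_next) * F_t - (p_t - delta * p_next) * F_1 + (1 - delta) * p_next * F_2


def a8_lhs(b_next, p_next, F_t, F_1, F_2):
    """Left side of (A8)."""
    return b_next * (F_t - F_1) - p_next * (F_t - F_2)


def random_case(rng):
    delta = rng.uniform(0.05, 0.95)
    b_next = rng.uniform(0.1, 1.0)
    p_next = rng.uniform(0.0, b_next)
    p_t = (1 - delta) * b_next + delta * p_next  # indifference condition (A4)
    F_2, F_1, F_t = sorted(rng.uniform(0.0, 1.0) for _ in range(3))
    return p_t, p_next, b_next, F_t, F_1, F_2, delta


rng = random.Random(0)
cases = [random_case(rng) for _ in range(1000)]
```

In every sampled case the A2 gap equals the left side of A3, and the left side of A3 equals $(1 - \delta)$ times the left side of A8, which is the content of the substitution step.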


But $t + 2 < \varepsilon/\Delta$ implies that $b_{t+2} \ge b_{\varepsilon/\Delta}$ and thus that
$V(b_{t+2}) \ge V(b_{\varepsilon/\Delta})$. Also, $F_{t+1} \ge F_{t+2}$. Hence, equation A8
becomes
$$(b_{t+1} - p_{t+1})(F_t - F_{t+2}) \ge \delta V(b_{\varepsilon/\Delta}), \tag{A9}$$
which implies that by choosing $\eta$ sufficiently small we can
make the seller's present discounted value of profits from
real time $\varepsilon$ on arbitrarily small. We thus conclude that we
can make the present discounted value of profits from any
time on (including times close to zero) arbitrarily small by
choosing $\Delta$ sufficiently small. The end of the “proof” of
the Coase conjecture is then routine. The profit can go to
zero only if prices tend to zero (more generally, to mar-
ginal cost). All trade takes place in “the twinkling of an
eye.”48
Chapter 2


Product Selection, Quality, 
and Advertising 


One of the functions of the production system is to select 
the commodities that are produced and sold. This selec- 
tion involves real economic choices; owing in particular 
to increasing returns to scale, only some of the potentially 
producible goods are actually manufactured. In the pre- 
ceding chapter we assumed that the set of goods pro- 
duced by the monopolist was given. In this chapter we 
relax that assumption by letting the monopolist choose a 
position in a “product space.” This is not to say that other 
monopoly decisions are entirely put aside. For instance, 
the monopolist’s choice between different qualities de- 
pends on how these qualities can be marketed, so pricing 
behavior cannot be ignored. 

This chapter starts with a brief description of the pro- 
duct space. It then discusses the choice of product by a 
monopolist, assuming that the characteristics of the pro- 
duct become known to the consumers before they pur- 
chase. Depending on the application, the monopolist may 
offer too high or too low a quality and too much or too 
little product variety relative to the social optimum. The 
monopolist has no reason to choose the optimal products; 
however, in contrast with the case of pricing behavior, 
one cannot sign the bias without further analysis of the 
consumers’ preferences and the production technology 
(section 2.2). 

Sections 2.3 and 2.4 consider goods whose character- 
istics are learned by the consumers only after purchase 
(“experience goods”). The main issues in regard to such 
goods are whether there are incentives for firms to supply 
quality and whether variables such as price and advertis- 
ing provide any (indirect) information about it. The main 
incentive to provide quality is the possibility of repeat 
purchases by consumers, which induces firms to sustain 
quality so as not to hurt their reputation and lose future 
sales. The key ideas behind signaling and reputation are 
mentioned in the body of the chapter and developed in 
the supplementary section. 


2.1 The Notion of Product Space 


As is well known, it is hard to come up with a satisfactory 
definition of the notion of an industry or a market. On the 
one hand, two goods are almost never perfect substitutes 
(in the sense that all consumers are indifferent between 
the goods when they have the same price). Goods are 
almost always differentiated by some characteristic. On 
the other hand, a group of products (an “industry”) 
always interacts to some extent with other goods in the 
economy; the pricing of goods outside the industry enters 
into the demand for the goods in the industry not only 
through income effects but also through substitution 
effects. The notion of an industry is an idealization or a 
limit case. 

There remains the question of how to describe the 
differentiation between the goods within an industry. 
This question has been addressed by, among others, 
Hotelling (1929), Chamberlin (1951, 1962), and Lancaster 
(1966). A good can be described as a bundle of character- 
istics: quality, location, time, availability, consumers’ in- 
formation about its existence and quality, and so on. Each 
consumer has a ranking over the mix of variables. 

Introducing all potential characteristics provides a rich 
description of a good, but is likely to be of little help in 
studying issues of industrial organization. Both in em- 
pirical work and in theoretical work, researchers focus 
their attention on a small subset of characteristics and 
on a special (but, if possible, reasonable) description of 
preferences. There are three types of cases that are com- 
monly used. 


2.1.1 Vertical Differentiation 


In a vertically differentiated product space, all consumers 
agree over the most preferred mix of characteristics and, 
more generally, over the preference ordering. A typical 
example is quality. Most agree that higher quality is 
preferable—for instance, that a Volvo is preferable to a 
Hyundai. (However, more consumers may still purchase 
the latter. The consumers’ income and the prices of the 
cars, and of servicing them, determine the consumers’ 
ultimate choice. See below.) Similarly, a smaller and more 
powerful computer is preferable to a larger, less powerful 
one. At equal prices there is a natural ordering over the 
characteristic space. 


Example 


A simple example of a vertical-differentiation model is the 
following. Each consumer consumes one or zero units of 
a “good.” The good is characterized by a quality index s 
(we will use s, for “services,” so as not to confuse quality 
with quantity, which is denoted by q). When the monop- 
olist produces several qualities, we will often talk about 
these different qualities as being “different goods.” For 
the moment, we shall confine ourselves to the single- 
quality/good monopolist. A consumer has the following 
preferences: 


$$U = \begin{cases} \theta s - p & \text{if he buys a good with quality } s \text{ at price } p, \\ 0 & \text{if he does not buy.} \end{cases}$$


$U$ should be thought of as the surplus derived from
the consumption of the good. $s$ is a positive real number
that describes the quality of the good. The utility is sepa-
rable in quality and price. $\theta$, a positive real number,
is a taste parameter. All consumers prefer high quality,
for a given price; however, a consumer with a high $\theta$
is more willing to pay to obtain high quality. Modeling
a distribution of tastes consists of assuming that $\theta$ is
distributed in the economy according to some den-
sity $f(\theta)$ with cumulative distribution function $F(\theta)$ on
$[0, +\infty)$, where $F(0) = 0$ and $F(+\infty) = 1$. Thus, $F(\theta)$ is
the fraction of consumers with a taste parameter of less
than $\theta$.

An interesting reinterpretation of these preferences
views $\theta$ as the inverse of the marginal rate of substitution
between income and quality rather than as a taste param-
eter. As far as the choice between buying and not buy-
ing is concerned, the consumer's preferences could be
read as
$$U = \begin{cases} s - (1/\theta)p & \text{if he buys a good with quality } s \text{ at price } p, \\ 0 & \text{if he does not buy.} \end{cases}$$
On this interpretation, all consumers derive the same
surplus from the good, but they have different incomes
and, therefore, different marginal rates of substitution be-
tween income and quality ($1/\theta$). Wealthier consumers
have a lower “marginal utility of income” or, equivalently,
a higher $\theta$.1

Let us derive the demand function(s) for this particular
utility function. If only one quality, $s$, is offered at price $p$,
the demand for the good is equal to the number of con-
sumers with taste parameter $\theta$ such that $\theta s \ge p$. In other
words, the demand for the good is
$$D(p) = N[1 - F(p/s)],$$
where $N$ is the total number of consumers.

If there are several qualities offered in the market, the
consumers choose among these qualities as well as choos-
ing whether to buy at all (assuming they have unit de-
mands, i.e., they consume at most one unit of the good,
whatever the quality). Suppose, for instance, that goods
of two qualities, $s_1 < s_2$, are sold at prices $p_1 < p_2$. (The
price inequality makes the problem nontrivial, as a low-
quality good that is more expensive than a high-quality
one will never be purchased.) First, assume that the “qual-
ity per dollar” is higher for quality 2: $s_2/p_2 > s_1/p_1$. Then,
all consumers always prefer quality 2 to quality 1 if they
purchase at all:
$$\begin{aligned}(\theta s_2 - p_2) - (\theta s_1 - p_1) &= p_2(\theta s_2/p_2 - 1) - p_1(\theta s_1/p_1 - 1) \\ &\ge (p_2 - p_1)(\theta s_1/p_1 - 1) \\ &\ge 0 \quad \text{if } \theta s_1 \ge p_1.\end{aligned}$$
The demand for the high-quality good is then
$$D_2(p_1, p_2) = N[1 - F(p_2/s_2)],$$
and the demand for the low-quality good is zero. The
more interesting case occurs when the low-quality good
is not “dominated.” Then, the consumers with a taste
parameter exceeding $\tilde{\theta} = (p_2 - p_1)/(s_2 - s_1)$ buy the
high-quality good, since $\theta s_2 - p_2 \ge \theta s_1 - p_1 \iff \theta \ge \tilde{\theta}$;
those with a taste parameter lower than $\tilde{\theta}$ but exceeding
$p_1/s_1$ buy the low-quality good, and the others do not
buy. Thus, the demands are
$$D_2(p_1, p_2) = N\left[1 - F\left(\frac{p_2 - p_1}{s_2 - s_1}\right)\right]$$
and
$$D_1(p_1, p_2) = N\left[F\left(\frac{p_2 - p_1}{s_2 - s_1}\right) - F\left(\frac{p_1}{s_1}\right)\right].$$

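The two-quality demand system can be written as a small function. The sketch below assumes, purely for illustration, that the taste parameter is uniformly distributed on $[0, 1]$; the text only requires some distribution $F$ on $[0, +\infty)$. The function names and the numerical values are likewise assumptions.

```python
def uniform_cdf(theta, upper=1.0):
    """CDF of a taste parameter uniform on [0, upper] (an illustrative choice of F)."""
    return min(max(theta / upper, 0.0), 1.0)


def vertical_demands(p1, p2, s1, s2, n=1.0, cdf=uniform_cdf):
    """Demands (D1, D2) for qualities s1 < s2 sold at prices 0 < p1 < p2."""
    if s2 / p2 >= s1 / p1:
        # Quality 1 is dominated: every buyer picks quality 2.
        return 0.0, n * (1 - cdf(p2 / s2))
    cutoff = (p2 - p1) / (s2 - s1)  # taste parameter indifferent between the two qualities
    d2 = n * (1 - cdf(cutoff))
    d1 = n * (cdf(cutoff) - cdf(p1 / s1))
    return d1, d2


# Low quality not dominated: tastes in [0.2, 0.4) buy quality 1, tastes above 0.4 buy quality 2.
not_dominated = vertical_demands(0.2, 0.6, 1.0, 2.0)
# Low quality dominated: only quality 2 is bought, by tastes above 0.3.
dominated = vertical_demands(0.5, 0.6, 1.0, 2.0)
```

Passing a different `cdf` changes the distribution of tastes without touching the cutoff logic.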
2.1.2. Horizontal Differentiation 


For some characteristics, the optimal choice (at equal 
prices) depends on the particular consumer. Tastes vary in 
the population. An obvious example is the case of colors.
Another example is location. Bostonians are likely to 
prefer goods that are available in Boston to goods that 
are physically the same but are available only in Paris. 
Similarly, consumers will prefer to go to a shop or a 
supermarket that is nearby. In such cases of horizontal or 
“spatial” differentiation, there are no “goods” or “bads.” 


Example 


A simple model of horizontal differentiation is given 
in Hotelling 1929. Consider a “linear city” of length 1. 
Consumers are distributed uniformly along the city. As 
the analysis greatly resembles the vertical-differentiation 
example, let us start directly with a “two-good” example, 
which is used in exercise 2.3 below. Two shops, located at 
the two ends of the city, both sell the same physical good. 
The location of shop 1 is x = 0, and that of shop 2 is 
$x = 1$. (See figure 2.1.) Consumers have transportation
cost $t$ per unit of length (this cost can include the con-
sumers’ value of time). They have unit demands; they
consume zero or one unit of the good. Let $p_1$ and $p_2$
denote the prices charged by the two shops. The “gen-
eralized price” of going to shop 1 (respectively, shop 2)
for a consumer with coordinate $x$ is $p_1 + tx$ (respectively,
$p_2 + t(1 - x)$). If $\bar{s}$ denotes the surplus enjoyed by each
consumer when he is consuming the good, the utility of a
consumer located at $x$ is
$$\bar{s} - p_1 - tx$$
if he buys from shop 1,
$$\bar{s} - p_2 - t(1 - x)$$
if he buys from shop 2, and zero otherwise. Let us derive
the demand functions for three cases.

[Figure 2.1: The linear city, with shop 1 at $x = 0$ and shop 2 at $x = 1$; a consumer located at $x$ bears cost $tx$ to reach shop 1 and cost $t(1 - x)$ to reach shop 2.]

1. Suppose that consumers have identical ordinal preferences and differ only in
their incomes. Consider the following separable representation of a consumer's
utility function: $U = u(I - p) + s$, where $I$ is the consumer's income. (Taking a
slightly more general function $U = u(I - p) + \Phi(s)$ with $\Phi$ increasing would
not change anything; it would amount to a redefinition of “quality.”) Let $p$
be much smaller than $I$; i.e., the expenditure on this particular good is small rela-
tive to income. A first-order Taylor expansion yields $U \simeq u(I) - u'(I)p + s$. Let
$\theta = 1/u'(I)$. If $u$ is concave, wealthier consumers have a low $u'(I)$ and therefore
a high $\theta$.

If the price difference between the two shops does not
exceed the transportation cost $t$ along the whole city, and
if prices are “not too high” (see below), there exists a
consumer with location $\tilde{x}$ who is indifferent between buy-
ing from shop 1 and buying from shop 2:
$$p_1 + t\tilde{x} = p_2 + t(1 - \tilde{x}) \iff \tilde{x}(p_1, p_2) = \frac{p_2 - p_1 + t}{2t}.$$
Then the demands are
$$D_1(p_1, p_2) = N\tilde{x}(p_1, p_2)$$
and
$$D_2(p_1, p_2) = N[1 - \tilde{x}(p_1, p_2)],$$
where $N$ is the total number of consumers.

If the price difference between the two shops exceeds $t$
(say, $p_2 - p_1 > t$), shop 2 has no demand. Shop 1 has
demand $D_1(p_1, p_2) = N$ if $p_1 \le \bar{s} - t$ (i.e., if all consumers
are willing to buy at shop 1) and demand $D_1(p_1, p_2)
= N(\bar{s} - p_1)/t$ if $p_1 > \bar{s} - t$. In the latter case, the market
is “not covered,” in that some consumers do not buy
at all.

The third case is that in which each shop has local
monopoly power (and the market is not covered). This
occurs when both $p_1$ and $p_2$ belong to $[\bar{s} - t, \bar{s}]$ and the
consumer located at $\tilde{x}(p_1, p_2)$ who would be indifferent
between the two shops does not buy. (This is equivalent
to the condition $p_1 + p_2 + t > 2\bar{s}$.) The three cases are
illustrated in figure 2.2.
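The three cases can be collected in one demand function. The sketch below is an illustration under the assumptions of the example (shops at the two ends of a city of length 1, uniformly distributed consumers, unit demands); the function name, the argument `s_bar` for the gross surplus, and the numerical values are assumptions.

```python
def hotelling_demands(p1, p2, t, s_bar, n=1.0):
    """Demands (D1, D2) for shops located at 0 and 1 of the linear city."""
    def local_share(p):
        # Share of the city that gets a nonnegative surplus from a shop charging p.
        return min(max((s_bar - p) / t, 0.0), 1.0)

    if p2 - p1 > t:
        # Shop 2 is priced out, even for the consumer located next to it.
        return n * local_share(p1), 0.0
    if p1 - p2 > t:
        return 0.0, n * local_share(p2)
    if p1 + p2 + t > 2 * s_bar:
        # The would-be indifferent consumer does not buy: two local monopolies.
        return n * local_share(p1), n * local_share(p2)
    x = (p2 - p1 + t) / (2 * t)  # location of the indifferent consumer
    return n * x, n * (1 - x)


competitive = hotelling_demands(1.0, 1.0, t=1.0, s_bar=2.0)
priced_out = hotelling_demands(0.5, 1.8, t=1.0, s_bar=2.0)
local_monopolies = hotelling_demands(1.8, 1.8, t=1.0, s_bar=2.0)
```

At a boundary price pair such as `p1 = p2 = 1.5` with `t = 1` and `s_bar = 2`, the competitive and local-monopoly formulas give the same demand, which is where the demand curve has its kink.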

Figures 2.2a–2.2c represent the generalized costs for
the consumers as a function of their locations, as well as
the market shares of the two products. In figure 2.2a, the
two products compete for consumers. A reduction in one
product's price reduces the demand for the other product.
In figures 2.2b and 2.2c, the pricing policy of one product
does not affect (locally, at least) the demand for the other
product. Figure 2.2d depicts the residual demand for good
1, taking the price of good 2 as given. This demand curve
exhibits a kink at the price at which the local monopoly
and the competitive regimes meet (a borderline case in
figures 2.2a and 2.2c). At this point, a unit increase in $p_1$
lowers the demand for good 1 by $1/t$ whereas a unit
decrease in $p_1$ raises the demand for good 1 by only
$1/2t$.2

[Figure 2.2: Panels a, b, and c plot $\bar{s} - p_1 - tx$ and $\bar{s} - p_2 - t(1 - x)$ against location $x$ between 0 and 1; panel d plots the residual demand for good 1, with a segment labeled “Competitive.”]


2.1.3. “Goods-Characteristics” Approach 


Goods are defined as bundles of characteristics, and the 
consumers have preferences over characteristics. The con- 
sumers may have heterogeneous preferences over charac- 
teristics. In the vertical- and horizontal-differentiation ap- 
proaches, it was assumed that consumers purchase only 
one good—in other words, that they do not get extra 
utility from consuming a variety of goods. In contrast, 
one could assume that consumers can consume several 
goods and, furthermore, that all they care about in a good 
is its characteristics. For example, assume that what the 
consumer cares about in foods is protein and vitamins. If 
one unit of food 1 supplies two units of protein and one 
unit of vitamins, one unit of food 2 one unit of protein 
and two units of vitamins, and one unit of food 3 one unit 
of each, the consumer is indifferent between one unit of 
each of foods 1 and 2 and three units of food 3. In other words,
the consumer ultimately cares only about the character- 
istics of the bundles of goods, i.e., the sum of the char- 
acteristics of the goods in the bundle. This is the approach 
pioneered by Lancaster (1966). 
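The food example amounts to adding up characteristics across the goods in a bundle. A minimal sketch follows, with the dictionary layout and function name chosen only for illustration.

```python
# (protein, vitamins) supplied by one unit of each food, as in the example.
FOODS = {
    "food1": (2, 1),
    "food2": (1, 2),
    "food3": (1, 1),
}


def characteristics(bundle):
    """Total (protein, vitamins) of a bundle given as {food name: quantity}."""
    protein = sum(qty * FOODS[name][0] for name, qty in bundle.items())
    vitamins = sum(qty * FOODS[name][1] for name, qty in bundle.items())
    return protein, vitamins


mixed = characteristics({"food1": 1, "food2": 1})
plain = characteristics({"food3": 3})
```

Both bundles deliver three units of protein and three units of vitamins, so a consumer who cares only about characteristics is indifferent between them.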

The goods—characteristics approach makes sense in a 
number of cases. For instance, when buying light bulbs, 
the consumer certainly cares mainly about the total num- 
ber of hours of lighting provided by the bundle of bulbs. 
The key to the approach is to be able to sum up the 
characteristics. In some cases this approach is less handy 
—in particular, where there are indivisibilities of con- 
sumption, as in the vertical- and horizontal-differentiation 


examples above. The Lancaster framework must then be 
amended.3


2.1.4 Traditional Consumer-Theory Approach 


In its extreme form, the Lancasterian approach ignores the 
notion of good to focus on that of characteristic; goods 
are here only to provide characteristics. Conversely, one 
can ignore the notion of characteristic and focus on that 
of good. This is indeed the approach taken by classical 
general-equilibrium theory: Both production functions 
and utility functions are defined as functions of the quan- 
tities of the various goods. One cannot be more general 
than by writing a utility function $U(q_0, q_1, q_2, \ldots, q_n)$ with
components the consumptions of goods $0, 1, 2, \ldots, n$. The
questions are these: How can we impose more structure 
and get results? What does this added structure imply 
about products? 

The following utility function is commonly used to 
study product selection (see chapter 7). There are two 
sectors in the economy. The “representative consumer” 
(all consumers are identical) consumes a quantity $q_0$ of the
unique good produced by the first sector and quantities
$\{q_i\}_{i=1}^{n}$ of the $n$ goods produced by the second sector,
which is called “the sector of differentiated products.” His
utility function is
$$U = U\left(q_0, \left(\sum_{i=1}^{n} q_i^{\rho}\right)^{1/\rho}\right).$$
For $U$ to be quasi-concave, $\rho$ must be $< 1$. If $p_i$ is the price
of the differentiated product $i$, and if the good in the first
sector is the numéraire and $I$ is the consumer's income, the
representative consumer maximizes $U$ subject to the bud-
get constraint
$$q_0 + \sum_{i=1}^{n} p_i q_i \le I.$$
The “sub utility function” for the differentiated goods is a
constant-elasticity-of-substitution (CES) utility function.


2. There is also a large literature in marketing that deals with the notion of
product space. Schmalensee and Thisse (1987) draw the link between the
“perceptual mapping” techniques used in that literature and the location tech-
niques considered here.


3. As Rosen (1974) notes, “two six-foot cars are not equivalent to one twelve
[feet] in length, because they cannot be driven simultaneously” (or, “two fiddles
don’t make a Stradivarius”). Rosen departs from the Lancasterian framework on
the assumption of divisibility. As a consequence, the “hedonic prices,” which
give the total cost of a bundle of characteristics, may not be linear in the
amounts of the characteristics even under perfect competition. That is, one
cannot compute the price of a good by simply adding the amounts of its
characteristics weighted by prices assigned to unit amounts of these character-
istics. See also Lancaster 1975, 1979.
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As was noted above, the approach in terms of goods 
rather than characteristics is very general (for instance, the 
basic Lancasterian form simply presumes linear functional 
forms). However, for applications to industrial organiza- 
tion, it may have some drawbacks, As it does not make 
explicit the characteristics over which firms are compet- 
ing, it sometimes gives limited intuition conceming the 
plausibility of preferences. Or else, it may be very spe- 
cific, as in the case of the CES utility function given 
above. The latter utility function treats all the differen- 
tiated products in a symmetric way. When a firm intro- 
duces a product, it does not choose its degree of differen- 
tiation relative to other products. This contrasts with the 
horizontal- and vertical-differentiation approaches as well 
as with the goods—characteristics approach, in that there 
is no notion of “remoteness” or “neighborhood” rela- 
tive to the other products. In particular, this approach is 
poorly adapted to describe a confined space.4 Industrial-
organization economists generally feel that a new product
does not compete as closely with each and every other 
product. 


2.2 Product Selection 


Let us now investigate what kind of bias a monopolist 
introduces in his choice of product(s). Throughout this 
section we will follow fairly closely the work of Spence 
(1975, 1976).5 We start with the choice of quality (a 
vertical characteristic) by a single-product monopolist. 
We then look at the closely related question of whether a 
monopolist supplies too few or too many products (from 
a social standpoint). 


2.2.1 Product Quality 


Assume that a monopolist produces a single good, for 
which he chooses two real numbers: a price p and a quality s.
Let p = P(q,s) denote the inverse-demand curve, i.e.,
the price that creates a demand for q units of the good at
quality s. Quality is desirable, in that P increases with s.
Let C(q,s) denote the total cost of producing q units of
a good with quality s. It is natural to assume that C
increases with s.

Let us first consider the choice of quality by a social 
planner, who would choose price and quality (or, equi- 
valently, quantity and quality) so as to maximize the 
difference between gross consumer surplus and produc- 
tion cost. Taking quality and quantity as the decision 
variables, the social planner maximizes 


$$ W(q,s) = \int_0^q P(x,s)\,dx - C(q,s). $$

We approximate gross consumer surplus by the integral
under the demand curve given the specification of quality.
The first-order conditions are

$$ P(q,s) = C_q(q,s) \tag{2.1} $$

and

$$ \int_0^q P_s(x,s)\,dx = C_s(q,s), \tag{2.2} $$

where subscripts denote partial derivatives. Equation 2.1 
is the now-familiar equality between price and marginal 
cost. Equation 2.2 stems from the choice of quality. It 
states that the partial derivative of gross surplus with 
respect to quality is equal to the marginal cost of produc- 
ing this quality. To understand this formula, it may be 
helpful to think of the demand curve as stemming from a 
large number of consumers with unit demands, ranking 
them in decreasing order of willingness to pay. P(x,s) is 
then the price that makes the xth consumer indifferent 
between buying one unit of the good of quality s and not 
buying. Thus, P_s(x,s) is equal to the willingness to pay (in
terms of money) by consumer x for one more unit of
quality. Alternatively, P_s(x,s) is the marginal valuation for
quality for the marginal consumer when the price is P(x,s).
Hence, the marginal gross surplus is equal to output q
times the average marginal valuation of quality over the
market, which is

$$ \frac{1}{q}\int_0^q P_s(x,s)\,dx. $$


4. As we will see, in the case of such a utility function for large n, a change in
the price of good i > 1 does not trigger a change in the price of good j > 1.
This abstract utility function is, however, very convenient to use when studying
the number of products, because it allows one to concentrate on the entry
decision (“zero or one”) without complicating it with choices of location,
quality, etc. See chapter 7.


5. See also Sheshinski 1976.

The monopolist is concerned not with social surplus
but with profits. Thus, he maximizes 


$$ \Pi^m(q,s) = q\,P(q,s) - C(q,s). $$


This leads to the first-order conditions 


$$ P(q,s) + q\,P_q(q,s) = C_q(q,s) \tag{2.3} $$

and

$$ q\,P_s(q,s) = C_s(q,s). \tag{2.4} $$


Equation 2.3 is the familiar equality between marginal 
revenue and marginal cost, which is the expression of 
optimal pricing by the monopolist. Equation 2.4 determines
the optimal quality for a given output q. It states
that the marginal revenue associated with a unit increase 
in quality is equal to the marginal cost of producing this 
quality. 

The difference between equations 2.1 and 2.3 is famil- 
iar; the monopolist is concerned with the effect of output 
on price, whereas the social planner (or a competitive 
firm) is not. More interesting here is the comparison be- 
tween equations 2.2 and 2.4. The social planner’s concern 
about the average marginal valuation for quality is replaced
by the monopolist’s concern about the “marginal
marginal” valuation P_s(q,s), where the first marginal refers
to the consumer and the second to quality. (For ease of
notation, we will often delete the “marginal” corresponding
to quality.) This is easily understood. The social planner
looks at the effect of an increase in quality on all
consumers; the monopolist considers the effect of an increase
in quality on the marginal consumer. When raising
the quality by Δs, the monopolist can raise the price by
P_s(q,s)Δs while keeping the same demand (i.e., keeping
the marginal consumer at the same utility level). This
price increase, however, can be passed on to all inframarginal
consumers, which generates an extra revenue
q P_s(q,s)Δs.

In summary: The incentive to provide quality is related to 
the marginal willingness to pay for quality, for the marginal 
consumer in the case of a monopolist and for the average 
consumer in the case of a social planner. 

Hence, it is natural to compare P_s(q,s) and $\left(\int_0^q P_s(x,s)\,dx\right)/q$.
A caveat: This comparison tells us which way the monopolist
biases his choice of quality only if the output
is the same in both arrangements. This property generally
does not hold, because, as we have seen, the monopolist
tends to restrict output for a given quality. Hence the
qualifier in the following proposition: For a given output q,
the monopolist undersupplies quality relative to the
optimum when

$$ \frac{1}{q}\int_0^q P_s(x,s)\,dx > P_s(q,s), $$


and conversely. Because there is no a priori reason why
the marginal consumer is representative of the population
(i.e., has the same marginal valuation for quality as the
inframarginal consumers), the monopolist, in general, 
introduces a bias in product selection at a given output 
level. 

In both the case of choice of quality and that of mo- 
nopoly pricing, the monopolist is concerned with the 
effect of his decision on the marginal consumer's demand; 
the social planner cares about the effect of the decision 
variable on the inframarginal consumers’ welfare as well. 
The analogy becomes trivial in the extreme case in which 
services are a perfect substitute for price reductions (this 
may be the case, for instance, when the “service” is the 
price of delivery of the good). Suppose that the con- 
sumers care only about the “real price” p — s, where s 
denotes the monetary value of services; their demand 
function is q = D(p —s), so their inverse demand func- 
tion is linear in services: 


$$ p = D^{-1}(q) + s = P(q,s). $$


Suppose further that the cost of supplying the service is 
C(q,s) = sq. Then s is the equivalent of a price reduction
from the point of view of consumers and seller. It is left 
to the reader to check that equation 2.4 is satisfied. From 
here on, we will be interested in those quality variables 
that are imperfect substitutes for price reductions. It is 
useful for some intuitions to think of this trivial case of 
perfect substitutes. 
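
As a quick check of the perfect-substitutes case, the following worked step (a sketch that uses only the inverse demand and cost function specified above) substitutes them into conditions 2.2 and 2.4.

```latex
% Perfect-substitutes case: P(q,s) = D^{-1}(q) + s and C(q,s) = sq.
\begin{align*}
P_s(q,s) &= 1, \qquad C_s(q,s) = q, \\
\text{(2.4):}\quad q\,P_s(q,s) &= q = C_s(q,s), \\
\text{(2.2):}\quad \int_0^q P_s(x,s)\,dx &= \int_0^q 1\,dx = q = C_s(q,s).
\end{align*}
```

Both conditions hold identically for every s: profit reduces to q D^{-1}(q), so only the real price p − s matters, and the average and marginal valuations of quality coincide.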

The previous proposition gives a simple condition to 
determine whether the monopolist oversupplies or under- 
supplies quality. This may seem a weak proposition, in 
that it relies on an assumption—same output in the two 
arrangements—that is very unlikely to be satisfied. We 
should pause for a moment here and ask how important it 
is to compare the monopolist’s and the social planner’s 
choices of qualities once the difference in pricing behaviors
(in outputs) is taken into account. Such a comparison
makes sense if we are interested in moving from
a monopoly to a perfect socially oriented organization.
When making a choice between a monopoly and a socially 
oriented organization, one may want to focus on the total 
dead-weight loss associated with monopoly, and not only 
on the quality distortion. If, in contrast, the private mo- 
nopoly arrangement is not at stake, and one simply wants 
to affect the monopolist’s choice of product quality 
through subsidies, minimum quality standards, and so 
forth, then the previous proposition, which takes output 
as given, becomes more relevant. For instance, an increase 
in quality is socially desirable if the average valuation for 
quality exceeds the marginal valuation for quality and if the 
policy that brings forth this increase in quality does not 
induce the monopolist to contract output (remember that 
an increase in output at the monopoly solution is socially 
desirable). The bottom line is, as usual, that the relevant 
issue depends on the policy move being considered.


2.2.1.1 Applications 


Swan's (1970) Optimal-Durability Theorem 


Suppose that the quality variable is the durability of the
good. For instance, let s denote the number of lighting
hours of a light bulb. A consumer who buys q bulbs
obtains qs hours. Assume that the consumers care only
about the total number of hours, and not about the way
these hours are obtained. In this Lancasterian world, the
price per unit of durability that induces consumers to buy
q units of a good of quality s depends only on the product
of these two variables: P(q,s)/s = P(qs).6 Let q̃ = qs denote
total consumption (and, similarly, let x̃ = xs, where x
denotes an arbitrary consumption level). Further, assume
that the production cost is linear in output: C(q,s) = c(s)q.
(Unit production cost naturally rises with quality: c′(s) > 0.)

The social planner (or a competitive industry) max- 
imizes total welfare: 


$$
\begin{aligned}
W(q,s) &= \int_0^q P(x,s)\,dx - c(s)\,q \\
&= \int_0^q s\,P(xs)\,dx - c(s)\,q \\
&= \int_0^{\tilde q} P(\tilde x)\,d\tilde x - \frac{c(s)}{s}\,\tilde q.
\end{aligned}
$$

The maximization with respect to q and s is equivalent to
that with respect to q̃ and s. In particular, we see that the
social planner minimizes the unit cost of durability c(s)/s.
The monopolist maximizes profit: 


$$ \Pi^m(q,s) = q\,P(q,s) - c(s)\,q. $$


The first-order condition with respect to s shows that the 
monopolist also minimizes the unit cost of durability. 
Hence, monopoly power does not introduce a distortion 
in the choice of durability. This result is part of the Swan 
invariance theorem, which states that, if consumers care 
only about total services q̃ = qs and if the production
function exhibits constant returns to scale in output, the
choice of durability is independent of the market structure. 

This invariance is quite natural. The consumers actually
care about the composite good q̃. The firm, in all arrangements,
wants to minimize the cost of producing this composite
commodity. Under constant returns to scale, its
unit production cost c(s)/s is independent of output. Thus,
durability and price decisions are disjoint. (The invariance
result does not hold if the cost function C(q,s) cannot
be written as a product c(s)q. And, of course, the assumption
that consumers consider two one-year bulbs to be perfect
substitutes for one two-year bulb is also crucial to the
result.7)
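
The invariance result can be illustrated numerically. The sketch below is not taken from the text: it assumes a hypothetical linear per-unit-of-service price function P(z) = A − Bz and a hypothetical unit cost c(s) = C0 + s², then searches a grid for the welfare-maximizing and profit-maximizing pairs (q, s). All parameter values and function names are illustrative assumptions.

```python
import numpy as np

A, B, C0 = 10.0, 1.0, 1.0   # hypothetical demand and cost parameters

def unit_cost(s):
    # Hypothetical c(s) = C0 + s**2, increasing in s.
    return C0 + s ** 2

def inverse_demand(q, s):
    # P(q, s) = s * P(q s), with the hypothetical one-argument P(z) = A - B z.
    return s * (A - B * q * s)

def welfare(q, s):
    # Closed form of the integral of P(x, s) over [0, q], minus c(s) q.
    return A * q * s - 0.5 * B * (q * s) ** 2 - unit_cost(s) * q

def profit(q, s):
    return q * inverse_demand(q, s) - unit_cost(s) * q

def grid_argmax(objective):
    # Brute-force search over a (q, s) grid with step 0.01 in both directions.
    q_grid = np.linspace(0.01, 12.0, 1200)
    s_grid = np.linspace(0.2, 3.0, 281)
    Q, S = np.meshgrid(q_grid, s_grid, indexing="ij")
    i, j = np.unravel_index(np.argmax(objective(Q, S)), Q.shape)
    return q_grid[i], s_grid[j]

q_w, s_w = grid_argmax(welfare)   # social planner
q_m, s_m = grid_argmax(profit)    # monopolist
print(f"planner:    q = {q_w:.2f}, s = {s_w:.2f}")
print(f"monopolist: q = {q_m:.2f}, s = {s_m:.2f}")
```

With these assumed forms, c(s)/s = C0/s + s is minimized at s = 1, so both searches should select the same durability (up to the grid step) while the monopolist's output is lower than the planner's.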


The Dorfman-Steiner (1954) Condition 


In their seminal paper on advertising, Dorfman and Steiner 
assumed that the demand facing the firm is a function of 
price and advertising, so that P(q,s) denotes the inverse 
demand function, where s denotes the total expenses of 
advertising. This formulation is not appropriate for welfare
analysis; rational consumers, in general, may not enjoy


6. More formally, consider for simplicity a representative consumer, and let
U(qs) denote the gross surplus for this consumer consuming q units of the good
with durability s. The net surplus is U(qs) − pq. The consumer's optimal
consumption is given by s U′(qs) = p. The expression P(qs) in the text is nothing
but U′(qs). This analysis is easily generalized to the case of heterogeneous
consumers.


7. For more on the durability and invariance issues, see Hirshleifer 1971,
Kihlstrom and Levhari 1977, and Liebowitz 1982.

advertising per se. The reasons why advertising affects
demand are not made explicit here and are subsumed in 
the “reduced-form” demand function P(q,s).8 Thus, we 
study only the monopolist’s program, and not the planner's.
The monopolist’s profit is


$$ \Pi^m(q,s) = q\,P(q,s) - C(q) - s. $$


The cost function is assumed to be additive in output and 
advertising. Conditions 2.3 and 2.4 can be written as 


$$ P(q,s) + q\,P_q(q,s) = C'(q) $$

and

$$ q\,P_s(q,s) = 1. $$


To obtain the Dorfman-Steiner condition, it turns out to 
be convenient to maximize over price and advertising 
rather than over quantity and advertising. Let q = D(p,s) 
denote the demand at price p and advertising level s. The 
monopolist’s profit can be rewritten as 


$$ \Pi^m(p,s) = p\,D(p,s) - C(D(p,s)) - s. $$


The first-order conditions for the maximization with re- 
spect to p and s are 


$$ D(p,s) + p\,D_p(p,s) = C'(D(p,s))\,D_p(p,s) $$

and

$$ p\,D_s(p,s) - C'(D(p,s))\,D_s(p,s) = 1. $$

Let

$$ \varepsilon_p \equiv -\frac{\partial D}{\partial p}\,\frac{p}{q} \quad\text{and}\quad \varepsilon_s \equiv \frac{\partial D}{\partial s}\,\frac{s}{q} $$

denote the elasticities of demand with respect to price and
advertising, respectively. Then, rearranging the two first-order
conditions yields the desired formula:

$$ \frac{s}{pq} = \frac{\varepsilon_s}{\varepsilon_p}. $$

The monopolist's optimal advertising/sales ratio is equal to the
ratio of the elasticities of demand with respect to advertising
and to price.9 (Because sales and advertising are selected
simultaneously, one should not conclude that sales determine
advertising in a causal sense.)
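
The rearrangement behind the Dorfman-Steiner condition can be spelled out as follows. This is a sketch, writing q = D(p,s) and C′ = C′(D(p,s)), and assuming the price elasticity is defined with a minus sign so that it is positive.

```latex
% From the two first-order conditions to s/(pq) = eps_s / eps_p.
\begin{align*}
D + (p - C')\,D_p = 0
  &\;\Longrightarrow\; \frac{p - C'}{p} = -\frac{D}{p\,D_p} = \frac{1}{\varepsilon_p},
  \qquad \varepsilon_p \equiv -D_p\,\frac{p}{q}, \\
(p - C')\,D_s = 1
  &\;\Longrightarrow\; \frac{s}{pq} = \frac{p - C'}{p}\cdot\frac{s\,D_s}{q}
  = \frac{\varepsilon_s}{\varepsilon_p},
  \qquad \varepsilon_s \equiv D_s\,\frac{s}{q}.
\end{align*}
```

The first line is the usual inverse-elasticity rule for the price-cost margin; the second multiplies the advertising condition by s/(pq) and substitutes that margin.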


Exercise 2.1* Show that if demand is of the Cobb-Douglas
type (q = p^{−α} s^{β}, where α and β are positive), the
advertising/sales ratio is a constant. (In particular, show that it
is independent of the cost structure.)


An Example of “Underprovision” of Quality 


Consider the example of vertical differentiation given 
above. Consumers have utility (net surplus) U = θs − p
if they purchase and 0 otherwise. θ is distributed over
the population with cumulative distribution F. If we normalize
N = 1 (there is no loss of generality in doing so),
the demand function is q = 1 − F(p/s), or p = P(q,s)
= s F^{-1}(1 − q), where F^{-1} (the inverse of F) is an increasing
function. The average marginal valuation for quality,
which guides the social planner’s quality decision, is

$$ \frac{1}{q}\int_0^q P_s(x,s)\,dx = \frac{1}{q}\int_0^q F^{-1}(1-x)\,dx. $$

The marginal valuation for quality of the marginal consumer,
which guides the monopolist’s decision, is

$$ P_s(q,s) = F^{-1}(1-q). $$

Because, for x < q,

$$ F^{-1}(1-x) > F^{-1}(1-q), $$

the average valuation for quality exceeds the marginal


8. An example of such a specification (from Butters 1977) is the following:
Suppose that the role of advertising is to inform consumers about the existence
of the product and its price. Let N, the number of consumers, be large. Assume
that the monopolist sends a total number s of messages (ads) randomly to
consumers (thus, some consumers get no ads, while others get several). A
consumer's probability of not getting an ad is

$$ (1 - 1/N)^s \approx e^{-s/N} $$

for N large. Letting d(p) denote the representative consumer's demand function
(or the average demand over different types of consumers), we have

$$ D(p,s) = N\,(1 - e^{-s/N})\,d(p). $$

Note that D has a multiplicative form in s and p. We will use this specification
in chapter 7.


9. See Nerlove and Arrow 1962 and Schmalensee 1972 for dynamic versions 
of this formula.

valuation, and, for a given q, the monopolist undersupplies
quality. This is easily understood: The marginal
willingness to pay for quality of the marginal buyer is
lower than that of other buyers, who have higher θ. An
exogenous and small increase in quality starting from the
monopoly quality would thus benefit society.
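
The comparison between the average and the marginal valuation can be checked numerically for any taste distribution. The sketch below assumes a hypothetical exponential distribution, F(θ) = 1 − exp(−θ), chosen only for illustration; the function names are likewise assumptions.

```python
import numpy as np

def average_and_marginal_valuation(q, f_inv, n=200_000):
    """Return (average, marginal) valuations of quality at output q.

    With P(q, s) = s * F^{-1}(1 - q), the valuation of quality by the x-th
    consumer is P_s(x, s) = F^{-1}(1 - x), which does not depend on s.
    """
    # Midpoint rule on [0, q]; midpoints avoid evaluating f_inv exactly at 1.
    h = q / n
    x = (np.arange(n) + 0.5) * h
    average = np.sum(f_inv(1.0 - x)) * h / q
    marginal = f_inv(1.0 - q)
    return average, marginal

def exp_inv(u):
    # Inverse of the hypothetical distribution F(theta) = 1 - exp(-theta).
    return -np.log(1.0 - u)

for q in (0.2, 0.5, 0.8):
    avg, mar = average_and_marginal_valuation(q, exp_inv)
    print(f"q={q:.1f}  average={avg:.4f}  marginal={mar:.4f}")
```

For this particular distribution the average is 1 − ln q and the marginal valuation is −ln q, so the gap equals 1 at every output level: the monopolist undersupplies quality at any given q.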

As was mentioned above, it is not clear, however, that 
the quality in the monopoly case is lower than that in 
the social-planning case. The social planner charges a 
low price and thus reaches a lower θ than the monopolist.
Hence, the average valuation for quality given the
social planner’s pricing policy may end up being lower 
than the marginal valuation for the monopolist. Indeed, 
in the exercise below the quality is the same in both 
arrangements. 


Exercise 2.2** In the above vertical-differentiation model,
let θ be uniformly distributed on [0,1]. The cost function
is


$$ C(q,s) = q\,\frac{s^2}{2}. $$


(i) Check that, at q given,

$$ \frac{1}{q}\int_0^q P_s(x,s)\,dx > P_s(q,s). $$


(ii) Show that, when the difference in output is taken 
account of, the monopoly and the social planner choose 
the same quality. 


An Example of “Overprovision” of Quality


A consumer with “type” θ has utility θ + (α − θ)s − p
if he purchases the good and utility 0 otherwise. The
parameter θ is distributed on [0, α]. Consider the situation
where consumers enjoy the good per se (i.e., get utility θ
from it) and also enjoy some extra services s < 1 that are
attached to the good. But they enjoy these services more
when their “intrinsic” willingness to pay for the good θ is
lower. For instance, a monopoly concert hall may offer
concerts and also distribute booklets explaining the music
and introducing the conductor. The benefit from these
side services may be higher for low-θ consumers. (High-θ
consumers may be wealthy people who can afford to buy
music books, have more education, etc.) It is easy to check
that for this specification of preferences there is overprovision
of services at a given output. The marginal
consumer values services more than the average consumer
who purchases the good. Again, this result is contingent
on the output being fixed. A social planner would
want to reach lower θ and, therefore, might want to
provide more services than the monopolist.
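
The overprovision claim can be checked in a few lines. This is a sketch, denoting by α the constant that multiplies s in the utility function above and assuming that θ has a continuous, strictly increasing distribution F and that the population is normalized to 1.

```latex
% A consumer buys iff theta + (alpha - theta) s >= p. Because s < 1, willingness
% to pay rises with theta, so the x-th buyer has type theta(x) = F^{-1}(1 - x).
\begin{align*}
P(x,s) &= \theta(x) + \bigl(\alpha - \theta(x)\bigr)s,
  \qquad \theta(x) \equiv F^{-1}(1 - x), \\
P_s(x,s) &= \alpha - \theta(x), \quad\text{which increases with } x, \\
\frac{1}{q}\int_0^q P_s(x,s)\,dx &< P_s(q,s) \quad\text{for } q > 0 .
\end{align*}
```

The inequality is the reverse of the one in the underprovision example: the marginal buyer has the lowest θ among buyers and therefore the highest valuation of services.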


2.2.2 Too Many or Too Few Products?


In the preceding subsection we considered the case of a 
monopolist choosing the quality of the single good he 
produces. In general, he may produce several goods. How 
does the monopolist’s choice of diversity compare with 
the socially optimal product diversity? The preceding 
example (which focused on the choice of a continuous 
variable rather than on the number of products) strongly 
suggests that the monopolist may choose either too 
many or too few products. As previously, “too many” or 
“too few” depends on the type of intervention one has in 
mind. In this subsection, we consider simple examples of 
the comparison between the monopoly solution and the 
social-planning solution (we do not study, for instance, 
the comparative statics of the monopoly situation). In 
chapter 7 we will return to the issue of product diversity 
in a framework of monopolistic competition. 


2.2.2.1 Nonappropriability of the Social Surplus and 
Underprovision of Diversity 


A firm that creates a new product generally cannot cap- 
ture all the gross surplus generated by the product. Con- 
sider the case with only one potential design. The monop- 
olist produces either one product or none. Assume that 
there exists a fixed cost f of introducing or producing 
the good (on top of the variable production cost). The 
monopolist introduces the product only if the monopoly 
profit Π^m (measured in figure 1.1 by the area of the
trapezoid CEFAC) exceeds f. The social planner introduces
the good if and only if the social welfare W (measured
in figure 1.1 by the area of the triangle ADG)
exceeds f. Note that W > Π^m. Hence, if the fixed cost
satisfies W > f > Π^m, the monopolist does not introduce
the good; the social planner does. Thus, with only one
potential product, monopoly may imply “too few products”
and never entails “too many products.” This can be
explained by the fact that the monopolist, in general,
cannot appropriate the social surplus. W − Π^m represents
the sum of the dead-weight loss (measured by the triangle
FEG) and the consumer net surplus (measured by the
triangle CDE). An attempt to capture the consumer surplus
by raising the price raises the dead-weight loss. What
matters for the efficiency of product selection is what 
fraction of net potential surplus is capturable by the 
monopolist. 
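
A small numerical illustration of the appropriability argument follows. It is a sketch under assumptions that are not in the text: a hypothetical linear inverse demand P(q) = a − q and a constant unit cost c < a; the function names are illustrative.

```python
def linear_demand_surplus(a, c):
    """Surplus measures for hypothetical inverse demand P(q) = a - q, unit cost c < a."""
    q_m = (a - c) / 2                    # monopoly output: marginal revenue a - 2q equals c
    p_m = a - q_m                        # monopoly price
    profit = (p_m - c) * q_m             # monopoly profit, gross of the fixed cost f
    consumer_surplus = q_m ** 2 / 2      # triangle between demand and the monopoly price
    q_w = a - c                          # output when price equals marginal cost
    welfare = q_w ** 2 / 2               # maximal social surplus W, gross of f
    deadweight = welfare - profit - consumer_surplus
    return {"profit": profit, "welfare": welfare,
            "consumer_surplus": consumer_surplus, "deadweight": deadweight}

def introduces(f, a, c):
    """Who introduces the good when the fixed cost is f?"""
    m = linear_demand_surplus(a, c)
    return {"monopolist": m["profit"] > f, "planner": m["welfare"] > f}

print(linear_demand_surplus(10.0, 2.0))
print(introduces(20.0, 10.0, 2.0))
```

With linear demand the monopolist appropriates exactly half of W, the remainder splitting equally between consumer net surplus and dead-weight loss, so any fixed cost between Π^m and W leads the planner but not the monopolist to introduce the good.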


Remark 1 Even if the firm charges the monopoly price 
(thus introducing a dead-weight loss), there is still a ten- 
dency to have too few products under monopoly because 
the monopoly cannot capture the net consumer surplus. 


Remark 2 There is a case in which the monopolist can 
capture the whole potential surplus from the introduction 
of the good. This case arises when the monopolist can 
price discriminate perfectly (i.e., can capture the whole
consumer surplus without introducing a dead-weight loss). 
As we will see in chapter 3, perfect discrimination requires 
a perfect knowledge of individual demand functions (and 
not only of the aggregate demand function), a nonlinear 
pricing schedule if consumers’ demands are not unit de- 
mands, and the absence of arbitrage between consumers. 


Remark 3 We will see in chapter 10 that an analogous 
problem of appropriability is the basis for Schumpeter’s 
reflections on the incentives for research and development
(where the “good” is an invention).


Remark 4 A first intuition would be that the tendency to
lose products is higher for higher elasticities of demand,
because the monopolist is then induced to charge a lower
price. However, for a constant-elasticity demand the fraction
of potential surplus which the monopolist can appropriate
increases with the elasticity (see exercise 1.1). Thus,
underprovision of a product occurs more frequently for
low-elasticity products.


2.2.2.2 Multiproduct Monopoly and Overprovision of 
Diversity 


When a monopolist can manufacture several goods, a 
new effect appears which tends to create too much diversity.
Suppose that a monopolist can produce two goods
which are substitutes (∂D_i/∂p_j > 0, i ≠ j, i = 1, 2). The
price he charges for good 1 exceeds the marginal cost of
producing that good. Thus, the demand function for good
2 is shifted up by the use of monopoly power for the first
good. This may make it profitable to produce good 2,
which it would not be profitable to do if the monopolist
were to charge the competitive (socially optimal) price in
the first market. The monopolist may produce good 2
when this is not socially optimal. The exercise below 
gives an instance in which this is indeed the case. 


Exercise 2.3** Consider the horizontal-differentiation example
in subsection 2.1.2. Consumers are uniformly distributed
along a linear city of length 1, have transportation
costs t per unit of distance, and have unit demands.
Except for their location, they are all identical and have
gross surplus s̄ for the good sold by the monopolist. The
monopolist can sell the good at different sites, and product
diversity is then measured by the number of sites.
For simplicity, a law requires that the shops be at the
borders of the city (at abscissa 0 and 1). The fixed set-up
cost of establishing a site is f. The marginal cost of producing
the good is zero. Assume that t/2 > f > t/4, and
that s̄ is “sufficiently large” (so that the market is covered
even if there is only one site). Show that the monopolist
sets up two shops whereas the social planner sets up only
one. Explain.


The conclusion of this subsection is that a monopolist 
can provide too many or too few products. Besides the 
two effects identified here, we will identify a third effect 
which affects product diversity when we introduce mono- 
polistic competition. This third effect is linked to the 
externality between firms. 


2.2.3 Product Selection and Discrimination 


A monopolist who faces consumers with differing tastes 
is eager to learn which taste each particular consumer has.
If he can do so, he may be able to charge a high price to 
a consumer with a high willingness to pay for his good 
and a low price to a consumer with a lower willingness to 
pay (“have the rich pay and still sell to the poor”), assum- 
ing that the consumers cannot resell the good (arbitrage). 
However, asking a consumer for his taste parameter is not 
“incentive compatible”; the consumer has a strong incen- 
tive to claim that he has a low willingness to pay so that 
he will be charged a low price. As we will see in chapter 
3, the monopolist can nevertheless try to discriminate 
between consumers by using variables that are related to 
the willingness to pay. One such variable is the quantity 
purchased by the consumer if the latter has a downward- 
sloping demand: Higher purchases are usually a signal of 
higher willingness to pay. This will lead to a justification 
of “nonlinear tariffs.” Another such variable is the quality
of the product purchased; for instance, in the unit-demand
vertical-differentiation model of subsection 2.1.1, a consumer
with a high θ is more eager to consume high-quality
goods than a consumer with a low θ. The monopolist
can then use his product line or spectrum of goods
(priced at different rates) to discriminate between con- 
sumers (to tell them apart). This is described in the sup- 
plementary section of chapter 3. We will see that under 
reasonable assumptions the quality sold to a particular 
consumer is suboptimal, and that the monopolist tends to 
offer “too many products.” 




2.3 Quality and Information 


The quality of some goods (e.g., dresses) can be ascertained
by consumers before a purchase. In other cases the
quality is learned after the good is bought; this is the case,
for instance, with the taste of canned food or the quality
of a restaurant. For still other goods, aspects of the quality
(e.g., the amount of fluoride in a toothpaste, the timeliness
of a doctor's intervention) are rarely learned, even
after consumption. These three types of goods have been
christened “search goods,” “experience goods” (Nelson 1970),
and “credence goods” (Darby and Karni 1973). To be sure,
most goods cannot be classified in this simple manner 
because they possess attributes that are learned before 
purchase, after purchase, or never. But this classification is 
quite useful for analysis. 

The previous section focused on search goods. The
main issue for such goods is product selection (quality, 
product diversity). For experience goods, the main issue is 
information: How do consumers learn the quality? What 
incentives do firms have to supply it? We will see that 
repeat purchases offer some consumer control over qual- 
ity for such goods. Credence goods face the informational 
issue with a vengeance. For obvious reasons, they often 
require government intervention. 

In a broad sense, search goods include “warranty 
goods.” It is not always necessary to observe quality 
before purchasing a good. If the producer offers to com- 
pensate the buyer in full should the quality of the good 
end up differing from the announced characteristics, qual- 
ity is no longer an issue; the buyer does not worry 
whether the producer's announcement of quality is ac- 
curate. It remains to be seen whether the producer has an 
incentive to give full warranty. It can be shown that, if it
is possible ex post to evaluate the quality (performance) of
the good, and if variations in the good’s performance can 
be attributed entirely to the producer, the producer in- 
deed wants to give full warranty (see the Game Theory 
User’s Manual, example 3 of section 5). The intuition is 
this: Were the producer to give less than full warranty, 
the buyers would become suspicious; they would cor- 
rectly infer that the reason the producer is afraid to give 
a full warranty is that the product is very likely defective 
and thus very likely to lead the producer to pay the 
compensation. Thus, a low warranty would be a signal 
of low quality. In contrast, a full-warranty system makes 
the producer internalize any consumer misperceptions 
and suppresses informational problems. Thus, the qual- 
ity problem could be eliminated by a perfect warranty 
system. 

In many interesting real-world situations, however, the 
warranty system is nonexistent or imperfect. When qual- 
ity means durability, the good must be consumed in order 
for the buyer to know its exact quality; if, as is usual, the 
eventual performance of the good depends on the way 
the buyer consumes it as well as on its intrinsic quality, 
there is a moral-hazard problem on the consumer side: 
The buyer has no incentive to take care of the good if he 
is to be fully reimbursed in case of a breakdown (i.e.,
breakdown imposes no cost on him). The natural way to 
proceed is to make the consumer internalize some of the 
costs associated with his behavior by giving a less than 
full warranty. Limited warranties can also be motivated 
by adverse selection considerations; for obvious reasons, 
goods carrying full warranties tend to attract heavy users 
or “high-risk” consumers, while (cheaper) goods asso- 
ciated with lower coverage are purchased by consumers 
who are less likely to benefit from a warranty. 

Moral hazard and adverse selection certainly explain 
many of the restrictions on the warranty system. For 
instance, those parts of a car that are considerably affected 
by the owner's driving style or maintenance behavior 
(e.g. the tires) are unlikely to be guaranteed by the manu- 
facturer. But there are other causes of imperfect warranties 
for experience goods. The quality may be impossible or 
very costly to measure for a court, while still being ob- 
served by the consumer. On the one hand, it may be 
subjective (do the colors of a color TV after one year of 
use “conform to expectations”?). On the other hand, the 
enforcement costs may be incommensurate with the issue 
(the buyer does not sue the manufacturer when a very
inexpensive item breaks down). When a manufacturer's
description of quality involves many attributes and/or
much uncertainty about the final performance pattern
over the sample of produced items, the warranty not only 
may require high enforcement costs but may also become 
very complex to evaluate for the consumer. 

In this section we consider experience goods. To ob- 
tain clear conclusions, we assume away warranties. We 
first describe the informational issue when the consumers 
buy a good only once. We then suggest how repeat 
purchases and reputation can give a producer incentives 
to supply an adequate level of quality. Repeat purchases 
are studied in much greater detail in the supplementary 
section.


2.3.1 One-Shot Relationships: Moral Hazard and 
“Lemons” 


2.3.1.1 Moral Hazard 


A manufacturer who sells an experience good to one-time 
consumers and who can neither offer a warranty nor be 
sued for faulty quality has strong incentives to cut quality 
(as long as quality is costly) to the lowest possible level, 
because the market price cannot respond to the unobserv- 
able quality. The minimum level may be a legal standard 
or the level at which consumers can supply convincing 
evidence of underprovision of quality. Thus, there is 
“moral hazard” on the producer side. 

The moral-hazard problem explains, for instance, why 
the food in restaurants in some tourist areas of Paris is not 
what it could be: The transient nature of consumption 
there does not allow reputation to play an important role.
More generally, for one-shot purchases, the quality chosen 
by a manufacturer is likely to be poor. 

Consider the following simple model (the repeated 
version of which will be studied in the supplementary 
section). Consumers are all identical. They have preferences
$U = \theta s - p$ if they buy the monopolist’s product at
price p and the product has quality s, and $U = 0$ otherwise.
The monopolist chooses a price p and a quality s.
The unit cost of production is $c_s$ for quality s. The quality
can be “high” ($s = 1$; cost $c_1 > 0$) or “low” ($s = 0$; cost
$c_0$ in $[0, c_1)$). Assume that $\theta > c_1$, so that producing
the high quality is socially efficient. The monopolist’s profit is
$p - c_s$ if he sells quality s at price p and 0 if he does not
sell. (Without loss of generality, we normalize the number
of consumers to one.) Assume further that the consumers
do not observe quality before purchasing. It is clear that
an equilibrium in which the monopolist sells and provides
high quality cannot exist. The monopolist would save
$c_1 - c_0$ by cutting quality, and this would not reduce
demand. If $c_0 = 0$, equilibrium in this model consists of
$s = 0$ and, hence, $p = 0$. If $c_0 > 0$, the market disappears,
since consumers are unwilling to pay for goods of quality
0, and so the monopolist cannot recover costs.
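
The unraveling argument can be traced step by step in a few lines of code. This is only an illustrative sketch: the function name `one_shot_outcome` and the numerical values in the usage note are assumptions introduced here, and the code simply encodes the reasoning of the text (demand cannot respond to unobserved quality, so the cheaper quality is chosen, and consumers anticipate this).

```python
def one_shot_outcome(theta, c0, c1):
    """Equilibrium of the one-shot model with unobservable quality.

    theta: consumers' valuation of quality; c0, c1: unit costs of low (s = 0)
    and high (s = 1) quality, with 0 <= c0 < c1 < theta.
    Returns (trade, quality, price).
    """
    assert 0 <= c0 < c1 < theta
    # Demand cannot react to unobserved quality, so whenever the monopolist
    # sells he saves c1 - c0 > 0 by choosing s = 0.
    quality = 0
    # Rational consumers anticipate s = 0 and pay at most theta * 0 = 0.
    max_price = theta * quality
    # The monopolist sells only if that price covers the low-quality cost.
    if max_price >= c0:
        return (True, quality, max_price)
    return (False, None, None)
```

With the hypothetical values theta = 10 and c1 = 4, a zero low-quality cost gives trade at quality 0 and price 0, whereas any positive low-quality cost makes the market disappear.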

In the simple example above, we assumed that con- 
sumers could not learn the quality of the product before 
buying. In a number of interesting cases, however, some 
consumers do learn information related to a product's 
quality before the purchase. For instance, they perform 
technical tests, or they develop skills that aid them in 
judging quality by simply looking at the product, or they 
read Consumer Reports. Another possibility (considered in 
Bagwell and Riordan 1986) is that consumers enter the 
market sequentially, so that at a given point in time some 
consumers know the quality and some do not. 

The informed consumers exert a positive externality on the
uninformed ones. By being more demanding, they drive
up the quality of the monopolist’s product.¹⁰ To see
this, suppose that in the previous model an exogenously
given fraction $\alpha$ of consumers are perfectly informed (i.e.,
observe the quality before buying). These consumers are
willing to pay $\theta$ if quality is high and 0 otherwise. The
remaining $1 - \alpha$ consumers observe product quality only
after they purchase. Suppose that the monopolist charges
a price p in $[0, \theta]$. The informed consumers buy if quality
is high, yielding a profit $\alpha(p - c_1)$, and do not buy if
quality is low. Consider the uninformed consumers’ behavior.
Suppose first that they do not buy. Then the only
demand comes from informed consumers, and the monopolist's
optimal choice is the high quality (as long as
$p \geq c_1$). Hence, the uninformed consumers should expect
quality to be high, and should thus purchase (as we
assumed that $p \leq \theta$), a contradiction. Suppose next
that the uninformed consumers buy. The monopolist’s
profit is

$$p - c_1$$

if he provides the high quality and

$$(1 - \alpha)(p - c_0)$$

if he provides the low quality. He thus provides the high
quality if and only if

$$p - c_1 \geq (1 - \alpha)(p - c_0),$$

or

$$\alpha p \geq c_1 - (1 - \alpha) c_0.$$


10. The following argument borrows elements from Wolinsky's (1983) work on
prices as signals of quality and from Salop's (1977) and Salop and Stiglitz’s
(1977) work on price search. See also Farrell 1980, Chan and Leland 1982, and
Cooper and Ross 1984, 1985.


From this inequality, we can infer two interesting economic 
phenomena. 

First, the monopolist will supply the high quality only
if the price is sufficiently high. When the price is high,
the monopolist is afraid of losing his high profit margin
on informed customers; this makes the low quality less
attractive. In this sense, high prices can signal high qualities
to uninformed consumers when there also exist informed
customers (and the monopolist cannot offer different
qualities to the different types of customers). Indeed, if

$$\alpha\theta > c_1 - (1 - \alpha) c_0,$$

the monopolist in equilibrium charges $p = \theta$ and provides
the high quality.¹¹

Second, our condition is more likely to be satisfied the
higher the fraction $\alpha$ of informed customers.¹² This is
natural, because the informed consumers are those who
prevent the monopolist from cutting quality. We thus
infer that increasing the number of informed customers
favors efficiency.¹³
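
Both observations can be read off the condition that the price times the informed fraction must cover $c_1 - (1 - \alpha) c_0$. The sketch below is an illustration under the assumptions of the model above; the function names and the numbers in the usage note are hypothetical, introduced only for this example. It computes the lowest price compatible with high quality for a given $\alpha$, and the critical informed fraction at the top price $p = \theta$.

```python
def price_threshold(alpha, c0, c1):
    """Smallest price p satisfying alpha * p >= c1 - (1 - alpha) * c0."""
    if not 0 < alpha <= 1:
        raise ValueError("alpha must lie in (0, 1]")
    return (c1 - (1 - alpha) * c0) / alpha


def informed_fraction_threshold(theta, c0, c1):
    """Fraction alpha at which alpha * theta = c1 - (1 - alpha) * c0.

    Rearranging the equality gives alpha * (theta - c0) = c1 - c0.
    """
    if not c0 < c1 < theta:
        raise ValueError("the model assumes c0 < c1 < theta")
    return (c1 - c0) / (theta - c0)


def high_quality_at_top_price(alpha, theta, c0, c1):
    """True if high quality at p = theta beats cutting quality."""
    return alpha * theta > c1 - (1 - alpha) * c0
```

For instance, with theta = 10, c0 = 1, and c1 = 4, the critical fraction is 1/3: half the consumers being informed is enough to sustain high quality at p = theta, while one-fifth is not. The critical fraction falls as theta rises and as the cost saving c1 - c0 shrinks.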

The latter conclusion offers an argument for government
intervention. Whether consumers become informed
or not depends on their relative costs of becoming informed
(how bored they get reading Consumer Reports,
how receptive they are to learning about new products,
etc.). But when deciding whether to become informed, a
consumer takes only the private cost and the private
benefit into account. He does not take into account the
fact that, by being better informed, he induces (or allows)
the monopolist to credibly offer the high quality. Thus,
consumers’ information should be encouraged beyond
its privately optimal level. One such encouragement
may be the subsidization of magazines like Consumer
Reports.¹⁴


Exercise 2.4** In the previous example, show that if

$$\alpha p < c_1 - (1 - \alpha) c_0,$$


in equilibrium only a fraction of uninformed customers 
purchase and the monopolist randomizes between high 
and low quality. 


2.3.1.2 The Lemons Problem 


Akerlof (1970) showed that the same issues arise when 
the quality of goods is not a choice variable but putting 
the good on the market is. The basic idea, as before, is 
that if the buyers do not know the quality of the good 
when purchasing, the purchase price must be independent 
of the actual quality. This implies that sellers put their 
goods on the market only if the goods are of low quality; 
otherwise, they are better off “consuming” the goods 
themselves.


11. Note that $\theta - c_1 > (1 - \alpha)(\theta - c_0)$ and that $p = \theta$ is an upper bound on
what the monopolist can charge informed consumers when quality is high.


12. The condition requires that $p > c_1$, which implies that $p > c_0$.


13. In this simple model, the producer realizes all the efficiency gains by
charging $\theta$ for the high quality. More generally, the consumers will obtain some
extra surplus as well if they have different tastes or if they have downward-sloping
demands (and the monopolist cannot price discriminate perfectly; see
chapter 3).


14. The monopolist could duplicate the government intervention by contracting
with a third party (a laboratory, for instance) to supply estimates of the
product's quality. However, the independence of this third party from the
monopolist is likely to be dubious. Underwriters’ Laboratories and Good
Housekeeping try to provide such a service nevertheless.


Robinson Crusoe (agent 1) is the monopoly owner of
a goat. Friday (agent 2) is the potential purchaser of the
animal. The goat is characterized by s, the amount of milk
it produces daily (a quality parameter). Robinson has a
surplus of $\theta_1 s$ if he keeps the goat and p if he sells it at
price p. Friday has a surplus of $\theta_2 s - p$ if he buys at price
p and zero otherwise. Assume that, for technical reasons,
milk cannot be traded; only the animal can. Friday's marginal
valuation for quality exceeds Robinson's: $\theta_2 > \theta_1$
(these parameters are known to both parties). Thus, trading
is Pareto optimal regardless of s. However, trading
must be voluntary. Robinson has experience with the
goat and knows s perfectly. Friday knows only that s is, a
priori, uniformly distributed on the interval $[0, s_{\max}]$ (this is
the usual distribution for goats, from which Robinson's
goat has been drawn). Assume further that Friday is risk-neutral
(his objective function is $\theta_2 s^a - p$, where $s^a$ is the
expected quality of the goat, conditional on its being
offered for sale). Is there a price p ($\leq \theta_1 s_{\max}$) at which
Robinson and Friday can agree to trade? Suppose there is.
Friday should then infer the following: If Robinson is
willing to sell his goat at price p, the quality s must satisfy
$p \geq \theta_1 s$. This means that Robinson sells if and only if s
belongs to the interval $[0, p/\theta_1]$. Given that s is, a priori,
uniformly distributed, the average (expected) quality of
the goat given that it is offered for sale is

$$s^a(p) = \frac{1}{2}\,\frac{p}{\theta_1}.$$


The average quality is biased downward by the decision
to put the goat on the market ($s^a(p) < \tfrac{1}{2} s_{\max}$ for
$p < \theta_1 s_{\max}$). This is the so-called adverse-selection or lemons
problem.¹⁵

Friday agrees to buy the goat if and only if his expected
surplus from the transaction is positive, i.e., iff
$\theta_2 s^a(p) \geq p$, or $\theta_2 \geq 2\theta_1$. If tastes do not differ too much
in the sense that $\theta_2 < 2\theta_1$, there exists no price at which
Robinson is willing to sell and Friday is willing to buy.¹⁶
In this case the market breaks down completely. Whatever
s, a socially desirable transaction does not occur. The
exercise below develops this example in a bit more detail.
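
The breakdown result can be checked numerically. The following is a minimal sketch of the Robinson-Friday example, not a general treatment: the function names, the price grid, and the convention that Friday buys when exactly indifferent are assumptions made here for illustration.

```python
def expected_quality_on_market(p, theta1, s_max):
    """Mean of s conditional on Robinson selling at price p.

    Robinson sells iff theta1 * s <= p, i.e. s in [0, min(p / theta1, s_max)];
    with s uniform on [0, s_max], the conditional mean is the midpoint.
    """
    return min(p / theta1, s_max) / 2


def friday_buys(p, theta1, theta2, s_max):
    """Friday buys iff theta2 * (expected quality on market) - p >= 0."""
    return theta2 * expected_quality_on_market(p, theta1, s_max) - p >= 0


def trade_prices(theta1, theta2, s_max, grid=200):
    """Positive grid prices up to theta2 * s_max at which Friday buys."""
    prices = [theta2 * s_max * k / grid for k in range(1, grid + 1)]
    return [p for p in prices if friday_buys(p, theta1, theta2, s_max)]
```

With theta1 = 1 and s_max = 1, setting theta2 = 1.5 (less than twice theta1) leaves no price on the grid at which trade occurs, whereas theta2 = 3 yields a whole range of acceptable prices.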

The reason why the market disappears in this example 
is clear. Suppose that the price is high and the seller is 
willing to sell, but the buyer does not want to buy. The 
traditional way to balance the market is to lower the price 
to reduce excess supply.¹⁷ However, this mechanism may
not work here. A decrease in price reduces the average
quality on the market. (The fact that the good is still
offered becomes, in a sense, “bad news.” The actual quality
is endogenously given.) This may turn out to reduce
demand instead of increasing it. 
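
A one-line calculation for the uniform example shows why the price mechanism fails there. It is a worked step added for illustration, using the expected quality $p/(2\theta_1)$ of a goat offered at a price $p \leq \theta_1 s_{\max}$; Friday's expected surplus is then

```latex
\[
\theta_2\,\frac{p}{2\theta_1} - p
  \;=\; \Bigl(\frac{\theta_2}{2\theta_1} - 1\Bigr)\,p .
\]
```

When $\theta_2 < 2\theta_1$ the bracket is negative, so the expected surplus is negative at every positive price. A price cut lowers the expected quality in the same proportion as the price and therefore never makes the purchase attractive.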

In general, the market may not disappear; it may only 
shrink because of adverse selection. This situation arises 
in the used-car market, where the frequency of transac- 
tions is certainly smaller than it would be were the infor- 
mation structure about qualities perfect (leaving aside 
other transaction costs). It also arises in the insurance 
market—high-risk consumers are more likely to wish to 
purchase health or life insurance. 

Are there counteracting forces that attenuate this prob- 
lem? We have already mentioned the possibility of repeat 
purchases (which is studied in the next subsection) and 
that of warranties. Another way around adverse selection 
involves multidimensional contracts that specify several 
attributes in addition to the price. The buyer can then use 
the variation in the mixes of these different attributes to 
obtain a relatively satisfactory selection of goods and 
sellers; alternatively, the seller can use the mix he offers 
to signal the unknown characteristics of his good. For 
instance, a consumer with a high probability of accident is 
more eager to be fully insured against an accident than a 
consumer with a low probability. An insurance company 
can introduce two-dimensional contracts—{price, amount 
of reimbursement in case of an accident}—and can select 
low-probability consumers by charging them a lower price 
but reimbursing them only partially in case of an accident. 
(Deductibles are a form of partial insurance.) The organi- 
zation of markets in such circumstances is a fascinating 
topic, but lies outside the scope of this book.¹⁸ We simply
note that this discrimination on the basis of the level 
of reimbursement, if privately optimal, is not socially 
optimal, even conditional on the imperfect information 
structure. 
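
The screening idea can be illustrated with a small numerical sketch. Everything specific in it is an assumption made for illustration and not taken from the text: consumers have square-root utility (so they are risk-averse), initial wealth 100, and a possible loss of 64; the accident probabilities are 0.5 and 0.1; and each contract is priced at the fair odds of the type it is meant for. The sketch only checks that each type prefers the contract intended for it; it does not compute a market equilibrium.

```python
from math import sqrt


def expected_utility(prob, premium, reimbursement, wealth=100.0, loss=64.0):
    """Expected square-root utility for accident probability `prob`."""
    no_accident = sqrt(wealth - premium)
    accident = sqrt(wealth - premium - loss + reimbursement)
    return (1 - prob) * no_accident + prob * accident


# Hypothetical menu of (premium, reimbursement) contracts.
PROB_HIGH, PROB_LOW = 0.5, 0.1
FULL = (PROB_HIGH * 64.0, 64.0)   # full cover at a high price
PARTIAL = (PROB_LOW * 5.0, 5.0)   # low price, small reimbursement


def preferred_contract(prob):
    """Name of the contract giving this consumer higher expected utility."""
    menu = {"full": FULL, "partial": PARTIAL}
    return max(menu, key=lambda name: expected_utility(prob, *menu[name]))
```

Under these assumed numbers the high-probability consumer chooses the expensive full cover and the low-probability consumer chooses the cheap partial cover, so the menu sorts the two types.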

Government regulation or intervention is quite powerful
in some markets; it can take the form of quality controls,
minimum quality standards, occupational licensing and
certification (as for doctors, accountants, and lawyers), or
safety regulations.¹⁹,²⁰


15. Akerlof used as an example the market for used cars. Bad cars are called
“lemons.”


16. To be complete, note that a price $p > \theta_1 s_{\max}$ always induces the seller to
sell. The average quality is then $s_{\max}/2$. Again, if $\theta_2 < 2\theta_1$, the market cannot
be in equilibrium.


17. There is no income effect here. Thus, under perfect information, the
"demand curve" is downward sloping, and the Walrasian tatonnement process
is stable.


18. See, e.g., Rothschild and Stiglitz 1976, Wilson 1977, Stiglitz and Weiss
1981, Bester 1985, and Hellwig 1986a. On the organization of markets with
moral hazard with or without adverse selection, see, e.g., Arnott and Stiglitz
1982 and Hellwig 1986b.


Exercise 2.5**²¹ Potential sellers and buyers of used cars
are in equal number, N (where N is “large”). The distribution
of qualities in the population of potential sellers is
given by the c.d.f. $F(s)$ with density $f(s)$ on $[s_{\min}, s_{\max}]$. All
sellers have surplus $\{\theta_0 s\}$ when keeping a car with quality
s, and p when selling it (so all sellers are identical but for
the quality of their car). A buyer has surplus $\{\theta s - p\}$
when buying a car of quality s and 0 when not buying.
Buyers are heterogeneous: $\theta$ is distributed on $[\theta_{\min}, \theta_{\max}]$
according to the c.d.f. $G(\theta)$ and the density $g(\theta)$. Assume
that $\theta_{\min} < \theta_0 < \theta_{\max}$. Sellers know the quality of their
cars, but buyers do not (asymmetric information).

(i) Determine the efficient volume of trade (i.e., the
volume of trade under symmetric information) and the 
efficient mix. 

(ii) Compute the demand and supply curves under 
asymmetric information. Explain why the demand curve 
is not necessarily downward sloping (in spite of the ab- 
sence of income effects). 

(iii) Solve for a competitive equilibrium for f and g 
uniform on [0, 1]. Check that trade is suboptimal. 

(iv) Argue that, in general, there can exist multiple 
equilibria. Then assume that this is the case. Show that a 
higher-price equilibrium Pareto dominates a lower-price 
one. Compare with the first fundamental theorem of wel- 
fare economics. 

(v) Show that a minimum quality standard $s_0 > 0$, if
enforceable, may improve welfare.


2.3.2 Repeat Purchases 


In the absence of warranties, repeat purchases offer consumers
some means of monitoring quality. By experimenting,
consumers learn about the attributes of a product.
As long as their current experience is somehow
related to the future quality, they obtain valuable information
as to whether they should repeat their purchases.
There are two ways in which this mechanism can operate. 
First, the quality of the good may remain the same over 
time. For instance, a consumer may try a wine of a given 
vintage and vineyard. If he likes it, he is likely to like 
other bottles of the same vintage and vineyard. Past con- 
sumption then brings direct information on quality (or 
match). Often, however, the quality may change over 
time. A good meal at a restaurant may not imply that the 
next meal at the same restaurant will also be good. The 
chef may use lower-quality ingredients, spend less time 
preparing the food, and so on. If the producer can change 
quality over time, current quality is not necessarily infor- 
mative about future quality. In such cases, the repeat- 
purchases mechanism must operate in an indirect way. 

Repeat purchases are studied in detail in the supple- 
mentary section; here, a few important ideas are sketched. 

Consider first the polar case in which quality is unalter- 
able, so that consumption brings direct information. For 
the moment, take the quality as given in order to abstract 
from the moral-hazard problem. The only issue for the 
producer of a high-quality item is how to induce consumers
to try his product. Consumers are not reluctant
to try new products if the psychological cost of trying is 
low and the prospect of many future purchases high. For 
many products, however, the cost of experimenting may 
be perceived as high, even if the trial reveals the true 
quality. Can the producer then use a pricing strategy 
or some alternative instrument to induce the consumers 
to try? 

An apparently natural way to do this is to charge a low
price in the introductory period. But the consumers may
still be reluctant to buy if a low price somehow means bad
news about quality. Hence, we must study whether the
manufacturer of a high-quality item has more incentive to
offer a low price to encourage consumers to experiment
than he would have if his product were of low quality. For
simplicity, assume that quality can be “high” or “low,”²²
and that the low quality is so low that it does not generate
repeat purchases.


19. Of course, the seller and the buyer could duplicate the terms specified by
the regulatory agency or the law in their contract. However, in many cases,
they would not do so because of transaction and enforcement costs. On the one
hand, it is costly to foresee and write everything in a contract; on the other
hand, government control of quality may be more efficient if the seller sells
several units: there are increasing returns to scale in enforcement. (The seller
could set up a control agency exploiting these returns to scale, but the independence
of the agency with respect to the seller would be somewhat problematic.)


20. Leland (1979) examines the effect of a minimum quality standard in an
Akerlof-type model. Shaked and Sutton (1981) extend Leland’s examination in
the context of occupational licensing to allow for the possibility that those
excluded from a profession by licensing enter a para-profession. Shapiro (1986)
distinguishes between occupational licensing (an input regulation, which requires
a minimum level of human-capital investment by the professional) and
certification (in which consumers are provided with information about the
professional's training level, possibly above the licensing level). He shows that
both government interventions benefit consumers who value quality of service
highly at the expense of those who do not. He also notes that certification may
lead to excessive signaling by the professional.


21. This exercise is inspired by Wilson 1980.

The answer to our question lies in the comparison 
between two effects. On the one hand, a high-quality 
product generates more repeat purchases (Nelson 1974). 
Thus, attracting a customer yields more future revenues 
to the high-quality producer than to the low-quality one. 
Hence, a high-quality producer is more willing to sacrifice 
current profits (charge a low price) to attract customers. 
On the other hand, a low-quality product, for a given 
price, generally yields high profits because of a lower 
production cost (Schmalensee 1978). Thus, from a static 
point of view, a low-quality producer has more incentive 
to attract customers. Because a low-quality producer can 
always duplicate the introductory pricing strategy of a 
high-quality one, the difference in gain due to repeat 
purchases must exceed the cost advantage enjoyed by the 
low-quality producer in order for the price to convey any 
information about product quality. If this condition is met, 
the high-quality producer can sacrifice current profit by 
charging a low price to signal the quality. The low-quality 
producer is not willing to duplicate this sacrifice, because 
customers are not worth as much to him. Hence, under 
some circumstances a low price may signal a high quality. In 
essence, the high-quality producer says: “I will be in the 
market for a while because my quality is high. To prove 
this to you, I am willing to lose money now. You know 
that it would not be in my interest to do so, were my 
quality low. Try me.” 
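
The comparison of the two effects can be made concrete in a stylized two-period sketch. The setup is an assumption made for illustration and is not the model of the supplementary section: there is one trial period and one repeat period; $\delta$ is a discount factor; $p_1$ is the introductory price; a product revealed to be of high quality sells at $\theta$ in the second period; a low-quality product earns nothing after the trial; and a producer who attracts no customer earns zero. The costs $c_0 < c_1$ are those of subsection 2.3.1.1.

```latex
\begin{align*}
\text{high quality, customer attracted:}\quad
  & (p_1 - c_1) + \delta(\theta - c_1), \\
\text{low quality, customer attracted:}\quad
  & p_1 - c_0 .
\end{align*}
% The price p_1 separates the two types if the low-quality producer does not
% gain from matching it while the high-quality producer does:
\[
p_1 - c_0 \;\le\; 0 \;\le\; (p_1 - c_1) + \delta(\theta - c_1),
\]
% and such a p_1 exists if and only if
\[
\delta(\theta - c_1) \;\ge\; c_1 - c_0 .
\]
```

The left-hand side of the last inequality is the gain from repeat purchases and the right-hand side is the low-quality producer's cost advantage. Any separating price satisfies $p_1 \leq c_0 < c_1$, so in this sketch the high-quality producer does lose money in the introductory period.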

The signal of quality need not actually be an introduc- 
tory price. Any conspicuous initial expenditure that the 
monopolist can make to “prove” he will be in the market 
for a long time will do. Indeed, Nelson originally argued
that apparently uninformative advertising can be a signal
of quality. For instance, a wasteful advertising campaign 
that brings no direct information to consumers may some- 
times be interpreted as a statement of persistence in 
the market. This, of course, raises the question whether 
a wasteful expenditure or an introductory offer is the 
cheapest method to signal quality—a question discussed 
in the supplementary section. 

If persistence in the market is not an issue—i.e., if the 
low-quality producer keeps his goodwill (and is simply 
forced to charge a low price once consumers are in- 
formed)—the high-quality producer may want to sig- 
nal through a high price. This is due to the fact that, 
even under full information about quality, a low quality 
may generate a higher profit margin than a high quality 
because of the cost differential. So it may turn out that
purchases are more valuable to the low-quality producer.
In this case, the way to signal high quality is a high
price to “prove” that the firm is not afraid of contracting
demand.²³ Furthermore, advertising by the high-quality
producer may then be a necessary ingredient of the op- 
timal signaling package (Milgrom and Roberts 1986). 

The reader may be puzzled by the conclusion that low 
prices can signal high quality. In the discussion of moral 
hazard, emphasis was placed on the idea that, in the 
presence of a fraction of directly informed consumers, the 
producer is more afraid of losing consumers as the price 
becomes higher, and that the producer therefore has a 
higher incentive to provide high quality. The circum- 
stances under which a low or a high price signals high 
quality differ. Low prices correspond to the existence of 
repeat purchases, high prices to that of informed consumers.
A manager planning a strategy to introduce new
items or an outsider observing the launching of new products
should keep these two models in mind and try to
accurately determine the nature of demand. Naturally, a
mixing of the two ingredients (informed customers and
repeat purchases) is likely to blur the informational content
of prices. For instance, introducing some initially
informed customers into the repeat-purchase model ought
to make the high-quality producer more reluctant to offer
low introductory prices (because he can charge a high
price to informed consumers); it also makes wasteful
and conspicuous expenditures such as advertising a more
likely signal of quality.


22. In the following, “the high-quality producer” will be used as shorthand for
“the producer when the quality is high.”

A reader who is not familiar with signaling theory may want to glance
through the review of the Spence signaling model in the Game Theory User's
Manual. Here, a low price may signal high quality if it attracts more consumers
than a high price, and the marginal revenue of an additional consumer is higher
for a high-quality monopolist than for a low-quality one.


23. In the Bagwell-Riordan (1986) model of intertemporal pricing, the high-quality
firm signals its quality through high prices for precisely this reason.
Because in the Bagwell-Riordan model a high-quality product is more costly to
produce than a low-quality one, a high-quality monopolist has incentive to
signal with a high price instead of a low one. (A low-quality monopolist has
more to lose from a demand contraction.) An interesting innovation of their
model is that there is a continual mix of “new” and “old” consumers. Over
time, the proportion of old consumers grows as some consumers who had not
yet experimented with the product try it. The growing fraction of old consumers
(“informed consumers,” in the jargon of subsection 2.3.1.1) reduces the incentive
to signal, and the high-quality monopolist eventually lowers his price to
the full-information (high-quality) monopoly price. Bagwell and Riordan thus
obtain a declining price path (in contrast with the increasing price path obtained
when signaling is performed through an introductory offer).

See Crémer 1984 and Riordan 1986 for somewhat different models, in which
the seller can commit to prices over a period of repurchases and in which a
declining price path is obtained when the consumers are uncertain of their taste
for the product (Crémer) or of the quality of the product per se (Riordan).


Let us next consider the other polar case, in which the 
monopolist chooses a new quality every “period” (where 
a period means the length of time it takes consumers to 
learn the quality). Repeat purchases can play a role in 
avoiding the moral-hazard problem described in subsec- 
tion 2.3.1.1 only if a high quality yesterday somehow sig- 
nals that the monopolist is likely to choose a high quality 
again today, i.e., if the monopolist has developed a reputation
for high quality. The supplementary section describes
two models that explain how “reputation” can form: the 
Klein-Leffler/Shapiro model of quality premia and the 
Kreps-Wilson/Milgrom-Roberts model of asymmetric 
information. 

Roughly, the model of quality premia is based on the 
idea that in a repeated game the consumers can react to 
the monopolist’s choice of low quality by not repeating 
their purchase. This reaction constitutes a punishment for 
the monopolist only if high quality commands a profit 
margin. Such a margin is called a “quality premium.” One 
can thus construct equilibria in which the monopolist 
sustains a high quality through fear of retaliation by con- 
sumers. The consumers purchase as long as quality re- 
mains high. The quality premium must be such that the 
cost associated with the future loss of sales exceeds the 
current cost savings of cutting quality. (This story re- 
quires an infinite horizon. With a finite horizon T, the 
monopolist has incentive to cut quality in the last period, 
and therefore in the next to last period, and so on. The 
market equilibrium then unravels to become the bad equi- 
librium described in subsection 2.3.1.1 repeated T times. 
The asymmetric-information story does not require an 
unbounded horizon.) 
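
The size of the required quality premium can be sketched under simple assumptions. These are illustrative and are not the specification of the supplementary section: one unit is sold per period at a constant price p (which must not exceed the consumers' valuation), the monopolist discounts future profits by a factor delta, and consumers stop buying forever after observing low quality. The function names are hypothetical. Keeping quality high is then worth (p - c1) / (1 - delta), while cutting quality yields p - c0 once and nothing afterwards.

```python
def min_price_sustaining_quality(c0, c1, delta):
    """Lowest per-period price at which high quality is self-enforcing.

    Solves (p - c1) / (1 - delta) = p - c0 for p.
    """
    if not 0 < delta < 1:
        raise ValueError("delta must lie strictly between 0 and 1")
    return c0 + (c1 - c0) / delta


def sustains_quality(p, c0, c1, delta):
    """Check (p - c1) / (1 - delta) >= p - c0 directly."""
    return (p - c1) / (1 - delta) >= p - c0
```

With c0 = 1, c1 = 4, and delta = 0.5, the lowest sustaining price is 7, which corresponds to a premium of 3 over the high-quality cost. The required premium grows as delta falls, in line with the remark that consumers must repeat their purchases sufficiently often.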


The Kreps-Milgrom-Roberts-Wilson theory of reputa- 
tion is based on the consumers’ incomplete information 
about the monopolist. More precisely, it starts from the 
premise that the consumers are not quite sure that the 
monopolist has an incentive to provide a low-quality 
product even in a one-shot relationship (i.e., that he sup- 
plies high quality “naturally”). For instance, there can be 
at least a small probability that high quality is actually 
not more costly than the low one; or else profit maximiza- 
tion may not be the monopolist’s only objective, and 
“honesty” —the reluctance to provide a low quality when 
one announces the high one—may lead the monopolist 
to provide high quality even when his reputation is not at 
stake. This literature shows that, even if the monopolist’s 
static profit-maximizing choice is a low quality, he may 
want to sustain high quality (a reputation) for a while. 
Doing so, he entertains the possibility that he might sup- 
ply quality naturally (and therefore keep providing the 
high quality in the future); and, because of their confusion 
about the real “identity” of the monopolist, consumers are 
willing to repeat their purchase. If the horizon is bounded, 
the profit-maximizing type milks his reputation at the end 
by cutting quality, which prompts consumers to stop 
purchasing after observing the low quality. The main 
insight of this literature is that even a small probability 
that the monopolist is not profit-maximizing induces the 
profit-maximizing type to develop a reputation as long as 
consumers repeat their purchases sufficiently often (the 
horizon is long enough). The supplementary section dis- 
cusses and compares the two approaches to reputation. 

When are repeat purchases likely to induce a firm to 
provide quality?²⁴ Clearly, two necessary conditions are
that the consumers learn the quality of a purchased item 
sufficiently quickly and that they renew their purchase 
sufficiently often. Only under those conditions does the 
manufacturer have an incentive to provide quality. These 
conditions, it can be argued, can be met by a restaurant 
that caters to a stable population (or, in the case of a chain 
of franchised restaurants, if the franchiser can monitor the 
franchisees’ quality). 

It is convenient to distinguish between two polar types
of qualities. The first type, which is the one considered
in the supplementary section, has been described as the
“vertical-product-space model”: The product has some
characteristic which the consumers enjoy. The consumers’
gross surplus associated with the consumption of the
product at a given instant is an increasing function of the
amount of this attribute. The second type is durability.
The consumers’ gross surplus is fixed as long as the product
“works.” Quality is then measured by the amount of
time between purchase and breakdown of the product.
The reputation mechanism, although certainly possible,
may be less efficient in this second type of quality. The
issue is that a high-quality item (i.e., a durable one)
generates fewer repeat sales than a low-quality one. In the
extreme case, a product that never breaks down does not
generate repeat sales; hence, the producer does not want
to produce such a product, because it is more expensive
than a low-durability one (this contrasts with Swan's
optimal-durability proposition, which assumes that the
durability is observable before purchase).²⁵


24. Repeat purchases need not be made by the same consumer at the same
outlet or of the same good in order to have an effect. First, word of mouth
between consumers may do the job; one may deal with a real-estate agent or a
contractor infrequently, but learn about the quality of their recent services
through family or friends. Second, brand names and chains can support the
development of reputation when the repeat purchases concern similar but
different goods.


2.3.3 Quality, Information, and Public Policy 


The mere fact that some products may have low quality 
or fail is not by itself an argument for government inter- 
vention, because the government may be facing the same 
informational difficulties as the consumers. On the other 
hand, there is no reason why the market allocation should 
be efficient even when informational asymmetries are 
taken account of. The purpose of this subsection is to give 
some hints as to when government intervention is likely 
to be desirable.²⁶


2.3.3.1 Failure of the Coase Theorem and Product Liability


To assess potential policy interventions, we first assume 
that the government a priori has the same information 
about product quality that a consumer has before making 
a purchase. 

Such an assumption raises the question of what a government
can do that cannot be mimicked by the producer
through contract with consumers. Any efficiency gain
stemming from government intervention, so runs the
argument, could also be obtained by the monopolist
through detailed contract provisions; furthermore, the
monopolist could appropriate these efficiency gains by
raising the price. Thus, any desirable government intervention
would already be realized by private means,
implying that there is no scope for such interventions. This
loose reasoning, it seems, might be supported by an extension
of the Coase theorem.²⁷ But, as we shall now see,
the Coase theorem is not likely to be applicable here.

The efficiency of private contracts generally requires 
perfect information, the absence of transaction costs, and 
the absence of externalities toward third parties. 


Imperfect Information 


Imperfect information is the basis for the quality issue. 
However, one might think that, even though the equilib- 
rium may not be fully Pareto efficient, it might be (con- 
strained) Pareto efficient relative to the structure of infor- 
mation. (Because the government is assumed to have the 
same information as the consumers, it cannot enlarge the 
scope of private contracts.) This conjecture turns out to 
be false. In a second-best situation, there are generally 
externalities between economic agents that must be cor- 
rected. In a one-shot relationship, informed customers 
exert a positive externality on uninformed ones. The 
government may then improve welfare by subsidizing the 
acquisition of information. 


Transaction Costs 


Transaction costs may lead to incomplete contracts. This 
is particularly true for contracts between a firm and its 
customers. Often, the contract does not exist (is implicit) 
or is of a standard form. To illustrate the problem, let us 
modify our framework slightly and assume that ex post 
some dimensions of quality are publicly observable. For 
instance, consider the case of a soda-pop bottle which has 
a small probability of exploding. A consumer who buys a
soda pop will not bother to require that the manufacturer
sign a detailed contract; doing so would involve costs
incommensurate with the surplus associated with the consumption
of this particular good. However, efficiency
calls for the producer to be liable in case the bottle
explodes and hurts the consumer. This is required to
give the firm incentives to produce safe bottles. The law
may thus be a substitute for incomplete contracts. This is
actually a major argument in favor of product liability
(caveat venditor).²⁸


25. For an evolutionary model of product durability, see Smallwood and Conlisk
1979.


26. A complete study of public policy with regard to product quality lies
outside the scope of this book.
There exists a large and interesting literature in law and economics that
covers the topics of this subsection (and more). See, e.g., Shavell 1980, 1984
and Polinsky 1983, and the legal references therein.


27. The Coase theorem (1960) asserts that an optimal allocation of resources
can always be achieved through market forces, irrespective of the legal liability
assignment, if information is perfect and transactions are costless. (This theorem
should not be mistaken for the Coase conjecture studied in chapter 1.)

Opponents of product-liability legislation might object 
to the previous argument by arguing that the producer 
would be better off supplying a standard-form contract 
that promises to compensate consumers in case of an 
accident. However, the consumers usually do not have 
time to read the contract, or do not understand its sub- 
tleties. Furthermore, there may exist contingencies that 
are not foreseen by consumers; if these contingencies are 
detrimental to the consumers, the producer will not draw 
attention to them.²⁹ For instance, a consumer may foresee
that the paint he uses for his house may not last, but not
be aware that this paint may contain lead and poison his
children.

We conclude that the Coase theorem is unlikely to 
apply here and that selective government intervention 
may be desirable. 


2.3.3.2 Creation of Information 


We assumed that the government does not acquire supe- 
rior information relative to the consumers. Sometimes 
quality tests may speed up consumer learning.³⁰ This
applies to experience goods. It applies with a vengeance
to credence goods, for which information is never ob- 
tained or is obtained slowly and too late; take the ex- 
ample of a chemical substance that causes cancer or affects 
genes. This example illustrates a limit of product liability. 
First, consumers can hardly be compensated for the dam- 
age. Second, a firm that faces the threat of a series of very 
costly lawsuits usually goes bankrupt. This means that the 
punishment is mild relative to the damage.³¹ In such a
case, product liability is a very incomplete inducement to 
supply the right quality (in the above case, to make sure 
that the chemical substance is not highly toxic). Direct 
quality controls may then complement, or even supple- 
ment, product-liability laws. 


2.3.3.3 Consumers’ Misperceptions 


Product liability has also been supported on the basis of 
consumers’ misperceptions. In this chapter (as in almost all 
of the book), we assume that consumers form their expec- 
tations rationally. Even though they may not be fully 
informed about product quality, they exhibit no syste- 
matic bias in their predictions. Spence (1977) assumes that 
consumers systematically overestimate the probability 
that the product will not fail. The producer, who has 
rational expectations, then gains from offering a low warranty
in exchange for a low price.³² The consumers are
willing to accept this arrangement, because they wrongly
perceive the probability of failure as low. In turn, this low
warranty induces the producer to undersupply reliability.
Such misperceptions naturally call for the government to
increase liability beyond the private-contract level.³³,³⁴


28. Of course, assigning the liability to the consumer (caveat emptor) may still
make sense when the main moral hazard lies on the consumer side.


29. This argument was suggested by D. Scharfstein.


30. Testing could also be performed by consumer associations or the like. Of
course, information is a public good. It is collectively valuable. However, each
consumer would prefer other consumers to generate this information, because
this saves the costs of testing. The usual resolution of a public-good issue is
through collective action.


31. The punishment should actually be higher than the damage, to correct for
the probability that the damage is not discovered.


32. Spence assumes that consumers are risk-averse. (If they were risk-neutral,
the solution would be unbounded, as is well known from the theory of bets
with divergent prior views of the world.) See Spence’s article for an example in
which a two-part liability, to the customer and to the government (or a third
party), is optimal.


33. Of course, the issue is whether consumers do underestimate risks. There is
some evidence in this direction (see Calabresi 1970). The remark that “such
things always happen to the other guy” is common. In particular instances, such
proclaimed misperceptions may be due to cognitive dissonance; i.e., they may
be ex post attempts to justify the agent's behavior (carelessness, signature of
incomplete contracts, etc.); this should be investigated.


34. An alternative model of consumer misperceptions is that of Shapiro (1982).
Shapiro's consumers have, at each instant of time $t$, utility $U(q(t), s)$, which
depends on the flow quantity purchased and the true quality of the good.
($s$ is a general quality parameter and should not necessarily be interpreted as
reliability.) Instead of knowing or inferring $s$, the consumers have a (mis)perception
of quality $R(t)$, where $R$ stands for reputation. $R$ need not coincide
with $s$. Consumers start with some perception $R(0)$. Shapiro, however, allows
for long-term learning. The reputation grows (or decreases) over time with the
difference $s - R(t)$. Shapiro shows that, under very general conditions, any
self-fulfilling quality levels (i.e., satisfying $R(0) = R(t) = s$ for all $t$ and being
part of the solution of the monopolist’s intertemporal profit-maximization
problem given $R(0)$) lie below the quality level that would be chosen by the
monopolist under full information about product quality. A similar result is
obtained when the monopolist can affect his quality over time.

2.4 Advertising 


Advertising has long been perceived as wasteful and 
manipulative. One reason for this may be that advertising 
is one of the topics in the study of industrial organization 
for which the traditional assumptions (especially those 
with regard to consumer behavior) are strained most. The 
advertising of a product has strong psychological and 
sociological aspects that go beyond optimal inferences 
about objective quality. For instance, ad agencies con- 
stantly try to appeal to the consumers’ conscious or un- 
conscious desire for social recognition, a trendy lifestyle, 
and the like. Advertising also has important economic 
aspects (Dorfman and Steiner 1954; Kaldor 1940-41; 
Nelson 1970, 1974; Schmalensee 1972). Naturally, we 
shall focus on the economic side.³⁵

We can distinguish between advertising that conveys
“hard” (direct) information and advertising that conveys
“soft” (indirect) information or none at all. Hard informa- 
tion includes the existence of the product, its price, the 
retail outlets in which it is distributed, its physical appear- 
ance, and so on. Most TV commercials provide no infor- 
mation beyond existence. As Nelson (1974) notes, if 
advertising were solely concerned with distributing hard 
information there should be much more advertising for 
search goods (goods whose quality can be assessed be- 
fore a purchase). The data contradict this prediction: 
Experience goods are advertised heavily. 

The Dorfman-Steiner model, reinterpreted (as in foot- 
note 8) to describe advertising as a bunch of messages 
about product existence, offers a simple paradigm of hard 
advertising. The signaling model suggested in section 2.3 
and taken up in the supplementary section concerns soft 
information. Because we have already developed these 
models, and because some interesting features of advertis- 
ing are related to product competition, we will leave the 
analysis of advertising under monopoly at this point.³⁶


2.5 Concluding Remarks 


Even when product characteristics are observable before a 
purchase (as in the case of a search good), a monopolist 
generally chooses these characteristics in a socially suboptimal
way because of his concern for the marginal consumer
rather than the average consumer. Although the
existence of a bias in product selection is fairly clear, its 
sign is model dependent. For instance, product quality can 
be oversupplied or undersupplied by the monopolist. A 
closely related issue is that of product diversity. If only 
one product is at stake, the monopolist tends to under- 
supply the product (i.e., not to introduce it when it would 
be optimal to do so), because he does not internalize the 
associated net consumer surplus. A multiproduct monop- 
olist, however, can introduce too many goods, because 
pricing one product above the marginal cost may create 
an artificial demand for another of his products. 

A further issue arises in the case of experience goods. 
Then, for moral-hazard and adverse-selection reasons, 
quality tends to be undersupplied. Consumer information, 
repeat purchases, warranties, and signaling by the monop- 
olist (price, advertising) may alleviate this informational 
problem. 

Although the bias in product selection is now rela- 
tively well understood at a theoretical level, work remains 
to be done to see how these models apply to specific 
industries. Also, this chapter puts too much emphasis 
on the positive side. A more careful welfare analysis, 
based on a study of the structure of information (includ- 
ing the government's information) and of the alternative 
policy instruments (fiscal, legal, and regulatory) would be 
desirable. 


35. For a challenging analysis of welfare when advertising affects tastes, see
Dixit and Norman 1978. See also the comments of Fisher and McGowan (1979)
and Shapiro (1980).

36. This section does even less justice to its field than other sections. The
interested reader will consult, beyond the works already mentioned, the references
in Schmalensee 1986.


2.6 Supplementary Section: Repeat Purchases


As was suggested above, the possibility of repeat pur- 
chases can operate in two different ways. If quality is 
unalterable, current observation of quality brings direct 
information about future quality. If quality can be manipu- 
lated over time, current quality may still act as a signal. 

We begin our analysis of repeat purchases by assuming 
that the quality of a good is exogenously given and 
unalterable. In this simple framework, there are two main 
questions: (1) What is the value of goodwill to the pro- 
ducer, and should the producer make “introductory of- 
fers” to broaden his goodwill? (2) If the producer knows 
the quality of the good, can he signal this quality through 
a judicious choice of price, advertising, etc.? 

To distinguish the first question from the second, we 
develop a model in which the producer and the con- 
sumers have the same imperfect information. The infor- 
mational problem arises only from the fact that a given 
consumer does not know in advance whether he will like 
a product. Liking a product is a matter of “match” be- 
tween the consumer and the product rather than a matter 
of quality, in that the probability of liking the product (a 
measure of quality) is known. The issues associated with 
the producer's superior knowledge of quality (question 2) 
are then taken up. That discussion is followed by an 
analysis of reputation in models with sequential moral 
hazard. 


2.6.1 Repeat Purchases in the Absence of Moral Hazard


2.6.1.1 Goodwill and Introductory Offers³⁷


Consider the following matching model. There are a 
large number of consumers. A consumer with taste $\theta$ has
the following per-period preferences for the good produced
by the monopolist:

$$u = \begin{cases} \theta s - p & \text{if he buys at price } p, \\ 0 & \text{otherwise,} \end{cases}$$

where $s$ denotes the quality to the consumer (see section
2.1). Quality here is purely “idiosyncratic” or “subjective”;
in other words, it is not correlated between consumers.
There are two potential levels of idiosyncratic quality:
$s = 0$ (“no match”) and $s = 1$ (“match”). The probability
of a match between the consumer and the product is $x$,
belonging to $(0, 1)$. As was mentioned earlier, $x$ is common
knowledge. The taste parameter $\theta$ is distributed over
the population of consumers with the cumulative distribution
$F(\theta)$. We normalize the population to be equal to 1
(without loss of generality). The unit cost of producing
the good is $c$.

There are two periods: $t = 1, 2$. The monopolist charges
$p_1$ and $p_2$. For simplicity, we assume that he cannot commit
himself to a second-period price at date 1.³⁸ Consumers
who have purchased the good in period 1 know
whether they like it or not; they purchase in period 2 if
and only if they liked the product in period 1 and $\theta \ge p_2$.
By the law of large numbers, the proportion of consumers
who like the product is $x$.

First, consider the case in which the consumers and the
producer are myopic. Their common discount factor $\delta$ is
then equal to 0. In the first period, a consumer with
parameter $\theta$ buys at price $p_1$ if and only if his expected surplus
from buying is positive: $E(\theta s) - p_1 \ge 0$, or $\theta \ge p_1/x$.
Thus, the demand at price $p_1$ is $1 - F(p_1/x)$. The monopolist’s
first-period profit is

$$(p_1 - c)[1 - F(p_1/x)].$$

For the consumers, a price $p_1$ associated with a probability
$x$ of liking the product is equivalent to a price $p_1/x$ to be
paid only if they are satisfied (if such a contract can be
enforced):

$$\theta x - p_1 = x(\theta - p_1/x).$$

Let us thus introduce $\tilde p_1 \equiv p_1/x$, and let $\tilde c \equiv c/x$. The
monopolist’s first-period profit can be written as

$$x(\tilde p_1 - \tilde c)[1 - F(\tilde p_1)].$$


This profit is nothing but the profit that the monopolist
would realize if the fraction $x$ of consumers who are going
to like the product knew it in advance, except that the unit
production cost is the production cost “per satisfied customer,”
$\tilde c$, instead of $c$. The real cost is higher than $c$
because the monopolist cannot predict matches. Let $p^m(y)$
denote the monopoly price under full information when
the unit cost is $y$; i.e., $p^m(y)$ maximizes $x(p - y)[1 - F(p)]$.


37. This section is based on Farrell 1984, 1986 and Milgrom and Roberts 1986.

38. As in Farrell 1986, little would be affected by the possibility of commitment.

In the first period, the monopolist chooses $\tilde p_1 = p^m(\tilde c)$.
Because $\tilde c > c$, $\tilde p_1 > p^m(c)$ (see chapter 1), or $p_1 > x p^m(c)$.
We conclude that the price exceeds the full-information
monopoly price adjusted for the probability that the consumers
will not like the product. This is due to the fact that
the unit cost per customer satisfied exceeds the unit cost
of production (unless the latter is 0).
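
As an illustration of the first-period pricing rule, the following minimal sketch assumes that tastes are uniformly distributed on [0, 1], so that $F(\theta) = \theta$ and $p^m(y) = (1 + y)/2$. The uniform distribution, the function names, and the parameter values $x = 0.5$ and $c = 0.2$ are assumptions made for the example; they do not come from the text.

```python
def monopoly_price_uniform(unit_cost):
    """p^m(y) for taste uniform on [0, 1]: maximizes (p - y) * (1 - p)."""
    # First-order condition: (1 - p) - (p - y) = 0, so p = (1 + y) / 2.
    return (1.0 + unit_cost) / 2.0


def first_period_prices(x, c):
    """Return (p1, p1_tilde) for match probability x and unit cost c."""
    c_tilde = c / x  # production cost per satisfied customer
    if not 0.0 <= c_tilde < 1.0:
        raise ValueError("the sketch assumes 0 <= c / x < 1")
    p1_tilde = monopoly_price_uniform(c_tilde)
    return x * p1_tilde, p1_tilde


x, c = 0.5, 0.2  # illustrative values only (assumption)
p1, p1_tilde = first_period_prices(x, c)
benchmark = x * monopoly_price_uniform(c)  # x * p^m(c)

# Brute-force check of the closed form on a grid of adjusted prices.
grid = [i / 1000 for i in range(1001)]
grid_argmax = max(grid, key=lambda p: (p - c / x) * (1 - p))

print(p1, p1_tilde, benchmark, grid_argmax)
```

With these values the first-period price lies above $x p^m(c)$, in line with the comparison made in the text.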

Consider now the second-period price in this myopic
setting. The second-period demand curve is shown in
figure 2.3. The monopolist has two strategies: to cater
only to his goodwill (first-period) customers or to try to
reach new customers. Let us start with the first strategy.
The marginal consumer in period 1 has parameter
$\tilde p_1 = p^m(\tilde c) > p^m(c)$. Since the first-period customers are perfectly
informed and the marginal cost of production is $c$,
the monopolist charges $p_2 = \tilde p_1$.³⁹ The monopolist’s second-period
profit is then

$$x(\tilde p_1 - c)[1 - F(\tilde p_1)].$$


The second strategy, charging $p_2 < x\tilde p_1$, attracts new
customers. The monopolist’s second-period profit is then

$$(p_2 - c)\{x[1 - F(\tilde p_1)] + F(\tilde p_1) - F(p_2/x)\}.$$

Such a strategy is dominated: notice that the profit can
be written as

$$(p_2 - c)\{x[1 - F(\tilde p_1)] + F(\tilde p_1) - F(\tilde p_2)\} < x(\tilde p_2 - \tilde c)[1 - F(\tilde p_2)],$$

where $\tilde p_2 \equiv p_2/x$.
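
The dominance argument can be spelled out step by step. The chain below is a sketch that uses only the definitions $\tilde p_2 \equiv p_2/x$, $\tilde c \equiv c/x$, and $\tilde p_1 = p^m(\tilde c)$. It additionally assumes $p_2 > c$ (otherwise the profit is nonpositive and the comparison is immediate), $F(\tilde p_1) < 1$, and $c > 0$.

```latex
\begin{aligned}
(p_2 - c)\{x[1 - F(\tilde p_1)] + F(\tilde p_1) - F(\tilde p_2)\}
  &= x(\tilde p_2 - \tilde c)\{[1 - F(\tilde p_2)] - (1 - x)[1 - F(\tilde p_1)]\} \\
  &< x(\tilde p_2 - \tilde c)[1 - F(\tilde p_2)] \\
  &\le x(\tilde p_1 - \tilde c)[1 - F(\tilde p_1)] \\
  &< x(\tilde p_1 - c)[1 - F(\tilde p_1)].
\end{aligned}
```

The first line uses $p_2 - c = x(\tilde p_2 - \tilde c)$, the third uses the fact that $\tilde p_1$ maximizes $x(p - \tilde c)[1 - F(p)]$, and the last uses $c < \tilde c$. The final expression is the profit from catering to the goodwill only.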

Thus, with myopic consumers and a myopic monopolist,
the monopolist charges $p_2 = p_1/x$ and caters exactly
to his first-period customers (those with whom he has
established goodwill). The second-period price is higher
than the first-period one; this is natural, as the customers
know that they like the product.
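
The comparison between the two second-period strategies can be checked numerically. The sketch below again assumes tastes uniform on [0, 1] and the illustrative values $x = 0.5$ and $c = 0.2$; the function name and the grid search over $p_2$ are hypothetical devices for the example, not part of the model.

```python
def second_period_profits(x, c, steps=2000):
    """Compare the two second-period strategies for taste uniform on [0, 1].

    Assumes 0 <= c / x < 1 so that the first-period optimum is interior.
    """
    c_tilde = c / x
    p1_tilde = (1.0 + c_tilde) / 2.0  # p^m(c_tilde) under the uniform assumption
    p1 = x * p1_tilde

    # Strategy 1: charge p2 = p1_tilde and sell only to satisfied first-period buyers.
    goodwill_only = x * (p1_tilde - c) * (1.0 - p1_tilde)

    # Strategy 2: a price p2 < p1 also attracts untried consumers with theta >= p2 / x.
    def profit_with_new_customers(p2):
        demand = x * (1.0 - p1_tilde) + p1_tilde - p2 / x
        return (p2 - c) * demand

    candidates = [p1 * i / steps for i in range(steps)]  # grid on 0 <= p2 < p1
    best_new = max(profit_with_new_customers(p2) for p2 in candidates)
    return goodwill_only, best_new


goodwill_only, best_new = second_period_profits(x=0.5, c=0.2)
print(goodwill_only, best_new)
```

In this example the best price that reaches new customers earns strictly less than catering to the goodwill only.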

Does this mean that the monopolist makes an “introductory
offer”? Notice that the consumers would, in the
first period, be indifferent between paying $p_1$ for sure and
paying $\tilde p_1$ conditional on their liking the product. Let us,
with Farrell (1984), define an introductory offer as a first-
period price that is strictly lower than the second-period
(full-information) price times the first-period probability
of the customers’ liking the good.⁴⁰ In this sense, the
monopolist makes no introductory offer, since $p_1 = x p_2$ exactly.


Figure 2.3
Demand curve with goodwill. [Figure not reproduced. It plots second-period demand against $p_2$: demand is $x[1 - F(\tilde p_1)]$ when the monopolist caters to the goodwill only, and $x[1 - F(\tilde p_1)] + F(\tilde p_1) - F(p_2/x)$ when he reaches new customers.]

Let us more generally consider nonmyopic parties. The
monopolist and the consumers have a common discount
factor, $\delta$. A first intuition would suggest that, when taking
a dynamic perspective, the monopolist should try to ac- 
cumulate a large clientele in the first period so as to take 
advantage of it in the second period. This intuition, how- 
ever, is flawed (Farrell 1986; Milgrom and Roberts 1986). 

As in the myopic case, it is easy to show that the 
monopolist does not try to reach a new clientele in the 
second period. Contrary to the case of a durable-good 
monopolist discussed in chapter 1, a period-2 decrease in 
price to reach new customers also benefits period-1 cus- 
tomers. Overall, the monopolist is better off resting on 
his laurels and milking his goodwill. 

The interesting question is whether the monopolist
would like to cater only to part of his goodwill clientele.
To show that such a strategy is not profitable requires
two steps. First notice that, if this were the case, the
marginal consumer in period 1 would make a myopic
decision in that period, because he knows that he will not
buy in period 2. Hence, the demand at price $p_1 = x\tilde p_1$ is

$$1 - F(\tilde p_1),$$

and the firm's first-period profit is the same as in the
myopic case:

$$x(\tilde p_1 - \tilde c)[1 - F(\tilde p_1)].$$

Second, suppose that the firm charges $p_2 > \tilde p_1$ in period 2.
Only consumers with parameters $\theta \ge p_2$ will purchase,
conditional on their having liked the good in period 1.
Hence, the second-period profit is

$$x(p_2 - c)[1 - F(p_2)].$$

Because $c < \tilde c$, $p_2$ cannot exceed $\tilde p_1$. Hence, even with a
dynamic perspective, the monopolist makes no introductory
offer.


39. The reader may wonder whether a concavity assumption on the profit
function is not required for this argument. A more careful argument runs as
follows: The monopolist's profit function for prices exceeding $\tilde p_1$ is the same in
the second period as in the first, except that the unit cost has become $c$ instead
of $\tilde c$. Thus, the second-period price cannot exceed the (adjusted) first-period price $\tilde p_1$. On the
other hand, prices under $\tilde p_1$ either do not attract new customers (and then are
dominated by $\tilde p_1$) or attract new customers (a case studied in the text).


40. More generally, one could adjust each period's price by the expected
quality of the good at the beginning of the period.


Exercise 2.6" A dynamic perspective, however, leads to 
a greater goodwill. Show that the first-period and second- 
period prices decrease with the discount factor $\delta$. Give the
intuition for this result. 


The no-introductory-offer result can be interpreted as 
meaning that there is no point in “bending over backwards”
to attract customers to whom one does not intend
to cater later. 


2.6.1.2 Signaling an Existing Quality


Let us now turn to the possibility that a producer who 
knows the quality of his good signals it through his 
choice of price, advertising, and so on. Modify the pre- 
vious model in two respects. First, assume that s is per- 
fectly correlated between consumers instead of being 
uncorrelated—in other words, that all or none of the con- 
sumers like the product. Then s is an objective measure 
of quality (vertical product space); $s = 0$ denotes a low
quality and $s = 1$ a high quality. Second, suppose that the
monopolist knows the quality of his product, whereas the
consumers learn it only if they try the product. Assume
further that only consumers who try the product learn the
quality.⁴¹ Let $c_0$ ($c_1$) denote the unit cost of producing a
low-quality (a high-quality) item.

The monopolist would always want the consumers to 
believe that the quality is high, regardless of the true 
quality. The consumers ought to understand this, and the 
question arises whether the monopolist can credibly sig- 
nal a high quality. 

In this model, the monopolist has two potential instru- 
ments for signaling: the price of the product, and con- 
spicuous (and wasteful) expenditures. An example of the 
latter is an advertising campaign that conveys no direct 
information about the quality of the product. There is 
some similarity between the two instruments (for in- 
stance, from the monopolist’s viewpoint, an introductory 
offer is a bit like throwing money away). They differ 
in that introductory prices entail a cost proportional 
to demand, whereas wasteful expenditures are a fixed 
cost. Also, consumers enjoy introductory offers, whereas 
wasteful expenditures bring them no direct benefit. 

Nelson (1974) suggested that wasteful expenditures (or 
introductory prices) may signal quality. His idea is that a 
high-quality product generates repeat purchases. There- 
fore, the future returns from the formation of goodwill are 
higher for a high-quality product than for a low-quality 
one. Suppose that consumers associate high quality with 
some minimum amount of wasteful expenditures. Then
the monopolist may want to waste this amount if the 
quality of his product is high in order to create good- 
will and reap future benefits. A monopolist producing a 
low-quality product may not want to do so. In this case, 
the consumers’ expectations are fulfilled, even though 
wasteful expenditures convey no direct information about 
quality. 

Schmalensee (1978) pointed out that it is not clear that
a high-quality monopolist has a higher incentive to attract
customers than a low-quality one. Although the high-
quality monopolist’s future gains from doing so are likely
to be higher (per Nelson's argument), his current gains
may well be lower. This is due to the fact that a high-quality
monopolist is likely to have higher production
costs than a low-quality monopolist ($c_1 > c_0$). Thus, for a
given first-period price $p_1$, the profit margin per customer
is higher for a low-quality monopolist: $p_1 - c_0 > p_1 - c_1$.
This effect may render wasteful expenditures ineffective
as a signal of quality. Klein and Leffler (1981), and especially
Kihlstrom and Riordan (1984) and Milgrom and
Roberts (1986), have treated the issue in more detail.


41. If we assumed that consumers can learn by word of mouth or by reading
Consumer Reports, all consumers would know the quality in period 2 regardless
of what happened in period 1 (if aggregate consumption were not zero). Thus,
the dynamics of learning would be uninteresting. Neither the firm nor the
consumers would take a dynamic perspective. Intermediate assumptions are
also possible.

The formal signaling model was developed by Spence 
(1973) to describe how efficient workers can signal their 
ability to employers by engaging in (possibly wasteful) 
education expenditures. Transposed to our quality prob- 
lem, the moral of Spence's analysis is that a high-quality 
producer is able to use an instrument (e.g., price or
wasteful expenditures) to signal his quality if his cost
of (his return from) using this instrument is lower (respectively,
higher) than that of a low-quality producer.

We now consider an example in which an introduc- 
tory price and wasteful expenditures are essentially equiv- 
alent, so that the monopolist has only one way of 
signaling quality. (Multidimensional signaling, a more 
complex issue which will be considered briefly later, re- 
quires the investigation of which instrument is the less 
costly signal.) 

Suppose that all consumers are identical (i.e., have the
same taste parameter $\theta$). Their gross surplus is $\theta$ if the
good is of high quality and 0 if it is of low quality. For
simplicity, assume that only consumers who have consumed
at date 1 can consume at date 2. (As the preceding
subsection suggests, this assumption could be relaxed
without a change in the results.) We will assume that
$x\theta < c_0$, where $x$ is the a priori probability that quality is
high. As we will see, this condition implies that the high-quality
producer is driven out of the market by the possibility
of low quality if he cannot signal his product's
quality. (In the absence of new information, consumers are
willing to pay $x\theta$.) Let $A$ denote the level of wasteful
expenditures in the first period. (As is easily seen, there is
no need to engage in wasteful expenditures in the second
period.) These expenditures are ostentatious, in that they
are observed by consumers. If all consumers buy the
good, the monopolist's first-period profit is $p_1 - c - A$.
Thus, an increase in $A$ is equivalent to an equal
decrease in $p_1$. In the following we will focus on price
signaling, assuming that $A = 0$. When signaling by price
and wasteful expenditures are indeed equivalent, it will be
noted.

Suppose that consumers buy in the first period. In the
second period, they do not repeat their purchase if the
quality is low; they do repeat it if the quality is high and
the second-period price $p_2$ does not exceed $\theta$. Thus, the
high-quality monopolist charges $p_2 = \theta$ in the second
period.

Can the first-period price signal (reveal) the quality?
Suppose it does. A low-quality monopolist is then unable
to sell and makes zero profit. The high-quality monopolist
charges a price $p_1$ and makes a profit of

$$\Pi_1 = (p_1 - c_1) + \delta(\theta - c_1).$$

For this to be an equilibrium, the monopolist must not be
willing to duplicate this strategy when his good has low
quality (otherwise, he would do so and $p_1$ would not be
informative). His profit from doing so would be $p_1 - c_0$,
since he does not generate repeat purchases. Thus, we
require $p_1 \le c_0$. This implies that

$$\Pi_1 \le \delta(\theta - c_1) - (c_1 - c_0).$$
Hence, there are two cases:

• $\delta(\theta - c_1) < c_1 - c_0$. In this case, necessarily, $\Pi_1 < 0$.
There exists no revealing (or separating) equilibrium. In
equilibrium, the monopolist charges the same price, regardless
of the quality of the good. The price is thus
uninformative. If $x$ denotes the a priori probability that
the good has high quality, the highest price that can be
charged in the first period is $p_1 = x\theta$. The high-quality
monopolist produces only if $(x\theta - c_1) + \delta(\theta - c_1) \ge 0$.
If this condition is not satisfied, the high-quality monopolist
does not produce. Neither does the low-quality
monopolist, in this model.⁴² Thus, the possibility of a low
quality may drive a high-quality producer out of the
market, as in the lemons problem.⁴³


42. To be more formal: If the high-quality producer sells, the low-quality
one makes a strictly positive profit, as he always has the option of duplicating
the high-quality strategy. However, from Bayes’ law we know that there
must exist some equilibrium price $p_1$ such that the consumers’ posterior
beliefs following $p_1$, $x'(p_1)$, satisfy $[x'(p_1)\theta - c_1] + \delta(\theta - c_1) < 0$ (because
$x = \sum_{p_1} x'(p_1)\,\mathrm{Prob}(p_1)$, where $\mathrm{Prob}(p_1)$ is the probability that $p_1$ is charged in
equilibrium and where $(x\theta - c_1) + \delta(\theta - c_1) < 0$ by assumption). As consumers
are not willing to pay more than $x'(p_1)\theta$, the high-quality monopolist does not
charge $p_1$; hence, $x'(p_1) = 0$. This means that the low-quality monopolist does
not charge $p_1$ either, which contradicts our assumption that $p_1$ is an equilibrium
price. If the high-quality monopolist is out of the market, so is the low-quality
one, because consumers are not willing to pay for a low-quality item.

• $\delta(\theta - c_1) \ge c_1 - c_0$. In this case, there exists a revealing
equilibrium in which the monopolist charges $p_1 = c_0$ if his
good has high quality. He actually does not need to
charge under $c_0$ to signal high quality, because a low-quality
monopolist would never charge below $c_0$, regardless
of how such an action would be interpreted.⁴⁴ The
low-quality monopolist does not sell.

Thus, a revealing equilibrium may or may not exist. 
The cost of signaling is independent of quality (a reduc- 
tion in price reduces the monopolist’s profit uniformly 
over qualities), but the return to signaling depends on 
quality. The existence of a revealing equilibrium hinges 
on the comparison of two effects. The first is the Nelson
effect: A high-quality producer generates repeat purchases,
with value $\delta(\theta - c_1)$. The second is the Schmalensee
effect: A given demand generates a cost differential
$(c_1 - c_0)$ in favor of the low-quality producer (assuming
that $c_1 > c_0$; if $c_1 < c_0$, the revealing equilibrium always
exists). The price can be revealing only if the Nelson
effect dominates the Schmalensee effect. 

The high-quality monopolist in a revealing equilibrium
charges below marginal cost in the first period (if
$c_0 < c_1$) and makes an introductory offer (in the sense
given above).
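
The two cases can be summarized in a short sketch. The function name, the dictionary keys, and the numerical values below are hypothetical and chosen only for illustration; the logic follows the conditions stated in the text, including the maintained assumption $x\theta < c_0$.

```python
def signaling_outcome(theta, c0, c1, delta, x):
    """Classify the two cases of the identical-consumer signaling example."""
    if not x * theta < c0:
        raise ValueError("the example assumes x * theta < c0")

    nelson = delta * (theta - c1)  # discounted value of a repeat purchase
    schmalensee = c1 - c0          # cost advantage of the low-quality type

    if nelson >= schmalensee:
        # Revealing equilibrium: the high-quality type charges p1 = c0, then p2 = theta.
        p1 = c0
        return {"revealing": True, "p1": p1, "p2": theta,
                "profit_high": (p1 - c1) + nelson}

    # No revealing equilibrium: the best uninformative price is x * theta.
    pooled_profit = (x * theta - c1) + nelson
    return {"revealing": False,
            "high_quality_produces": pooled_profit >= 0,
            "pooled_profit_high": pooled_profit}


patient = signaling_outcome(theta=10, c0=3, c1=5, delta=0.9, x=0.2)
impatient = signaling_outcome(theta=10, c0=3, c1=5, delta=0.3, x=0.2)
print(patient)
print(impatient)
```

With the larger discount factor the Nelson effect dominates and the price reveals quality; with the smaller one it does not, and the high-quality producer stays out of the market.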


Remark Let us briefly consider wasteful expenditures
as a substitute signaling device. Letting $p_1$ denote the
high-quality monopolist’s price, we have $p_1 - A = c_0$ in
a revealing equilibrium. The consumers’ utility is $\theta - p_1
= \theta - c_0 - A$. In order for the consumer to purchase, $\theta$
must be $\geq c_0 + A$. Thus, only pairs $\{p_1, A\}$ such that
$p_1 - A = c_0$ and $A \leq \theta - c_0$ can form a revealing
equilibrium. Thus, as long as $A$ does not exceed $\theta - c_0$,
wasteful expenditures and introductory prices are perfect
substitutes. Needless to say, consumers prefer introductory
prices to wasteful expenditures. Hence, if an equilibrium
that Pareto dominates all other equilibria is postulated
to be the “right” or “focal” one, there are no wasteful
expenditures in equilibrium.
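
The admissible pairs in the remark can be enumerated mechanically. This is a small sketch under the assumption that wasteful expenditures are nonnegative and satisfy $p_1 - A = c_0$ and $A \leq \theta - c_0$; the function name, the tolerance, and the numbers are hypothetical.

```python
def is_revealing_pair(p1, A, theta, c0, tol=1e-9):
    """True if (p1, A) satisfies p1 - A = c0 and 0 <= A <= theta - c0."""
    nets_out_to_c0 = abs((p1 - A) - c0) <= tol   # price net of wasteful expenditure equals c0
    consumers_buy = A <= theta - c0 + tol        # equivalent to theta >= p1 when p1 = c0 + A
    return A >= 0 and nets_out_to_c0 and consumers_buy


# Hypothetical values: theta = 5, c0 = 1.
print(is_revealing_pair(p1=1.0, A=0.0, theta=5.0, c0=1.0))  # pure introductory price
print(is_revealing_pair(p1=3.0, A=2.0, theta=5.0, c0=1.0))  # partly wasteful expenditure
print(is_revealing_pair(p1=6.0, A=5.0, theta=5.0, c0=1.0))  # A exceeds theta - c0
```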


The conclusion of this study is that low prices (or 
wasteful expenditures) can signal high quality only if the 
cost differential between qualities is small relative to the 
repeat-purchase premium of high-quality products. 

The result that high quality must be signaled through
a low price depends crucially on our assumption that low
quality would not be profitable under full information (the
unit production cost $c_0$ exceeds $\theta s_0$, which here is zero).
Under this assumption, the low-quality monopolist sells 
for one period at most, i.e., is constrained to use a “fly-by- 
night” strategy. Because at any given first-period price 
a low-quality monopolist earns larger short-term profits 
than a high-quality one, the high-quality monopolist can 
reveal his type only by charging a price that makes it 
unprofitable in the short term for the low-quality monop- 
olist to pretend to be a high-quality one. 

The picture changes dramatically if the low-quality
monopolist can earn profits under full information, i.e.,
if $\theta s_0 > c_0$. Under the assumptions that either all consumers
buy or no consumers buy (remember, consumers
are identical) and that $(\theta s_0 - c_1) + \delta(\theta s_1 - c_1) > 0$ (high
quality is profitable even if it is initially thought to be
low quality), the first-period price is not at all informative.
Hence, the price charged by the firm is independent
of the quality of its product. To see why, first note that
both types will sell in equilibrium. In particular, they can
always generate sales by charging $p_1 = \theta s_0$ and, thus,
make positive profits. Second, as all equilibrium prices are,
by assumption, accepted by all consumers, both types
ought to charge the highest equilibrium price, which
means that the price is uninformative.

If the low-quality monopolist can earn profits under
full information, and if consumers differ (in contrast with
the previous model), then the high-quality monopolist
may signal the quality of his product through a high or a
low price, depending on the values of the parameters.⁴⁵


43. The analogy with the lemons problem is twofold. First, because of the
impossibility of signaling, the producer must charge the same price regardless
of quality. Second, the high-quality producer has a higher cost of producing
than the low-quality one; this is similar to Crusoe’s being more willing to keep
his goat when the goat is more productive.


44. Dominated strategies (charging under $c_0$ for a low-quality monopolist) are
implicitly eliminated here. This suffices to obtain a unique revealing equilibrium.
To obtain a unique equilibrium, one could apply the intuitive criterion (Cho and
Kreps 1987), which refines the notion of equilibrium in dynamic games of
incomplete information. For a more formal treatment, see the Game Theory
User’s Manual.


45. The following discussion is inspired by Milgrom and Roberts 1986.

a result opposite to the one obtained in industries
where the producer tries to convince the consumer that
he will still be around after consumer learning. The point
is that under consumer heterogeneity a high price
generates lower sales than a low price.⁴⁶ The value of
goodwill may be higher or lower to a high-quality
producer than to a low-quality one. It may be higher because
the high quality allows the manufacturer to extract
more consumer surplus in the second period; it may
be lower because high quality costs more, so that
the high-quality manufacturer may want to supply fewer
customers. If prices signal quality at all (which requires
some restrictive conditions), high quality is signaled
through a low (high) price if the first (second) effect
dominates. The following exercise illustrates these two
possibilities.⁴⁷


Exercise 2.7*** Suppose that in the previous model there
are two types of consumers, with taste parameters $\theta_1$ and
$\theta_2$ ($\theta_1 > \theta_2$), in proportions $q_1$ and $1 - q_1$. The high
quality is $s_1$ and costs $c_1$ per unit. The low quality is
$s_0$ and costs $c_0$ per unit ($c_1 > c_0$). Consider two sets of
parameters:


5, = {cq = $5 =0; 0, = 2; =1;5, = 1; 
So=}:0=13<m <3} 

and 

Sy) = {oy =o = 16, = 2:0 =1e 5, =2; 
S$ = 16 =1; 4, <4}. 


(i) Assume first that consumers know product quality.
Show that under $S_1$ the high-quality monopolist sells only
to $\theta_1$ consumers and the low-quality monopolist sells to
all consumers. Show that this conclusion is reversed under
$S_2$.

(ii) Suppose that consumers learn the quality in the first
period, and that only consumers who have purchased in
that period can purchase in the second period. Show that
under $S_1$ there exists a revealing equilibrium in which
the high-quality monopolist charges a higher price than
the low-quality monopolist. Show that under $S_2$ there
exists a revealing equilibrium in which the high-quality
monopolist charges a lower price than the low-quality
one.


Remark Bagwell (1985) provides an alternative sig- 
naling explanation of introductory offers, according to 
which the monopolist signals not the quality of his good 
(which is known to the consumers) but his production 
cost. In a world where consumers must decide each period 
whether to obtain at a cost a price quotation from the 
monopolist (i.e., to engage in search), a low first-period 
price may encourage consumers to come back and get a 
second-period price quotation. A low-cost producer bene- 
fits more than a high-cost producer from repeat pur- 
chases, which makes the firm willing to sacrifice some 
profit in the first period. Hence, consumers are vindicated 
in their beliefs that a low first-period price signals a 
low second-period price. This leads to a theory of shops 
with a reputation for low prices. The idea is similar 
to the reputation story developed below in subsection 
2.6.2.2. 


2.6.2 Adjustable Quality and Reputation 


In the previous subsection we focused on how consumers 
can learn an unalterable quality. As mentioned earlier, 
there are many instances in which a producer can affect 
quality over time. In this subsection we study the other 
polar case, in which the monopolist makes a new quality 
decision each period. The question then arises as to how 
repeat purchases can discipline the producer. Is there 
any mechanism that prevents him from choosing the 
cheapest quality each period? Obviously, if such a 
mechanism exists, it must be the case that the observation 
of a low quality today generates pessimistic consumer 
expectations for the future. We now consider two 
models in which such expectations are rational. In both 
models, we assume that the monopolist’s choice of 
quality is learned by all consumers one period after the 
purchase. 


46. At least it must be so in an equilibrium; otherwise, the low price would
never be charged.


47. Milgrom and Roberts (1986) offer examples in which the high-quality
monopolist uses both advertising and prices to signal. See their paper for more
details.


2.6.2.1 Quality Premia and Bootstrap Equilibria


We first consider a simplified version of the Klein-Leffler
(1981)-Shapiro (1983) model of quality premia.⁴⁸ Consider
the model of subsection 2.6.1.2. There are two
potential qualities: low ($s = 0$) and high ($s = 1$). The unit
cost of producing the low (high) quality is $c_0$ (respectively,
$c_1 > c_0$). Consumers are all identical and have per-period
surplus $\theta s - p$ if they buy at price $p$ a good with
quality $s$, and 0 if they do not buy. In this subsection we
modify the model in three respects. First, the monopolist
chooses quality in each period (in 2.6.1.2 we assumed that
quality was exogenous). Second, there are an infinite
number of periods: $t = 1, 2, \ldots$. As we shall see, the
results cannot be derived in a two-period or, more generally,
in a finite-horizon model. Let $\delta = 1/(1 + r)$ denote
the discount factor. Third, all the consumers learn at the
beginning of period $t + 1$ the quality chosen by the
monopolist at date $t$. This may result, for instance, from
an article in Consumer Reports or from efficient word-of-mouth
communication between consumers. We make this
assumption to abstract from the considerations concerning
the number of informed consumers, which were central
to the previous subsection.

If there were only one period, the producer would have
an incentive to choose quality $s = 0$ regardless of the
price he charged, and the consumers would not purchase
the good. Thus, in the one-period model there is an extreme
lemons problem⁴⁹ (see section 2.3).

Let us look for an equilibrium of the following type: 


• Consumers base their expectations of quality on the
firm’s “reputation.” This reputation at $t$ is measured by
the quality chosen by the monopolist at $t - 1$: $R_t = s_{t-1}$.
So, they expect $s_t = R_t$. Assume $R_1 = 1$ (the consumers
have a favorable prior).

• The monopolist starts with a price $p_1$ and keeps on
charging the same price. He also always provides the high
quality. If he were to deviate and provide the low quality
in some period, then he would keep providing a low
quality from then on and charge $p_0 = 0$; the consumers
would then stop buying.


48. See also Allen 1984. Rogerson (1987) allows producers to differ in their
cost structures and provides further support for the equilibria selected by
Shapiro (see below).


These strategies depict a special form of reputation: 
The consumer believes that the monopolist will keep pro- 
ducing the same quality whatever this quality is. And, the 
monopolist indeed maintains the same quality. 

We will now see that, for a judicious choice of $p_1$, these
strategies form an equilibrium. We must demonstrate that 
the consumers’ apparently naive expectations are indeed 
rational (they do not fool themselves), and that, given the 
consumers’ expectations and for any initial reputation, the 
monopolist maximizes his present discounted value of 
profits. 

The consumers’ expectations are trivially rational, be- 
cause the monopolist’s strategy dictates that in each period 
he chooses the quality chosen a period earlier. 

If the monopolist follows his prescribed strategy, he
obtains an intertemporal profit of

$$(p_1 - c_1)(1 + \delta + \delta^2 + \cdots) = \frac{1 + r}{r}(p_1 - c_1).$$


If instead he were to deviate and sell a low quality at
price $p_1$ (he might as well charge $p_1$ if he produces the
low quality), he would obtain $p_1 - c_0$ in the deviation
period and 0 thereafter. Hence, a necessary condition for
equilibrium is that the “fly by night” strategy be
unprofitable:

$$(p_1 - c_1)(1 + \delta + \delta^2 + \cdots) = \frac{1 + r}{r}(p_1 - c_1) \geq p_1 - c_0,$$

or

$$p_1 - c_1 \geq r(c_1 - c_0).$$


Thus, in order for the monopolist not to cut quality, the
high-quality price must command a premium: It exceeds
marginal cost by at least $r(c_1 - c_0)$. This is easily understood.
By cutting quality now, the monopolist saves on
production cost $(c_1 - c_0)$. However, he loses his reputation
rent:

$$(p_1 - c_1)(\delta + \delta^2 + \cdots) = (p_1 - c_1)/r.$$


49. In the lemons problem, the seller does not choose the quality; rather, he
chooses whether to sell the product. However, the situation is somewhat
similar to the one considered here, in that the seller offers his product only if it
is of low quality.

Next, $p_1$ must be such that the monopolist would not
want to rebuild his reputation if he lost it. To do so, he
could sell for one period at zero (or slightly negative)
price and high quality. This would cost him $c_1$ in the short
run and would bring back the reputation rent $(p_1 - c_1)/r$.
Thus, we need

$$-c_1 + (p_1 - c_1)/r \leq 0,$$

or

$$p_1 - c_1 \leq r c_1.$$

(We must also have $p_1 \leq \theta$ in order for the consumers to
purchase the good.)
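
The two incentive conditions pin down a range of admissible prices. The sketch below is only an illustration, assuming the bounds $r(c_1 - c_0) \leq p_1 - c_1 \leq r c_1$ and $p_1 \leq \theta$; the function names and numerical values are hypothetical. It also checks numerically that, at the lower bound, the discounted profit from maintaining quality equals the one-shot "fly by night" gain $p_1 - c_0$.

```python
def premium_bounds(c0, c1, r, theta):
    """Range of p1 compatible with the bootstrap reputation equilibrium."""
    lower = c1 + r * (c1 - c0)        # no incentive to cut quality
    upper = min(c1 + r * c1, theta)   # no incentive to rebuild; consumers still buy
    return lower, upper


def discounted_profit(p1, c1, r, periods=2000):
    """Truncated sum of (p1 - c1) * delta**t with delta = 1 / (1 + r)."""
    delta = 1.0 / (1.0 + r)
    return sum((p1 - c1) * delta ** t for t in range(periods))


# Hypothetical values: c0 = 1, c1 = 2, r = 0.1, theta = 5.
lower, upper = premium_bounds(c0=1.0, c1=2.0, r=0.1, theta=5.0)
print(lower, upper)
print(discounted_profit(lower, c1=2.0, r=0.1), lower - 1.0)
```

The truncation at 2000 periods is a numerical convenience; the remaining tail is negligible for the interest rate used here.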

The moral of this model is straightforward: The producer
has an incentive to produce a high-quality item
only if high quality implies a rent that the producer is
afraid of losing if he cuts quality.⁵⁰ The minimum quality
premium $r(c_1 - c_0)$ increases with the rate of interest.
Suppose, for instance, that the time lag between periods
increases. This means that the consumers take longer to
observe past quality. For a given interest rate per unit of
time, the interest rate per period grows with the information
lag. The monopolist is more tempted to cut quality,
because such a cut takes longer to be detected. The quality
premium must increase to keep the monopolist from
cutting quality (milking his reputation).
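
The effect of the information lag on the minimum premium can be illustrated as follows. This is a sketch under an added assumption that is not in the text: the per-period rate is obtained by compounding a rate `rho` per unit of time over a lag of `lag` units. The names `rho` and `lag` are hypothetical.

```python
def per_period_rate(rho, lag):
    """Interest rate per period when a period lasts `lag` units of time (compounding assumed)."""
    return (1.0 + rho) ** lag - 1.0


def minimum_premium(c0, c1, rho, lag):
    """Minimum quality premium r * (c1 - c0), with r the per-period rate."""
    return per_period_rate(rho, lag) * (c1 - c0)


# Hypothetical values: the premium grows as quality takes longer to be observed.
for lag in (1, 2, 4):
    print(lag, minimum_premium(c0=1.0, c1=2.0, rho=0.05, lag=lag))
```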

Finally, consider the bootstrap aspect of this equilibrium.
Reputation matters only because consumers believe
it matters. If they did not, and if they believed
instead that, whatever the past qualities, the monopolist
would supply the low quality in the future, the monopolist
would have no incentive to sustain quality, and the
consumers’ expectations would again be fulfilled. Thus,
we have another equilibrium with an extreme lemons
effect: Whatever the history, the monopolist always produces
products of the lowest quality, and the consumers
expect him to do so. In this equilibrium, repeat purchases
have no effect. Thus, the analysis suggests only that repetition
may offer incentives to supply quality, not that it
necessarily will.

50. This idea is not new. Klein and Leffler offer the following quote from Adam
Smith: “... the wages of labour vary according to the small or great trust which
must be reposed in the workman. The wages of goldsmiths and jewellers are
everywhere superior to those of many other workmen, not only of equal, but
of much superior ingenuity, on account of the precious metals with which they


This second equilibrium is actually the one that would
be obtained if the horizon were finite. Let $T$ denote
the horizon. At date $T$, the monopolist has a dominant
strategy: to produce the lowest quality (for any price that
he charges). This is because there is no future, and the
problem is identical to a one-period problem. Thus, consumers
should expect $s = 0$, whatever the past. As quality
at date $T$ is independent of what happens at date
$T - 1$, the quality decision at $T - 1$ is made without
regard to the future, and so it is again the same as
in the one-period model. The monopolist produces the
lowest quality regardless of history. By backward induction
(see chapter 11), the lemons problem folds back
and the monopolist produces the lowest quality in each
period.

The infinite-horizon model can also be interpreted as a
model in which, at each period, there is a fixed probability
$y$ in (0,1) that the market “disappears” (because of the
introduction of a superior product or a change in taste, for
instance). The discount factor to be used in computing the
equilibria is a modified one:

$$\delta = \frac{1 - y}{1 + r},$$

where $r$ is the rate of interest per period.
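
As a quick illustration, the modified discount factor can be converted into an equivalent per-period interest rate. The sketch below assumes the market survives each period with probability $1 - y$, so that the discount factor is $(1 - y)/(1 + r)$; the function names are hypothetical.

```python
def modified_discount_factor(r, y):
    """Discount factor when the market disappears with probability y each period."""
    if not 0.0 < y < 1.0:
        raise ValueError("y must lie strictly between 0 and 1")
    return (1.0 - y) / (1.0 + r)


def effective_rate(r, y):
    """Interest rate r_eff such that 1 / (1 + r_eff) equals the modified discount factor."""
    return 1.0 / modified_discount_factor(r, y) - 1.0


# Hypothetical values: a 10% interest rate and a 50% chance of disappearance.
print(modified_discount_factor(r=0.1, y=0.5))
print(effective_rate(r=0.1, y=0.5))
```

Since the effective rate exceeds $r$, a riskier market calls for a larger minimum quality premium in the formula $r(c_1 - c_0)$ once $r$ is replaced by the effective rate.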

The monopoly version of the model can be recast in a
competitive mold by allowing free entry by firms, where
free entry implies that each producer’s intertemporal profit
must be 0 (otherwise, other firms would come in). A
zero profit requires that the firm lose some money when
entering because the flow of subsequent profits $(p_1 - c_1)$
is positive. For instance, the firm may make an introductory
offer below marginal cost $c_1$ (with the specification
that if it does not make this introductory offer or if it does
not supply the high quality, it will not be trusted by the
consumers in the future). Alternatively, the firm could
sink some money into uninformative but conspicuous advertising.
See Shapiro 1983 for more on this issue.


are intrusted. We trust our health to the physician; our fortune and sometimes
our life and reputation to the lawyer and attorney. Such confidence could not
safely be reposed in people of a very mean or low condition.” (Smith 1776,
p. 105)


2.6.2.2 Asymmetric Information and Reputation


The Klein-Leffler/Shapiro theory of quality premia is economically
appealing. It relies strongly on the unboundedness
of the horizon, however; furthermore, even for an
infinite horizon, the positive effect of repeat purchases on
product quality is a possibility, not a certainty (there are
many equilibria, including the one in which no reputation
forms). Kreps and Wilson (1982) and Milgrom and
Roberts (1982) showed that reputation effects can be
obtained even with a finite horizon, as long as the consumers
are not informed about the firm’s technology or
its objective function.⁵¹ The idea is the following: Suppose
that in the one-period problem there is a positive
probability, from the consumers’ point of view, that the
monopolist does not want to provide the lowest quality.
For instance, the monopolist might be “honest,” i.e., be
reluctant to supply a low quality when he has announced
a high quality, even though that strategy would yield a
higher monetary payoff. Alternatively, there could be a
positive probability that the monopolist’s cost of producing
the high quality does not exceed that of producing
the low quality. Thus, consider two potential types⁵² for
the monopolist. The “dishonest” type is as described in
subsection 2.6.2.1; in the one-period and T-period games,
he would provide the low quality if the consumers were
aware of his type. The “honest” type always provides the
high quality.

Suppose that only the monopolist knows his type, and 
consider a two-period horizon. The dishonest type may 
behave as if the consumers knew his type from the start 
and provide a low quality in the first period. The con- 
sumers would then infer that he is of a dishonest type and 
would expect a low quality in the second period. But 
another strategy may prove superior: The dishonest type 
can provide a high quality in the first period in an attempt 
to convince the consumers that he is honest. The con- 
sumers then expect a high quality in the second period 
and are willing to pay for it. The dishonest type can then 
milk his reputation by providing the low quality at a high 
price.⁵³ This strategy entails a first-period loss (due to
investment in reputation), which is offset by a second- 
period gain from reputation. The story might, for in- 
stance, apply to a restaurant that offers high-quality food 


for a year or two, then reduces quality, and finally shuts 
down. 

Consider a more formal version of this argument in
which a monopolist chooses between two qualities, $s = 0$
and $s = 1$, in each of two periods ($t = 1, 2$). The high
quality costs $c_1$, the low quality $c_0$ ($c_1 > c_0 > 0$). The
consumers are all identical and have a per-period utility
of $(\theta s_t - p_t)$ if they purchase and 0 otherwise. The monopolist
can be of two potential types: “honest” (with
probability $x_1$) and “dishonest” (with probability $1 - x_1$).
The monopolist knows his type, but the consumers do
not. An honest monopolist always supplies the high quality.
A dishonest monopolist maximizes intertemporal profits.
In particular, he always provides the low quality in
the second period (because it is cheaper, and there is no
point keeping a reputation).

For each period, the monopolist quotes a price and 
then picks a quality. The consumers observe only the 
price immediately; they observe the quality a period later. 
We assume for simplicity that in a given period all con- 
sumers behave identically—they all accept or reject the 
price offered by the monopolist. Furthermore, we look for 
equilibrium behaviors that are independent of the price 
quoted in the first period. Indeed, we will see that there
exists an equilibrium in which any $p_1$ is totally uninformative.
Let us fix an arbitrary first-period price and consider
the quality choice of the dishonest monopolist. We will 
then determine which price can be charged in the first 
period. If the consumers do not purchase the product in 
that period, the quality choice is irrelevant. 

Assume that the product is bought. The dishonest
monopolist can save $c_1 - c_0$ by providing the low quality.
He then makes a profit of 0 in the second period,
because the consumers learn that he is dishonest and that
he will therefore provide low quality in the second period
(the moral hazard problem under complete information;
see subsection 2.6.2.1 or section 2.3). By providing a high
quality in the first period, he gains, at most, $\delta(\theta - c_0)$ in
the second period. This is because, at best, the consumers
believe that he is honest and are willing to pay $\theta$; milking

51. Kreps, Milgrom, Roberts, and Wilson actually developed their reputation 
story in the context of predation by a chain store (see chapter 9). However, the 
ideas are similar enough to be straightforwardly applied to the quality problem. 




52. For the notion of type in games of incomplete information, see the Game
Theory User’s Manual.


53. Of course, consumers should realize that such a strategy may be optimal
for the monopolist. See below.

the reputation in the second period then yields $\theta - c_0$.
Now, if $c_1 - c_0 > \delta(\theta - c_0)$, the dishonest monopolist
has a dominant strategy: to provide the low quality in the
first period. The maximum price at which the good can be
sold at date 1 is then $p_1 = \theta x_1$.

More interesting is the case where $c_1 - c_0 < \delta(\theta - c_0)$.⁵⁴
The cost savings due to low quality are dominated
by the value of the reputation of being honest. This
suggests that the dishonest monopolist may want to
mimic (“pool with”) the honest monopolist and provide
the high quality in the first period. Can this be an equilibrium?
If both types of monopolists provide the same
high quality, observing a high quality does not bring any
information. Thus, the consumers’ posterior (date 2) probability
that the monopolist is honest is still $x_1$. Hence, the
consumers are willing to pay $E(\theta s_2) = \theta x_1$. In turn the
dishonest monopolist is willing to provide the high quality
only if $c_1 - c_0 < \delta(\theta x_1 - c_0)$. If this condition is satisfied,
the monopolist does provide the high quality for one
period before milking his reputation. Of course, he does
not succeed in fully convincing the consumers that he is
honest; however, he lets the doubt persist, which allows
him to sell in the second period at price $\theta x_1$. Thus we
have obtained an elementary example of investment in
reputation.

When $\delta(\theta - c_0) > c_1 - c_0 > \delta(\theta x_1 - c_0)$, the unique
equilibrium has the dishonest monopolist randomize (be
indifferent) between the high and the low quality in the
first period.


Exercise 2.8** Show that, if $\delta(\theta - c_0) > c_1 - c_0 > \delta(\theta x_1 - c_0)$,
the dishonest monopolist establishes a
reputation for the second period with probability

$$\alpha = \frac{x_1[\delta(\theta - c_0) - (c_1 - c_0)]}{(1 - x_1)(c_1 - c_0 + \delta c_0)}.$$

Note that $\lim_{x_1 \to 0} \alpha = 0$. Interpret.

Notice that the choice of quality and the second-period 
price are independent of $p_1$. To complete the description
of equilibrium, let us specify the following consumer 
behavior: 


54. For the sake of brevity, we do not consider the borderline case $c_1 - c_0
= \delta(\theta - c_0)$. Similarly, we will ignore another borderline case below.

If $c_1 - c_0 > \delta(\theta - c_0)$, the first period reveals the
monopolist’s type. Hence, $E(\theta s_1) = \theta x_1$. Both types charge $\theta x_1$.

If $c_1 - c_0 < \delta(\theta x_1 - c_0)$, both types of monopolist
provide the high quality. Hence, $E(\theta s_1) = \theta$. The consumers
are willing to pay up to $p_1 = \theta$. Both types charge $\theta$.

If $\delta(\theta x_1 - c_0) < c_1 - c_0 < \delta(\theta - c_0)$, the dishonest
monopolist randomizes:


$$E(\theta s_1) = \theta[x_1 + (1 - x_1)\alpha],$$


where $\alpha$ is given by exercise 2.8. The consumers are
willing to pay up to $p_1 = \theta[x_1 + (1 - x_1)\alpha]$. Both types
charge this price.

Note that the higher the incentive to keep a reputation
(for instance, the lower $(c_1 - c_0)$), the higher the first-period
price will be.
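
The three regimes can be gathered into one small routine. What follows is only a sketch of the two-period equilibrium described above, using the mixing probability $\alpha$ of exercise 2.8 in the intermediate regime; the function name, the returned tuple, and the numerical values are hypothetical, and the borderline cases are folded into the intermediate branch for convenience.

```python
def two_period_equilibrium(theta, c0, c1, delta, x1):
    """Return (alpha, p1, p2_after_high) for a prior x1 strictly between 0 and 1.

    alpha is the probability that the dishonest type provides high quality in period 1;
    p2_after_high is the second-period price after high quality has been observed.
    """
    saving = c1 - c0
    if saving > delta * (theta - c0):
        alpha = 0.0                      # type revealed in the first period
    elif saving < delta * (theta * x1 - c0):
        alpha = 1.0                      # both types provide high quality
    else:
        alpha = x1 * (delta * (theta - c0) - saving) / ((1 - x1) * (saving + delta * c0))
    p1 = theta * (x1 + (1 - x1) * alpha)
    posterior = x1 / (x1 + (1 - x1) * alpha)   # Bayes' rule after observing high quality
    return alpha, p1, theta * posterior


# Hypothetical values illustrating the three regimes.
print(two_period_equilibrium(theta=10.0, c0=1.0, c1=6.0, delta=0.5, x1=0.2))
print(two_period_equilibrium(theta=10.0, c0=1.0, c1=3.0, delta=1.0, x1=0.5))
print(two_period_equilibrium(theta=10.0, c0=1.0, c1=3.0, delta=1.0, x1=0.2))
```

In the randomizing case, the second-period price makes the dishonest type exactly indifferent: $\delta(p_2 - c_0) = c_1 - c_0$.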

Thus, we have considered an equilibrium in which the 
dishonest monopolist reveals his type immediately, or 
tries to keep his reputation (where reputation means the 
consumers’ perception of the possibility of honesty), de- 
pending on the values of the parameters. 

Let us compare this equilibrium with the reputation 
equilibrium of the quality-premium story. In both cases, 
the high price associated with a reputation may induce a 
dishonest monopolist not to cut quality (at least in the 
short run in the quality-premium story). There are, how- 
ever, important differences. First, the new equilibrium is 
based on an asymmetry of information rather than on 
bootstrapping. Second, low cost savings $(c_1 - c_0)$ lower
the incentive to cut quality in both models; however, in 
the quality-premium story, it implies that a lower price 
premium is required to keep incentives. Thus, the market 
price can be lower. In the asymmetric-information story, 
lower cost savings make the dishonest monopolist more 
likely to produce the high quality. This raises the market 
price on average (both in the first and in the second 
period). 

The asymmetric-information theory of reputation can 
be studied more generally in a T-period model. The gen- 
eralization of the earlier equilibrium can be derived using 
the same techniques used before. In general, it takes the 
following form: In the early periods, the dishonest monopolist
always provides the high quality. Later on, he
randomizes between the high and the low quality (and
once he has provided the low quality, he keeps doing so
until the end). At the end of the horizon, he always
provides the low quality (see exercise 2.9).

One of the main points of the Kreps-Milgrom-Roberts-Wilson
contribution is that if the horizon is sufficiently
long and the discount factor $\delta$ sufficiently close to 1 (so
that the monopolist does not discount the future too
much), even small initial probabilities $x_1$ that the monopolist
is honest lead to non-negligible investments in reputation
(which is not the case in the two-period model; see
exercise 2.8).


Exercise 2.9*** In the T-period version of the previous
model, derive the equilibrium in which (as previously)
each period’s price does not act as a signal. Let $K
= \delta\theta/(c_1 - c_0 + \delta c_0)$. Show that if $K < 1$ the equilibrium
behavior reveals the monopolist’s type in the first period.
Show that if $K > 1$ the monopolist pools at period $t$ with
probability 1 if the current posterior beliefs that the
monopolist is honest satisfy $x_t > 1/K^{T-t}$. Interpret.
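
To get a feel for the thresholds in this exercise, one can tabulate $1/K^{T-t}$ for a given horizon. The sketch below takes $K = \delta\theta/(c_1 - c_0 + \delta c_0)$ and merely evaluates the stated thresholds; it does not derive the equilibrium. The function name and the numbers are hypothetical.

```python
def pooling_thresholds(theta, c0, c1, delta, T):
    """Return K and the list of thresholds 1 / K**(T - t) for t = 1, ..., T."""
    K = delta * theta / (c1 - c0 + delta * c0)
    return K, [1.0 / K ** (T - t) for t in range(1, T + 1)]


# Hypothetical values with K > 1: early thresholds are small, the last one equals 1.
K, thresholds = pooling_thresholds(theta=10.0, c0=1.0, c1=3.0, delta=1.0, T=3)
print(K, thresholds)
```

When $K > 1$ the threshold shrinks geometrically as the remaining horizon $T - t$ grows, which is why even a small prior probability of honesty can sustain pooling early on.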


In our equilibrium, the market price does not act as a signal
of the monopolist’s type. One may wonder whether,
in the cases in which the dishonest monopolist duplicates
the honest monopolist’s quality choice in the first
period, the honest monopolist would not want to signal
through an introductory price, as in subsection 2.6.1.2.
For this question to be well posed, the objective function
of the honest monopolist must be specified more
thoroughly. If the honest monopolist simply enjoys providing
quality, it is not clear why he would try to raise his
intertemporal profit by making an introductory offer. On
the other hand, he may also enjoy profits. (Quality comes
first, however, either because of lexicographic preferences
or because, for this type, the cost of providing the higher
quality is lower than that of providing the lower one.)


Answers and Hints

Exercise 2.1 
2p, 2-1 5P 

ey ae ae and s,= > p=a. 
pts p's 


A Cobb-Douglas function implies constant elasticities.

Exercise 2.2


The demand function is given by $1 - p/s = q$, or $p = s(1 - q)$.

$$\frac{1}{q}\int_0^q \frac{\partial P}{\partial s}(x, s)\,dx = 1 - \frac{q}{2} > \frac{\partial P}{\partial s}(q, s) = 1 - q.$$


(ii) The monopoly solution is given by equations [2.3]
and [2.4].⁵⁵ Here

$$s(1 - q) - qs = \frac{cs^2}{2} \qquad [2.3]$$

and

$$(1 - q)q = (cs)q, \qquad [2.4]$$

which yields $q = 1/3$ and $s = 2/(3c)$.

Note that output is independent of $c$.

The social planner’s optimum is given by equations
[2.1] and [2.2]. Here

$$s(1 - q) = \frac{cs^2}{2} \qquad [2.1]$$

and

$$(1 - q/2)q = (cs)q, \qquad [2.2]$$

which yields $q = 2/3$ and $s = 2/(3c)$.
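
The stated solutions can be verified by substitution. The check below is a sketch assuming inverse demand $p = s(1 - q)$ and a unit cost of $cs^2/2$ (the reading under which the stated solutions $q = 1/3$ for the monopolist, $q = 2/3$ for the planner, and $s = 2/(3c)$ in both cases satisfy the first-order conditions); the function names are hypothetical.

```python
from fractions import Fraction


def monopoly_focs(q, s, c):
    """Residuals of the monopolist's first-order conditions (zero at the solution)."""
    foc_q = s * (1 - q) - q * s - c * s**2 / 2   # marginal revenue minus unit cost
    foc_s = (1 - q) * q - c * s * q              # marginal consumer's valuation of quality
    return foc_q, foc_s


def planner_focs(q, s, c):
    """Residuals of the social planner's first-order conditions (zero at the solution)."""
    foc_q = s * (1 - q) - c * s**2 / 2           # price equals unit cost
    foc_s = (1 - q / 2) * q - c * s * q          # average consumer's valuation of quality
    return foc_q, foc_s


c = Fraction(3)                 # hypothetical cost parameter
s = Fraction(2) / (3 * c)       # quality, identical in both solutions
print(monopoly_focs(Fraction(1, 3), s, c))
print(planner_focs(Fraction(2, 3), s, c))
```

Exact rational arithmetic is used so that the residuals are exactly zero rather than zero up to rounding.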
Exercise 2.3 


(i) Monopoly: With only one site, the monopolist
charges $\bar{s} - t$. (If $\bar{s}$ is big, he prefers to cover the market,
i.e., to reduce price sufficiently so that everybody buys.)
With two sites, he can increase the price to $\bar{s} - t/2$ and

55. Equations [2.3] and [2.4] specialize the general formulas 2.3 and 2.4 (given
in the text above) to this particular demand function. Equations [2.1] and [2.2]
do much the same.

still cover the market. The increase in profit from adding
the second site is $\Delta\Pi^m = t/2 - f > 0$.

(ii) Social planner: The consumer surplus is given (inelastic
consumption). The only thing that the number of sites
affects is the total transportation cost. Thus, the increase
in welfare from adding a second site is

$$\Delta W = t\int_0^1 x\,dx - 2t\int_0^{1/2} x\,dx - f = t/4 - f < 0.$$
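
The transportation-cost comparison can be reproduced numerically. This is a rough sketch assuming consumers are uniformly distributed on the unit segment, with one site at an endpoint or two sites at both endpoints (assumptions consistent with the integrals above); the function names and the values of $t$ and $f$ are hypothetical.

```python
def transport_cost(t, sites, n=10_000):
    """Total transport cost when each consumer on [0, 1] travels to the nearest site."""
    total = 0.0
    for i in range(n):
        x = (i + 0.5) / n                                  # midpoint rule
        total += t * min(abs(x - site) for site in sites) / n
    return total


def welfare_gain(t, f):
    """Savings in transport cost from a second site, net of the fixed cost f."""
    return transport_cost(t, [0.0]) - transport_cost(t, [0.0, 1.0]) - f


def profit_gain(t, f):
    """Monopolist's gain: the covering price rises by t/2, at fixed cost f."""
    return t / 2 - f


# Hypothetical values with t/4 < f < t/2: privately profitable, socially wasteful.
print(profit_gain(t=1.0, f=0.3), welfare_gain(t=1.0, f=0.3))
```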


Exercise 2.4 


It was shown in the text that if $\alpha p < c_1 - (1 - \alpha)c_0$, not
all uninformed consumers can buy, because otherwise the
monopolist would cut quality. It was also shown that
some uninformed consumers must buy, because otherwise
the monopolist would offer the high quality. Let $\gamma$ in (0, 1)
denote the fraction of uninformed consumers who buy.
The monopolist must randomize between the two qualities
(otherwise, $\gamma$ would be 0 or 1). Thus he must be
indifferent between the two:

$$[\alpha + \gamma(1 - \alpha)](p - c_1) = \gamma(1 - \alpha)(p - c_0),$$

or

$$\alpha p = c_1 - (1 - \alpha)[(1 - \gamma)c_1 + \gamma c_0].$$

This equation defines $\gamma$. Note that $\gamma$ grows with $\alpha$. Next,
the uninformed consumers must be indifferent between
buying and not buying. Let $\beta$ denote the probability that
the monopolist provides the high quality:

$$\theta\beta - p = 0, \text{ or } \beta = p/\theta.$$
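
The two indifference conditions can be solved explicitly. The following sketch assumes that alpha denotes the fraction of informed consumers, gamma the fraction of uninformed consumers who buy, and beta the probability of high quality (the symbol names are an assumption about the notation); the function name and the numbers are hypothetical.

```python
def mixed_equilibrium(p, theta, alpha, c0, c1):
    """Solve the monopolist's and the uninformed consumers' indifference conditions."""
    # Monopolist: [alpha + gamma*(1 - alpha)]*(p - c1) = gamma*(1 - alpha)*(p - c0)
    gamma = alpha * (p - c1) / ((1 - alpha) * (c1 - c0))
    # Uninformed consumers: theta*beta - p = 0
    beta = p / theta
    return gamma, beta


# Hypothetical values satisfying alpha*p < c1 - (1 - alpha)*c0.
gamma, beta = mixed_equilibrium(p=3.0, theta=4.0, alpha=0.2, c0=1.0, c1=2.0)
print(gamma, beta)
```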
Exercise 2.5 

(i) The efficient volume of trade is $N[1 - G(\theta_0)]$. High
$\theta$’s should get high $s$’s.

(ii) Much as in the Robinson Crusoe example, the supply
of cars, $O(p)$, at price $p$ is $O(p) = NF(p/\theta_0)$. The
average quality in the market is

$$s^a(p) = \left(\int_0^{p/\theta_0} s f(s)\,ds\right) \Big/ F(p/\theta_0).$$

A crucial check is that $ds^a(p)/dp > 0$.
A buyer with parameter $\theta$ buys iff $\theta s^a(p) \geq p$. Hence,

$$D(p) = N\left[1 - G\left(p/s^a(p)\right)\right].$$

(iii) A competitive equilibrium satisfies $O(p) = D(p)$.

For the specification of the model,

$$O(p) = \frac{Np}{\theta_0}, \quad s^a(p) = \frac{p}{2\theta_0}, \quad D(p) = N(1 - 2\theta_0).$$

Thus, if $\theta_0 \geq 1/2$, there cannot be any trade (as in the
Robinson Crusoe example). If $\theta_0 < 1/2$, there exists a unique
competitive equilibrium at $p = \theta_0 - 2\theta_0^2$. The volume
of trade is $N(1 - 2\theta_0) < N(1 - \theta_0)$.
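
A short numerical check of this equilibrium follows. It is a sketch assuming qualities and buyers' taste parameters are both uniformly distributed on [0, 1], so that supply is $Np/\theta_0$ and demand is $N(1 - 2\theta_0)$; the function name and the value of $\theta_0$ are hypothetical.

```python
def lemons_equilibrium(theta0, N=1.0):
    """Return (price, volume) of the competitive equilibrium, or None if no trade occurs."""
    if theta0 >= 0.5:
        return None                       # demand N*(1 - 2*theta0) is not positive
    price = theta0 - 2 * theta0 ** 2      # solves N*p/theta0 = N*(1 - 2*theta0)
    volume = N * (1 - 2 * theta0)
    return price, volume


price, volume = lemons_equilibrium(theta0=0.25)
print(price, volume)                      # equilibrium price and volume of trade
print(1.0 * price / 0.25)                 # supply at that price, equal to the volume
print(1.0 * (1 - 0.25))                   # efficient volume N*(1 - theta0), which is larger
```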

(iv) By varying $F$ and $G$, one can easily obtain multiple
intersections between $O$ and $D$. Let $p_1$ and $p_2$ denote two
equilibrium prices, with $p_1 < p_2$. Suppliers clearly prefer
the equilibrium at $p_2$. Buyers also do; a higher supply at
$p_2$ means that the demand is higher as well. Hence,

$$\frac{p_2}{s^a(p_2)} < \frac{p_1}{s^a(p_1)}$$

$$\Rightarrow \text{for all } \theta,\ \theta - \frac{p_2}{s^a(p_2)} > \theta - \frac{p_1}{s^a(p_1)}$$

$$\Rightarrow \text{for all } \theta,\ \theta s^a(p_2) - p_2 > \theta s^a(p_1) - p_1.$$

It turns out that the highest-price Walrasian equilibrium
need not be optimal, even when account is taken
of informational constraints (i.e., a central planner can do
better even if he does not know the private characteristics).
For instance, some rationing may be desirable. See
Wilson 1980.

(v) Consider the specification of question iii. Under a
minimum quality standard $s_0$, the average quality at price
$p$ becomes

$$s^a(p) = (p/\theta_0 + s_0)/2.$$

Take $\theta_0 = 1/2$, for example. If $s_0 = 0$, no trade occurs. For
$s_0 > 0$, trade occurs at $p = \frac{1}{2}\sqrt{s_0 + s_0^2}$.
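
The equilibrium price under a minimum quality standard can be checked against market clearing. The sketch assumes $\theta_0 = 1/2$, qualities and tastes uniform on [0, 1], and $N = 1$, so that supply is $2p - s_0$ and demand is $1 - p/s^a(p)$ with $s^a(p) = p + s_0/2$; the function names and the value of $s_0$ are hypothetical.

```python
import math


def price_with_standard(s0):
    """Candidate equilibrium price p = 0.5 * sqrt(s0 + s0**2) for theta0 = 1/2."""
    return 0.5 * math.sqrt(s0 + s0 ** 2)


def supply(p, s0):
    """Sellers with quality between s0 and p / theta0 = 2p offer their cars."""
    return 2 * p - s0


def demand(p, s0):
    """Buyers with theta >= p / s_a(p) purchase, where s_a(p) = p + s0 / 2."""
    average_quality = p + s0 / 2
    return 1 - p / average_quality


p = price_with_standard(s0=0.25)
print(p, supply(p, 0.25), demand(p, 0.25))   # supply and demand coincide at this price
```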


Exercise 2.6 

The intertemporal profit can be written as

$$x(p_1 - \hat{c})[1 - F(p_1)] + \delta x(p_1 - c)[1 - F(p_1)] = (1 + \delta)\, x\, [p_1 - c(\delta)][1 - F(p_1)],$$

where $c(\delta) = (\hat{c} + \delta c)/(1 + \delta)$ is the average discounted
cost (remember that the cost of producing a clientele of $N$
consumers is $Nc/x = N\hat{c}$). Note that $c(\delta)$ decreases with $\delta$.
The maximization of this profit is equivalent to that of a
monopolist with unit cost $c(\delta)$. Hence, the market price is
a decreasing function of $\delta$. The intuition is that the cost
of building goodwill is offset more and more by the
second-period benefit from goodwill when the discount
factor increases.

When $\delta$ goes to infinity (which corresponds to the case
in which the consumers learn very quickly and the monop- 
olist can milk his informed clientele for a long time), the 
demand (size of goodwill) converges to the monopoly 
demand under full information. 


Exercise 2.7 


(i) For the low-quality type, the monopoly price is 
determined by the comparison of 59 — Co (sell to all 
consumers) and 4; (859 — Co) (sell only to high-valuation 
consumers). For the high-quality type, replace sp with s, 
and cg with c,. 

(ii) Assume the set of parameters S,. The low-quality 
monopolist, who in a separating equilibrium reveals that 
quality is low, does best by choosing his monopoly solu- 
tion. From (i), he sells to all consumers and makes profit 
(oso — ¢)(1 + 4). He should not want to duplicate the 
high-quality type’s strategy. Assume that the latter sells 
at p, to 8, consumers only. (This requires that p, > 85,.) 
Then we have 


(8959 — €o)(1 + 5) > gulpy — cy + 518,50 — co): 


Next, the high-quality type does not want to duplicate 
the low-quality type’s strategy: 


91(P) — 4 + 6(8,5, — 4) 
ZS M89 — & + 49; (Ay5, — 4), 


where use is made of the optimal monopoly strategy in 
the second period. These two equations, the condition 
p, > %s,, and the non-negativity of profits are satisfied 
for parameters S, and p, = 45, say. To complete the 
proof, suppose that consumers believe that quality is so 
when the first-period price differs from 5 and Qs,. 
Similar computations yield the result under $,. 


Exercise 2.8 


In the text, it is shown that the dishonest monopolist
chooses the low quality with probability 1 only if
$c_1 - c_0 \geq \delta(\theta - c_0)$ and chooses the high quality with
probability 1 only if $c_1 - c_0 \leq \delta(\theta x_1 - c_0)$. Hence, in
equilibrium the dishonest monopolist must randomize. Let $\alpha$
denote the probability that he chooses the high quality.
From Bayes' rule, the consumers' posterior probability
that the monopolist is honest, given that he has provided
the high quality in the first period, is

$$x_2 = x_1/[x_1 + (1 - x_1)\alpha].$$

The dishonest monopolist's indifference between the two
choices requires that $c_1 - c_0 = \delta(\theta x_2 - c_0)$. Hence the
formula in the text.

When $x_1$ is small, the dishonest type cannot choose the
high quality with non-negligible probability. Otherwise,
the consumers put most of the weight on him, which 
lowers the benefits from reputation. 


Exercise 2.9 


(The following proof is a mere adaptation of the Kreps- 
Milgrom-Roberts-Wilson proof in their predatory-pricing 
model). 

Work by backward induction from the last period. $x_t$
denotes the period-$t$ (posterior) probability that the
monopolist is honest.

At $T$, the dishonest monopolist provides the low quality
whatever $x_T$.

At $(T - 1)$, the game is identical with the two-period
case analyzed in the text for initial beliefs $x_{T-1}$. We know
that if $K < 1$ the equilibrium involves full revelation for
any $x_{T-1}$. If $K > 1$, the equilibrium is either a “pooling
equilibrium” (the dishonest type provides the high quality
with probability 1) or a “semi-revealing equilibrium” (the
dishonest type provides the high quality with probability
$\alpha_{T-1}$ in $[0, 1)$). The dishonest type has intertemporal payoff
from $T - 1$ on: $\theta K x_{T-1} - c_0$ in the semi-revealing region
and $\theta - c_1 + \delta(\theta x_{T-1} - c_0)$ in the pooling region. The
semi-revealing region is defined by $0 < x_{T-1} < \bar{x}_{T-1}$ and
the pooling region by $\bar{x}_{T-1} \leq x_{T-1} \leq 1$, where the cutoff
probability $\bar{x}_{T-1}$ is defined by

$$\theta K \bar{x}_{T-1} - c_0 = \theta - c_1 + \delta(\theta \bar{x}_{T-1} - c_0) \;\Rightarrow\; \bar{x}_{T-1} = 1/K.$$


First, get rid of the uninteresting case $K < 1$. At $T - 2$,
the consumers and the monopolist know that the dishonest
type will provide the low quality at date $T - 1$
anyway. Thus, everything is as if $T - 1$ were the end of
the horizon (i.e., as if we were in a two-period model).
Hence, the dishonest type provides the low quality for
any beliefs $x_{T-2}$. By backward induction, this property
holds for all $t$.

Next, assume that $K > 1$ and consider period $T - 2$.
Can the dishonest monopolist reveal his type with probability
1? If this were so, he would save $c_1 - c_0$ relative
to waiting until $T - 1$ and milking the reputation then.
(Such a strategy at $T - 1$ is actually dominated.) But he
would lose $\delta(\theta - c_0) > c_1 - c_0$. Thus, the equilibrium
cannot be revealing. Again, there are two regions. The
semi-revealing region ($x_{T-2} < \bar{x}_{T-2}$) is such that the
monopolist ends up in the $T - 1$ semi-revealing region if
he provides the high quality at $T - 2$. Hence, the cutoff
point between the two regions at date $T - 2$ is determined
by

$$c_1 - c_0 = \delta(\theta K \bar{x}_{T-2} - c_0).$$

(At the cutoff point, the monopolist produces the high
quality with probability 1, which implies $x_{T-1} = \bar{x}_{T-2}$.)
Hence, $\bar{x}_{T-2} = 1/K^2$.

In the semi-revealing region, the dishonest type has
intertemporal payoff $\theta K^2 x_{T-2} - c_0$, and, by induction,
$\bar{x}_t = K^{-(T-t)}$.

For any fixed $t$, $\bar{x}_t$ tends to zero when $T$ goes to
infinity. Hence, for any $x_1$ and any $t$, the dishonest monopolist
produces the high quality with probability 1 at least
until $t$ as long as $T$ is sufficiently large.
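
The limiting argument can be illustrated numerically. The short Python sketch below takes the cutoff rule $\bar{x}_t = K^{-(T-t)}$ for $K > 1$ as given and tabulates the cutoffs; the function names are hypothetical, and the sketch does not solve for the equilibrium itself.

```python
def reputation_cutoffs(k, horizon):
    """Map each period t = 1..T-1 to the belief cutoff K**-(T - t)."""
    if k <= 1:
        raise ValueError("the cutoffs are only defined for K > 1")
    return {t: k ** -(horizon - t) for t in range(1, horizon)}


def pools_at(belief, t, k, horizon):
    """True when the belief is at or above the period-t cutoff (pooling region)."""
    return belief >= k ** -(horizon - t)
```

For a fixed early period the cutoff shrinks geometrically as the horizon grows, so even a small prior belief in honesty eventually lies in the pooling region.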


Price Discrimination


For most of chapters 1 and 2 we assumed that a monop- 
olist produces a single good and sells it at a uniform 
price (per unit). In general, uniform pricing leaves some 
surplus to the consumers. (Indeed, we saw in chapter 2 
that this nonappropriability of the consumer surplus is 
a force pushing toward the introduction of too few 
products by a monopolist.) 

Uniform pricing may be a good generalization for 
most retail markets. However, examples abound in which 
the same economic good is sold at different prices to 
different consumers. A doctor may charge a rich patient 
more than a poor one, or may charge an insured patient 
more than an uninsured one. The price of a consumer 
good may vary across areas of a city or a country, even 
in the absence of any significant cost differential. The 
same economic good may also be sold at two different 
prices to the same consumer. This occurs, for instance, 
when the producer practices quantity discounts, in which 
the marginal unit can be purchased at a lower price than 
the inframarginal ones. Such examples may be viewed as 
attempts by the producer to capture a higher fraction of 
consumer surplus than he would if he charged a uniform 
price.

It is hard to come up with a satisfactory definition 
of price discrimination. Roughly, it can be said that the 
producer price-discriminates when two units of the same 
physical good are sold at different prices, either to the 
same consumer or to different consumers. 

This definition is unsatisfactory, and sometimes it must 
be amended or extended. First, consider the case of 
a cement producer serving a geographic area. To the 
producer's costs must be added the freight costs. Suppose 
that the cement producer is vertically integrated and thus 
provides his own transportation. In such a case, a uniform 
delivered price is discriminatory, whereas delivered prices 
that respond fully to transportation-cost differentials 
among consumers located at different distances from the 
factory are not. Hence, we will say that there is no price
discrimination if differences in prices between consumers
exactly reflect differences in the costs of serving these
consumers (this amounts to considering the net cost of
serving a consumer). Second, it should not be inferred
that price discrimination does not occur when differentiated
products are sold to different consumers.¹ The use
of different qualities of services (e.g., classes in trains 
and airplanes) is also partly an attempt to capture con- 
sumer surplus by separating consumers into different 
groups, as we shall see. Thus, it is difficult to offer an all- 
encompassing definition. A general-equilibrium theorist 
might rightly point out that goods delivered at different 
dates, at different locations, in different states of nature, or 
of different quality are distinct economic goods and thus 
that the scope of “pure” discrimination is very limited.²

The possibility of price discrimination is linked to the 
possibility of arbitrage. It is conventional to distinguish 
between two types of arbitrage. 

The first type of arbitrage is associated with the trans- 
ferability of the commodity. It is clear that if the transaction 
(arbitrage) costs between two consumers are low, any 
attempt to sell a given good to two consumers at different 
prices runs into the problem that the low-price consumer 
buys the good to resell it to the high-price one. For 
instance, the introduction of quantity discounts (which, 
as we will see below, is often optimal) implies that, in 
the absence of transaction costs between consumers, only
one consumer buys the product and resells it to other
consumers. For example, if each consumer buys according
to the “two-part tariff” T(q) = A + pq (where A > 0 is 
a fixed fee and p is the marginal price), only one consumer 
will pay the fixed fee. Hence, if there are many consumers, 
everything is almost as if the manufacturer sold at the 
linear (or uniform) price. If consumers can arbitrage per- 
fectly, the producer is generally forced to charge a uniform 
or fully linear price: T(q) = pq. 

Transaction costs offer a clue as to when price dis- 
crimination is feasible. Services such as medical treatment 
and travel are much less transferable than most retail 
commodities. Similarly, the consumer can hardly engage 
in arbitrage for electricity or telephone calls. 

1. In the example of the cement producer, consumers indeed buy spatially
differentiated goods. However, each consumer can consume only a well-defined
commodity (“cement delivered at the consumer's location”). We want to extend
the notion of discrimination to cases in which the consumer is given a choice
among several differentiated goods.

Of course, perfect (costless) arbitrage and no arbitrage
are two extreme cases. In general, some limited arbitrage
may occur, depending on the relative cost and benefit.
Consider the use of a false student ID card in order to 
enjoy a student discount. An interesting case of partial 
arbitrage, and therefore partial discrimination, is that of 
a manufacturer who sells his product to several retailers. 
The retailers may arbitrage if the manufacturer tries to 
charge different marginal prices to different retailers. This 
prevents the manufacturer from imposing general non- 
linear tariffs T(q) to his retailers. However, even though 
he may not be able to observe the exact quantity sold by 
each retailer, he may observe that a retailer carries his 
product. Then (ignoring legal constraints) he can charge a 
two-part tariff T(q) =A + pq, where A is a fixed pre- 
mium (a “franchise fee,” in this case). 

Two-part tariffs will also be applicable whenever the 
(variable) consumption of the arbitraged good is linked 
with the (fixed) consumption of a complementary prod- 
uct, as in the case of razor blades and razors or that of 
Polaroid film and Polaroid cameras. If pictures (rather than 
film and camera) are thought of as the final consumption 
good, the producer can manipulate the relative prices 
of the two inputs that produce pictures to discriminate 
(i.e., can charge two different prices for different units of
the good “picture”). The analogy with the franchising 
example is that the fixed fee must in practice be paid by 
every consumer, while the good which is consumed in 
variable proportions may be subject to arbitrage. 

The second type of arbitrage is associated with the 
transferability of demand between different packages or 
bundles offered to the consumers. Here there is no phys- 
ical transfer of good between consumers. The consumer 
simply chooses between the different options offered. For
instance, a consumer may have a choice between buying 
two units of the good at total price T(2) and buying one 
unit at price T(1) (this is known as a price-quantity pack- 
age) or between first-class and second-class accommoda- 
tions on a train (a price-quality package). As we will see, if 
consumers’ tastes differ, the producer generally wants to 
target a specific package for each consumer. However, in
the absence of information about the identity of each
consumer (the producer only knows the aggregate distribution
of tastes), the producer must make sure that each
consumer indeed chooses the package designed for him
and not the package designed for another consumer. For
instance, the traveler to whom first class is targeted
should not want to travel second class because the price
savings more than offset the reduction in quality. This, as
we will see, puts “incentive compatibility” constraints on
the set of packages offered by the producer. The producer
must use “self-selection devices.”

2. See Phlips 1983 for an extensive discussion of the notion of discrimination.
This introduction to chapter 3 is influenced by his introduction to the
topic. Varian 1987 and Wilson 1985 are two other useful treatments of price
discrimination.

In terms of the consequences of discrimination, the two 
types of arbitrage are naturally very different. The trans- 
ferability of the product tends to prevent discrimination, 
whereas the transferability of demand may induce the 
producer to increase discrimination. In the supplementary 
section we will see that, under reasonable assumptions, 
the producer enlarges the quantity spectrum or the qual- 
ity spectrum when he has information about aggregate 
demand rather than about individual demands. 

Following Pigou (1920), it is customary to distinguish
three types of price discrimination. First-degree price discrimination
is perfect price discrimination: the producer
succeeds in capturing the entire consumer surplus. This
occurs, for instance, when consumers have unit demands
and the producer knows exactly each consumer's reservation
price and (if these reservation prices differ) can
prevent arbitrage between consumers. It then suffices for
the producer to charge an individualized price equal to
the consumer's reservation price. Perfect price discrimination
is unlikely in practice, either because of arbitrage
or because of incomplete information about individual
preferences. In the case of incomplete information about
individual preferences, the producer may still be able to
extract consumer surplus imperfectly by using the self-selecting
devices mentioned earlier. This is called second-degree
price discrimination.³ Also, the producer may
observe some signal that is related to the consumer's
preferences (e.g., age, occupation, location) and use this
signal to price-discriminate; this is termed third-degree
price discrimination. The important difference between
second-degree and third-degree price discrimination is that
third-degree discrimination uses a direct signal about
demand, whereas second-degree discrimination selects
indirectly between consumers through their choice between
different packages.

We will not quite follow the traditional ordering. After
considering when perfect price discrimination is feasible
(section 3.1), we will study third-degree price discrimination
(section 3.2). The ideas contained in section 3.2 are
direct applications of previous material on multi-product
monopoly and the inverse elasticity rule (see section 1.1).
To clearly differentiate this type of imperfect discrimination
from second-degree price discrimination, we will
assume that the monopolist is able to divide the market
into n segments of customers on the basis of direct signals,
but that he is unable to discriminate between consumers
within a group (either because of unit demands or because
of commodity arbitrage). We will then tackle the newer
concept of second-degree price discrimination by assuming
that the monopolist knows how consumers’ preferences
are distributed within a group but does not know each
consumer's preferences. This will lead to a consideration
of screening or self-selection devices, which is developed
in more detail in the supplementary section. Our study of
second-degree price discrimination will also be extended
to exhibit the analogy between discrimination over quality
spectra and nonlinear tariffs.

3. We encountered an instance of second-degree price discrimination in the
supplementary section of chapter 1, where we saw how time could screen
consumers with different valuations for a durable good. High-valuation consumers
are more eager to buy early. They pay a higher price than low-valuation
buyers, who wait to purchase the good. A difference with the examples below
is that in the earlier example the monopolist actually loses from price
discrimination. As we saw, he would prefer to commit to a fixed (uniform) price
path (also, the model was constructed so that in a one-period framework the
monopolist would be unable to discriminate). In section 3.3, we will assume the
credibility issue away.


3.1 Perfect Price Discrimination


The simplest kind of perfect price discrimination occurs
when a single consumer (or, equivalently, a number of
identical consumers) has unit demand. Suppose that each
consumer has $v$ as his willingness to pay (valuation) for a
good. The monopolist, by charging price $p = v$, extracts
the whole consumer surplus.

Next, consider the case of identical downward-sloping
demands. Suppose that the $n$ consumers in the market all
have the same demand, $q = D(p)/n$, for the monopolist’s
product, and that this demand function (and, therefore,
the aggregate function $q = D(p)$) is known by the monopolist.
By using an appropriate pricing schedule, the
monopolist can increase his profits over those achieved 
with a linear pricing schedule (which yields
$p^m D(p^m) - C(D(p^m))$); he can even extract all of the potential
social surplus. By pricing schedule, or tariff, we mean the 
total amount of money T to be paid by the consumer as a 
function of consumption q. A linear pricing schedule cor- 
responds to a single price: T(q) = pq. An affine pricing 
schedule corresponds to a two-part tariff: T(q) = A + pq. 

Imagine first that the monopolist adopts the competitive
pricing schedule, i.e., $T(q) = p^c q$, where $p^c$ is the competitive
price (see figure 3.1). Let $S^c$ be the corresponding
net consumer surplus:

$$S^c = \int_0^{q^c} [P(q) - p^c]\, dq,$$


where $P(q) = D^{-1}(q)$ is the inverse demand function.
Now suppose that, in order to have the right to buy at
$p^c$, the consumer must pay a fixed premium. This fixed
premium, $A$, may go as high as $S^c/n$ without inducing the
consumers to refrain from buying anything. By offering
the “affine” (but nonlinear) pricing schedule or “two-part
tariff,”

$$T(q) = \begin{cases} p^c q + S^c/n & \text{if } q > 0 \\ 0 & \text{if } q = 0, \end{cases}$$


the monopolist realizes a profit equal to

$$\Pi = S^c + p^c q^c - C(q^c),$$

which is simply the social surplus at the optimum. This is
to be expected, since the marginal price charged by the
monopolist is equal to his marginal cost, and consumers
have zero surplus after the fixed premium $S^c/n$ is subtracted.
It is easy to see that this strategy affords the
monopolist his maximum profit. (In particular, he makes
more profit than with the optimal linear schedule, which
yields a profit equal to $p^m q^m - C(q^m)$.) Unless the monopolist
uses coercive means, the consumers can always
guarantee themselves a zero surplus by not buying at all.
Since the sum of the consumer surplus and the monopoly
profit is equal to the total surplus, the maximal
profit of the monopolist is equal to the maximal total
surplus.

Figure 3.1
($\Pi$ is the area of the shaded section.)
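
To make the comparison concrete, here is a small Python sketch. It assumes a linear aggregate demand $D(p) = a - bp$ and a constant marginal cost $c$, so that the competitive price equals $c$. Neither assumption comes from the text, and the function names are illustrative. It computes the fixed premium $S^c/n$ and the resulting profit, and sets them against the best linear price.

```python
def two_part_tariff(a, b, c, n):
    """Two-part tariff for n identical consumers: marginal price c plus fee S_c / n."""
    marginal_price = c                      # competitive price under constant marginal cost
    q_c = a - b * marginal_price            # aggregate quantity at the competitive price
    s_c = q_c ** 2 / (2 * b)                # net consumer surplus: triangle under P(q) = (a - q) / b
    fee = s_c / n                           # largest premium each consumer accepts
    profit = n * fee + (marginal_price - c) * q_c
    return {"marginal_price": marginal_price, "fixed_fee": fee, "profit": profit}


def linear_monopoly(a, b, c):
    """Best uniform (linear) price for the same demand and cost."""
    p_m = (a + b * c) / (2 * b)
    q_m = a - b * p_m
    return {"price": p_m, "quantity": q_m, "profit": (p_m - c) * q_m}
```

With `a = 10`, `b = 1`, `c = 2`, and `n = 4`, the two-part tariff charges each consumer a premium of 8 and earns 32, the whole social surplus, while the best linear price of 6 earns 16.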


Exercise 3.1*

(i) Show that an alternative way to realize the optimal
profit under discrimination is to charge each consumer
a tariff equal to his gross surplus:

$$T(q) = \frac{1}{n} \int_0^{nq} P(x)\, dx.$$

(ii) Generalize to the case of a monopolist facing a competitive
fringe at $p_0 > p^c$. (By definition, a competitive
fringe is willing to supply any demand at price $p_0$ but
supplies nothing for a lower price.) What is the optimal
two-part tariff? To implement an alternative way of 
appropriating consumer surplus, how can one redefine 
“gross surplus”? 


So far, we have assumed that consumers are identical.
Now suppose that the consumers have different demand
curves and assume that the monopolist knows each individual
demand curve. For the monopolist, the optimal
pricing scheme consists of charging for the marginal unit
a price ($p^c$) equal to the marginal cost and demanding
a personalized fixed premium equal to the net surplus $S_i^c$
from consumer $i$ at the price $p^c$. A special application
of this result is the case where the aggregate demand 
function is derived from unit demand functions (each 
consumer consumes one or zero unit of the good) of 
consumers each of whom has a different willingness to 
pay. Then, the two-part tariff is equivalent to a simple 
system of personalized prices, with each consumer paying 
a sum equal to his willingness to pay.

Of course, there are considerable disclosure problems.
Certainly a consumer is not likely to reveal that he is 
among those willing to pay a higher price for the good. 
This may destroy the possibility of discrimination. For 
instance, when consumers have unit demands and the 
producer knows only the distribution of valuations over 
the population but not the individual valuations, a single 
price is charged; i.e., discrimination does not occur. (The
optimal price is then equal to the monopoly price for the
downward-sloping demand curve formed by the addition
of unit demands.)⁴

We now turn to imperfect price discrimination.


3.2 Multimarket (Third-Degree) Price 
Discrimination 


3.2.1 The Inverse-Elasticity Rule Again 


Suppose that a monopolist produces a single product 
at a total cost of C(q), and that he is able to divide 
the aggregate demand into m “groups” or “markets” on 
the basis of some “exogenous” information (e.g. age, 
sex, occupation, location, or first-time vs. second-time 
buyer). These m groups have m distinct downward- 
sloping demand curves for the product. The demand 
curves are known to the monopolist. We will assume 
that arbitrage cannot occur between groups but that, at 
the same time, the monopolist cannot discriminate (even 
in a second-degree sense) within a group. (These con- 
ditions will have to be checked in each application.) Hence, 
the monopolist charges a linear tariff for each group. 
Let 


$$\{p_1, \ldots, p_i, \ldots, p_m\}$$

denote the prices in the different markets, and let

$$\{q_1 = D_1(p_1), \ldots, q_m = D_m(p_m)\}$$

denote the quantities demanded. Let

$$q = \sum_{i=1}^{m} D_i(p_i)$$

denote the aggregate demand. The monopolist chooses
prices to maximize his profit:

$$\sum_{i=1}^{m} p_i D_i(p_i) - C\left( \sum_{i=1}^{m} D_i(p_i) \right).$$


Formally, this price-discrimination program is a special 
case of the multiproduct monopolist’s pricing problem 
described in chapter 1, where demands are independent 
and costs are (possibly) dependent. We know from this 
analysis that relative price margins are given by the 
inverse-elasticity rule: For all i, 


$$\frac{p_i - C'(q)}{p_i} = \frac{1}{\varepsilon_i},$$

where $\varepsilon_i = -D_i'(p_i) p_i / D_i(p_i)$ is the elasticity of demand
in market $i$. Optimal pricing implies that the monopolist should
charge more in markets with the lower elasticity of demand.

This rule explains why students and senior citizens are 
given discounts by private firms with no redistribution 
intention, why legal and medical services are priced ac- 
cording to the customer's income or amount of insurance 
coverage, why the prices of goods in different countries 
sometimes do not reflect transportation costs and import 
taxes, and why first-time subscribers to a magazine are
given discounts.⁵
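
As a minimal illustration of the inverse-elasticity rule, the Python sketch below assumes constant-elasticity demands $D_i(p) = k_i p^{-\varepsilon_i}$ with $\varepsilon_i > 1$ and a constant marginal cost, in which case the rule gives $p_i = c\,\varepsilon_i/(\varepsilon_i - 1)$. The demand specification and the function names are assumptions made for the example only.

```python
def discriminating_prices(marginal_cost, elasticities):
    """Price in each market from (p - c) / p = 1 / eps, i.e. p = c * eps / (eps - 1)."""
    prices = []
    for eps in elasticities:
        if eps <= 1:
            raise ValueError("the rule needs an elasticity above 1 in every market")
        prices.append(marginal_cost * eps / (eps - 1))
    return prices


def lerner_index(price, marginal_cost):
    """Relative price-cost margin (p - c) / p."""
    return (price - marginal_cost) / price
```

With a marginal cost of 10 and elasticities of 2 and 5, the prices are 20 and 12.5: the less elastic market pays more, and each margin equals the inverse of that market's elasticity.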


3.2.2 Welfare Aspects


When a monopolist’s various products are actually the 
same physical good sold to different markets, an interest- 
ing question is: What would happen if the monopolist 
were forced to charge the same (uniform) price in all 
markets? Comparing the two situations gives a measure 
of the effect of third-degree price discrimination. The 
monopolist is better off under price discrimination, because 
“at worst” he can always charge the uniform price in each 
market. Consumers in low-elasticity markets are adversely
affected by the discrimination and would prefer a uniform price;
consumers in high-elasticity markets prefer discrimination.⁶

4. Discrimination could operate if the monopolist could charge a tariff function
of the probability for the consumer to get the good (i.e., whether the consumer
gets the good or not is not determined in advance, but depends on the “toss of
a coin”). However, it can be shown that if both the monopolist and the consumers
are “risk-neutral,” such complicated mechanisms are not optimal for the
monopolist.

5. In this last example, price discrimination could also be explained by matching
issues (see the supplementary section of chapter 2). The idea here is more
similar to that developed in the supplementary section of chapter 1; the consumers
who have not yet subscribed are perceived by the monopolist as less
eager to buy the magazine than those who have already subscribed.

To compute the associated change in welfare, assume
constant returns to scale: $C(\sum_i q_i) = c(\sum_i q_i)$.⁷

Under discrimination, the monopolist charges $p_i$ in
market $i$. The demand is $q_i = D_i(p_i)$. The aggregate net
consumer surplus is $\sum_i S_i(p_i)$, and the firm’s profit is
$\sum_i (p_i - c) q_i$. Suppose next that discrimination is prohibited.
The monopolist charges a uniform price $\bar{p}$ and
sells $\bar{q}_i = D_i(\bar{p})$ in market $i$. The profit is $\sum_i (\bar{p} - c)\bar{q}_i$, and
the consumer surplus is $\sum_i S_i(\bar{p})$. Let $\Delta q_i \equiv q_i - \bar{q}_i$.

The difference in total welfare between discrimination
and no discrimination is equal to the change in surplus
plus the change in profits:

$$\Delta W = \left\{ \sum_i [S_i(p_i) - S_i(\bar{p})] \right\} + \left\{ \sum_i (p_i - c) q_i - \sum_i (\bar{p} - c)\bar{q}_i \right\}.$$


We now derive an upper and a lower bound for AW. 
To do so, we make use of the fact that the net surplus 
function is convex in the market price. (Recall that the 
derivative of the net surplus with respect to price is
equal to minus the demand, and that the demand curve
is downward sloping: $S'(p) = -D(p)$ implies
$S''(p) = -D'(p) > 0$.)

A well-known property of a convex function is that it
is everywhere above its tangents, as figure 3.2 shows.
Hence,

$$S_i(p_i) - S_i(\bar{p}) \geq S_i'(\bar{p})(p_i - \bar{p}).$$

Using this inequality and the fact that $S_i'(\bar{p}) = -D_i(\bar{p})$,
we obtain

$$\Delta W \geq \sum_i (p_i - c)\Delta q_i. \qquad (3.1)$$
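
The step from the tangent inequality to (3.1) is left to the reader. Spelled out, and using only the definitions above ($\Delta q_i = q_i - \bar{q}_i$ and $S_i'(\bar{p}) = -\bar{q}_i$), it could read as follows:

```latex
\begin{align*}
\Delta W
  &= \sum_i \bigl[ S_i(p_i) - S_i(\bar{p}) \bigr]
     + \sum_i (p_i - c)\, q_i - \sum_i (\bar{p} - c)\, \bar{q}_i \\
  &\geq -\sum_i \bar{q}_i (p_i - \bar{p})
     + \sum_i (p_i - c)\, q_i - \sum_i (\bar{p} - c)\, \bar{q}_i \\
  &= \sum_i (p_i - c)\, q_i - \sum_i (p_i - c)\, \bar{q}_i \\
  &= \sum_i (p_i - c)\, \Delta q_i .
\end{align*}
```

The third line collects the two terms in $\bar{q}_i$, whose coefficients $(p_i - \bar{p})$ and $(\bar{p} - c)$ sum to $(p_i - c)$.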


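6. Under a uniform price $\bar{p}$ and a constant marginal cost $c$, the monopolist
maximizes

$$(\bar{p} - c)\left( \sum_i D_i(\bar{p}) \right),$$

which yields a price-cost margin of

$$\frac{\bar{p} - c}{\bar{p}} = -\left( \sum_i D_i(\bar{p}) \right) \Big/ \left( \bar{p} \sum_i D_i'(\bar{p}) \right) = \left( \sum_i D_i(\bar{p}) \right) \Big/ \left( \sum_i D_i(\bar{p})\, \varepsilon_i \right).$$

Figure 3.2

Similarly, we have

$$S_i(\bar{p}) - S_i(p_i) \geq S_i'(p_i)(\bar{p} - p_i),$$

which yields

$$\Delta W \leq (\bar{p} - c)\left( \sum_i \Delta q_i \right). \qquad (3.2)$$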


That price discrimination reduces welfare if it does not 
increase total output (a fact implied by equation 3.2) is 
intuitive. Price discrimination causes marginal rates of 
substitution to differ among consumers, and is thus so- 
cially inferior to uniform pricing if the goal is to distribute 
a given amount of good between them. Thus, a necessary 
condition for price discrimination to be preferred socially 
is that it raise total output (i.e., that it reduce the tradi- 
tional monopoly pricing distortion). (To obtain the 
bounds 3.1 and 3.2, we make no assumptions about the 
firm's behavior. However, to obtain specific results we 
will need to use the fact that the producer maximizes 
monopoly profits.) 


Thus, the inverse of the price-cost margin is a weighted average of the
elasticities of demand, where the weights are the quantities demanded. (As an
exercise, write the price-cost margin as a weighted average of the inverse
elasticities.) Hence

$$\min_i (1/\varepsilon_i) \leq (\bar{p} - c)/\bar{p} \leq \max_i (1/\varepsilon_i).$$

7. This discussion is inspired by Varian 1985. Hausman and Mackie-Mason
(1986) give general conditions under which price discrimination is welfare
improving when the production technology exhibits increasing returns to scale.


3.2.2.1 An Application to Linear Demand Curves


Assume that the demand curve in market $i$ is $q_i = a_i - b_i p$.
Assume further that, for all $i$, $a_i > c b_i$. This condition
ensures that under discrimination all markets are served.
The monopolist, if he can discriminate, chooses a price
$p_i$ in market $i$ so as to maximize $(p_i - c)(a_i - b_i p_i)$.
Straightforward computations show that

$$p_i = (a_i + c b_i)/2b_i$$

and

$$q_i = (a_i - c b_i)/2.$$


Suppose next that the monopolist is forced to charge
a uniform price $\bar{p}$ across markets. Assume that all markets
are served at the optimum (see the caveat below). The
monopolist chooses $\bar{p}$ so as to maximize

$$(\bar{p} - c)\left[ \sum_i a_i - \left( \sum_i b_i \right)\bar{p} \right].$$

This leads to

$$\bar{p} = \left[ \sum_i a_i + c\left( \sum_i b_i \right) \right] \Big/ \left[ 2\sum_i b_i \right].$$

Total output is the same in the two arrangements:
$\sum_i q_i = \sum_i \bar{q}_i$, or $\sum_i \Delta q_i = 0$, a result obtained by
Robinson (1933). From equation 3.2, welfare is lower
under price discrimination (Schmalensee 1981).
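
The linear case is easy to check numerically. The following Python sketch evaluates both pricing regimes for demands $q_i = a_i - b_i p$, assuming (as in the text) that every market is served under the uniform price. The function name and the returned keys are illustrative. It reports the welfare change together with the bounds (3.1) and (3.2).

```python
def compare_linear(a, b, c):
    """Welfare under discrimination minus welfare under a uniform price, linear demands."""
    total_a, total_b = sum(a), sum(b)
    # Discriminatory prices and quantities, market by market.
    p = [(ai + c * bi) / (2 * bi) for ai, bi in zip(a, b)]
    q = [ai - bi * pi for ai, bi, pi in zip(a, b, p)]
    # Uniform price, assuming all markets are served at the optimum.
    p_bar = (total_a + c * total_b) / (2 * total_b)
    q_bar = [ai - bi * p_bar for ai, bi in zip(a, b)]
    if min(q_bar) <= 0:
        raise ValueError("a market is not served under the uniform price")

    def welfare(prices, quantities):
        # Net surplus of a linear demand is q^2 / (2b); profit is (p - c) q.
        return sum(qi ** 2 / (2 * bi) + (pi - c) * qi
                   for pi, qi, bi in zip(prices, quantities, b))

    dq = [qi - qbi for qi, qbi in zip(q, q_bar)]
    return {
        "delta_w": welfare(p, q) - welfare([p_bar] * len(a), q_bar),
        "lower_bound": sum((pi - c) * d for pi, d in zip(p, dq)),  # bound (3.1)
        "upper_bound": (p_bar - c) * sum(dq),                      # bound (3.2)
        "output_change": sum(dq),
    }
```

For `a = [10, 6]`, `b = [1, 1]`, and `c = 2`, total output is unchanged, the upper bound is zero, and welfare falls by 1 under discrimination, inside the interval from -2 to 0 given by the two bounds.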


3.2.2.2 Caveat 


The conclusion that price discrimination lowers welfare 
when the demand curve is linear is contingent on the 
assumption that all markets are served under a uniform 
price. This assumption is actually quite strong. When
forced to charge a uniform price, the monopolist de facto
“robs Peter to pay Paul”: he raises the price in high-elasticity
markets and lowers that in low-elasticity
markets (see footnote 6). The increase in price in the high-elasticity
markets may induce consumers in those markets
to stop purchasing.

To see that the welfare conclusion can be reversed,
suppose that there are two markets ($m = 2$) and that,
under a uniform price, the second market is not served.
The uniform price is then equal to the monopoly price for
the first market: $\bar{p} = p_1$ and $\bar{q}_1 = q_1$. On the other hand,
$\bar{q}_2 = 0 < q_2$. From equation 3.1, welfare is higher under
price discrimination. Indeed, price discrimination leads to
a Pareto improvement. The monopolist makes more
profit, and the surplus of the consumers in market 2
becomes positive while that of the consumers in market 1
is unchanged.
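
A numerical instance of the caveat can be sketched in Python as follows. The code assumes truncated linear demands $q_i = \max(0, a_i - b_i p)$ with $a_i > c b_i$ and lets the monopolist pick the uniform price that maximizes profit, which may mean dropping a market. The helper names are hypothetical. Because the profit function is piecewise quadratic, it is enough to compare the stationary point of each piece and the prices at which a market shuts down.

```python
def demand(a, b, p):
    """Truncated linear demand: nothing is bought above the choke price a / b."""
    return max(0.0, a - b * p)


def surplus(a, b, p):
    """Net consumer surplus of a linear demand at price p."""
    return demand(a, b, p) ** 2 / (2 * b)


def best_uniform_price(a, b, c):
    """Profit-maximizing single price when markets may be left unserved."""
    def profit(p):
        return (p - c) * sum(demand(ai, bi, p) for ai, bi in zip(a, b))

    # Markets drop out in order of choke price; test each kink and each
    # stationary point obtained by serving only the top-k markets.
    order = sorted(range(len(a)), key=lambda i: a[i] / b[i], reverse=True)
    candidates = [a[i] / b[i] for i in order]
    sum_a = sum_b = 0.0
    for i in order:
        sum_a += a[i]
        sum_b += b[i]
        candidates.append((sum_a + c * sum_b) / (2 * sum_b))
    return max(candidates, key=profit)


def welfare_gain_from_discrimination(a, b, c):
    """Return the uniform price and welfare(discrimination) - welfare(uniform)."""
    p_bar = best_uniform_price(a, b, c)
    gain = 0.0
    for ai, bi in zip(a, b):
        p_i = (ai + c * bi) / (2 * bi)
        w_disc = surplus(ai, bi, p_i) + (p_i - c) * demand(ai, bi, p_i)
        w_unif = surplus(ai, bi, p_bar) + (p_bar - c) * demand(ai, bi, p_bar)
        gain += w_disc - w_unif
    return p_bar, gain
```

With `a = [10, 3]`, `b = [1, 1]`, and `c = 2`, the uniform price is 6, the monopoly price of the first market, and the second market is closed. Allowing discrimination raises welfare by 0.375, all of it coming from market 2.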


3.2.2.3 Summary 


To summarize our welfare analysis: The welfare effects of 
third-degree price discrimination are ambiguous. One has 
to weigh the losses of consumers in low-elasticity markets 
against the gains of those in high-elasticity markets and of 
the producer. The elimination of price discrimination may 
be particularly dangerous if it leads to the closure of 
markets.⁹

The previous discussion assumes that welfare is equal
to the sum of all consumer surpluses and profits, i.e., that
the government has efficiency concerns but no redistribution
concerns. Of course, one of the main policy issues
in regard to price discrimination is its effect on income 
distribution. As we saw, price discrimination redistributes 
income away from the low-elasticity groups toward con- 
sumers in the high-elasticity groups and the monopolist. 
Of course, raising monopoly profits at the expense of 
consumers may not be very desirable. On the other hand, 
the low-elasticity groups are often (but not always) the 
richer consumers.‘ Thus, it is clear that one cannot a 


8. All markets are served if, for all $i$, $a_i - b_i \bar{p} > 0$.


9. The higher profit associated with discrimination may also increase wasteful
competition to obtain the monopoly rent. This led Posner (1976) to notice that
if the monopoly rent is completely dissipated (see section 1.3), price discrimina-
tion is more likely to be harmful. It now suffices that consumer surplus is lower
under discrimination for uniform pricing to be preferable. Indeed, under perfect
price discrimination there is no surplus left and price discrimination is clearly
detrimental, a reversal of our previous conclusion. (However, even with
wasteful rent dissipation, price discrimination is not always harmful. See the
previous two-class example, in which uniform pricing leads to the closure of
one market.) This remark is, of course, also valid for second-degree price
discrimination. As usual, the validity of the wasteful rent dissipation argument
requires further analysis.


10. In the examples given at the end of subsection 3.2.1, the high-elasticity
groups are students, the elderly, indigent patients and legal assistees, or poor
countries.

priori make a case against price discrimination on the basis
of income distribution.


3.2.3 Applications 


A few examples of discrimination based on observable 
characteristics follow. 


3.2.3.1 Application 1: Spatial Discrimination 


Consider the location model described in section 2.1. 
Suppose that a monopolist produces a good at a single 
plant. To simplify, assume that the transportation cost per 
unit of good is proportional to the distance to the plant; 
i.e., it is $tx$ for a customer located at a distance $x$. Suppose
that the producer transports the good himself. He charges
this customer a delivered price $p(x)$; equivalently, he could
charge a free-on-board (f.o.b.) or mill price of $p(x) - tx$
and let the customer take care of the transportation (of 
course, this second policy would tend to lend itself to 
arbitrage). The production cost is c per unit. 

Suppose that the firm's customers are located at various 
distances from the plant. To focus on the spatial issue, 
assume that all customers have the same demand at a 
given delivered price; let $q = D(p)$ denote this demand.
Assume that the firm charges the same price to all 
customers located at the same location. (One could, for 
instance, imagine that customers located at the same site 
engage in arbitrage. For a discussion of the possibility 
of arbitrage between sites, see below.) The monopolist 
maximizes monopoly profit in the market formed by 
customers located at x; in other words, he maximizes 
(p — tx — c)D(p). 

The transportation cost (t) is analogous to an excise 
tax. In the terminology of chapter 1, p is equivalent to 
the “consumer price” and p — tx to the “producer price.” 
We know from chapter 1 that, in general, excise taxes
are not fully passed on to the consumer. For instance, for
a linear demand, the producer bears half of an increase in
the excise tax. For an exponential demand function, the
consumer bears the entirety of any tax increase. For a
constant-elasticity demand function, the consumer bears
more than the entirety of a tax increase (see exercise 1.3).


11. See pp. 24-25 of Phlips 1983 for examples of uniform delivered prices in
the cement, plasterboard, and clay brick industries, and for an analysis of the
use for discriminatory purposes of the basing-point system. The basing-point
system expresses the delivered price as a base price plus the transport cost from

Reinterpreted in our location model, an increase of $dx$
in the distance to the plant implies an increase in the
(delivered) monopoly price of $dp = t\,dx$ in the exponential
case, $dp = t\,dx/2$ in the linear case, and $dp = t\,dx/(1 - 1/\varepsilon)$
in the constant-elasticity case. An exponential demand
leads to the absence of price discrimination: Any differ-
ence in cost is fully reflected in the delivered price. In
other words, the f.o.b. price is the same for all customers.
In the linear-demand case, the consumer does not pay the
full transportation cost. The monopolist practices “freight
absorption” and discriminates against his nearest custom-
ers. In the constant-elasticity-of-demand case, discrimina-
tion plays against the farthest consumers.
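
The three pass-through rates can be checked numerically. The sketch below maximizes $(p - tx - c)D(p)$ at two distances for each demand family and compares the resulting slope of the delivered price with $t/2$, $t$, and $t/(1 - 1/\varepsilon)$. The specific demand functions, the parameter values, and the golden-section search are illustrative assumptions, not taken from the text.

```python
import math


def argmax_golden(f, lo, hi, tol=1e-9):
    """Golden-section search for the maximizer of a unimodal f on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    x1 = hi - inv_phi * (hi - lo)
    x2 = lo + inv_phi * (hi - lo)
    f1, f2 = f(x1), f(x2)
    while hi - lo > tol:
        if f1 < f2:                      # maximizer lies in [x1, hi]
            lo, x1, f1 = x1, x2, f2
            x2 = lo + inv_phi * (hi - lo)
            f2 = f(x2)
        else:                            # maximizer lies in [lo, x2]
            hi, x2, f2 = x2, x1, f1
            x1 = hi - inv_phi * (hi - lo)
            f1 = f(x1)
    return 0.5 * (lo + hi)


def delivered_price(demand, x, t, c, p_hi):
    """Delivered monopoly price at distance x: argmax of (p - t*x - c) * D(p)."""
    unit_cost = c + t * x
    return argmax_golden(lambda p: (p - unit_cost) * demand(p), unit_cost, p_hi)


def pass_through(demand, t, c, p_hi, x0=0.0, x1=1.0):
    """Change in the delivered price per unit of distance between x0 and x1."""
    p0 = delivered_price(demand, x0, t, c, p_hi)
    p1 = delivered_price(demand, x1, t, c, p_hi)
    return (p1 - p0) / (x1 - x0)


# Hypothetical parameters and demand functions.
t, c, eps = 1.0, 1.0, 2.0
linear = lambda p: max(10.0 - p, 0.0)
exponential = lambda p: math.exp(-p)
const_elast = lambda p: p ** (-eps)

slope_linear = pass_through(linear, t, c, p_hi=10.0)
slope_exponential = pass_through(exponential, t, c, p_hi=50.0)
slope_const_elast = pass_through(const_elast, t, c, p_hi=50.0)

# The f.o.b. price p(x) - t*x therefore changes with distance at rate slope - t.
for name, slope in [("linear", slope_linear),
                    ("exponential", slope_exponential),
                    ("constant elasticity", slope_const_elast)]:
    print(f"{name}: dp/dx = {slope:.4f}, d(f.o.b.)/dx = {slope - t:.4f}")
```

A negative f.o.b. slope corresponds to freight absorption, a zero slope to no discrimination, and a positive slope to discrimination against the farthest customers.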

The possibility of arbitrage between consumers at dif-
ferent locations invalidates the type of discrimination that
would otherwise occur under constant elasticity of de-
mand. If the transport technology used by the monopolist
is commonly available, the customers located near the
factory will buy and resell to the consumers located farther 
away. In such contexts the producer must content himself 
with nondiscriminatory prices. The policy of freight ab- 
sorption is immune to the possibility of arbitrage between 
the consumers. Thus, although the way discrimination 
operates is theoretically very sensitive to the demand 
function, the possibility of arbitrage suggests that only 
freight absorption will be observed. 

Freight absorption is commonly observed in practice.¹¹
The possibility of arbitrage may not be the only explana- 
tion for this. In some circumstances, our assumption that 
the demand function is the same at each location is likely 
to be violated. Consumers located farther away from the 
factory are more likely to be able to buy from an alter- 
native source than consumers nearby; a cement consumer 
located 50 miles from each of two competing cement 
producers that are 100 miles apart has more substitution 
possibilities than a consumer located 10 miles from one 
producer and 90 miles from the other. Price discrimination 
is, then, a useful way for each producer to compete for the 
marginal consumers without granting price concessions 
to the inframarginal (captive) ones. At the current stage, it
suffices to think of the elasticity of demand as increasing


a basing point (which need not be the factory's location). Basing-point pricing
is often used as a collusive device that allows competitors to coordinate their
pricing policies and to easily detect defections from collusive behavior.

with the distance to the factory. Strategic interactions
between spatial oligopolists will be considered in chapter 7.


Exercise 3.2** Suppose that consumers have a linear de-
mand function and are located uniformly from distance
$x = 0$ to distance $x = 1$. The transportation cost to dis-
tance $x$ is $tx$.

(i) Compute the optimal f.o.b. prices when discrimina-
tion is allowed.

(ii) Suppose that transportation is operated through a
competitive sector at the same unit cost, $t$. Compute the
optimal uniform (i.e., nondiscriminatory) f.o.b. price, as-
suming that the whole market is served. Can one say that
in the no-discrimination case some consumers “cross- 
subsidize” others? 

(iii) Which arrangement serves the largest market in 
general? 


3.2.3.2. Application 2: Vertical Controls as an 
Instrument of Discrimination 


Suppose that a monopolist produces a good which is used
as an input by two competitive industries producing dif-
ferent final goods (or, alternatively, that the monopolist’s
good is sold by competitive retailers to two classes of
consumers). The final goods ($i = 1, 2$) face two indepen-
dent demands, which have elasticities such that $\varepsilon_2 > \varepsilon_1$
(for the following, it may be convenient to think of
constant-elasticity demands). Also assume, for simplicity,
that each industry converts one unit of input into one unit
of output.

Because of the technology and the fact that down-
stream industries are competitive, each final good’s price
is equal to the intermediate price charged to the industry
manufacturing it. Hence, everything is as if the monopo-
list sold his good directly to the consumers in the two
industries. The downstream industries are but a “veil.”
Hence, the intermediate good’s optimal prices are
The solution in which the mc
downstream technologies hims 
is called the “vertically integra 

There is a caveat, howevet 
integrated profit, the monopol 
arbitrage between the two in 
buy the intermediate good at t 
it to industry 1, If the monopol. 
directly by precluding industr 
the input to industry 1 and the 
discrimination, his optimal solu 
ing to industry 2. To accom; 
can buy a firm in industry 2 
price at p3, and sell the interm 
other firms (see figure 3.3). Or 
will buy the good, since the fi 
be able to compete effectively 
sidiary in the sale of consumpti 
the United States, Alcoa had 
production of virgin-aluminur 
good, It integrated into marke 
demand (rolled sheet) and, thr 
the intermediate good, squeeze 
tors in these markets. '3 

Therefore, vertical integrati 
stitute for price discriminatior 
cannot control directly the resa 
buyers, and legal rules against 
lead to vertical integration, 


12. For more on vertical control and third-degree price discrimination, see
Perry 1978.


13. For a general discussion of price squeezes in nonregulated and regulated
environments, see Joskow 1985. Judge Learned Hand, who wrote the opinion
in the Alcoa case, proposed the “transfer price test” to assess whether a firm is
engaged in a price squeeze. This test considers whether the integrated firm can


sell the final output profitably at prevailins 
same price for the input it produces inter 
competitors. If the answer is No, then the fr 
In the Alcoa case, the Court found that Ale« 
been negligible or negative if the downstr 
rather than the internal price to the upstreai 


A monopolist who adopts the “reverse policy” and 
serves the low-elasticity market himself may have to im- 
pose some further control over the industry serving the 
other market. For example, in an antitrust case in the 
United States concerning the White Motor Company, the 
industry with low demand elasticity was made up of 
various public agencies (local and federal) and was served 
by the manufacturer himself. The one with high demand 
elasticity was a private market serviced by retailers. The 
manufacturer had forbidden these retailers to sell to public 
agencies. 


3.2.4 Third-Degree Price Discrimination in 
Intermediate-Goods Markets 


The previous analysis assumed that the buyers of the 
monopolist's good are consumers—i.e., that the good is 
a final one. However, U.S. laws restricting third-degree 
price discrimination are particularly concerned with inter-
mediate-goods markets. The aim of the Robinson-Patman
Act (which has not been very strictly enforced lately)
was to protect small businesses from “unfair” advantages
possessed by large buyers in such markets. A typical
situation that the act was concerned with is that of a chain
store that is able to obtain discounts below the prices 
charged to local stores from suppliers. Katz (1987) notes 
two reasons why the previous analysis cannot straightfor- 
wardly be applied to intermediate-goods markets. First, the 
demand of the “consumers” is interdependent. For in- 
stance, the chain store and the local store compete on the 
product market; thus, how much one of the competitors 
purchases from the monopolist depends not only on the 
price charged to him but also on the price charged to the 
other competitor. Solving for product-market competition, 
one obtains a case of interdependent demands (see chap-
ter 1). Second, distributors are much more likely than con-
sumers to integrate backward into production. In partic-
ular, the chain store, if it does not obtain advantageous
conditions from the monopolist, may decide to supply the
good itself. (The local store is less likely to do so, because 
of the existence of increasing returns to scale at the pro- 
duction stage—for example, the fixed cost of production 
may be too high for a low-volume producer.)

Suppose that a monopolist f
heterogeneous consumers. If 
consumer's taste, he can, in 


14. That raising the price for the local store reduces the chain store's incentive
to integrate is by no means trivial. If the conditions alluded to are not satisfied


‘other patterns may emerge For instanc 
may go down while that to the chain st. 


packages or bundles (price and quantity, price and quality,
etc.) to consumers. He then achieves perfect discrimina-
tion. Suppose, however, that the monopolist cannot tell
the consumers apart. In particular, assume (in contrast
with the preceding section) that there is no exogenous
signal of each consumer's demand function (such as age or
occupation).¹⁵ This does not mean that the monopolist
will not try to discriminate between consumers and will
content himself with a unique bundle for all consumers.
He can offer a menu of bundles to choose from. In doing
so he must, however, take into account the possibility of
personal arbitrage, i.e., the possibility that a consumer
to whom a given bundle is directed may want to choose 
a bundle directed to another consumer. This introduces 
“self-selection” or “incentive-compatibility” constraints, 
which, in general, make perfect price discrimination im- 
possible (see the example in section 3.1 involving con- 
sumers with unit demands). 

This section starts with a simple example of two-part 
pricing. Because a two-part tariff is generally not optimal, 
it then proceeds to a consideration of more general non- 
linear pricing schemes. It goes on to demonstrate the 
strong analogy between discrimination through price- 
quantity bundles and discrimination through price-quality 
bundles. Optimal pricing is considered in more detail in 
the supplementary section, along with discrimination in
the insurance market.


3.3.1 Two-Part Tariffs¹⁶


A two-part tariff ($T(q) = A + pq$) offers a menu of bun-
dles $\{T, q\}$ (a continuum of them, actually) located on a
straight line. Unlike the line representing a pure linear
tariff, this straight line need not pass through the origin.

Two-part tariffs are commonly used in practice. Table 
3.1 gives a few examples. 

The main appeal of two-part tariffs is their simplicity. 
For instance, what the consumer pays to enter an amuse- 
ment park could be (and sometimes is) a more compli- 
cated function of the number of rides than it is under a 
two-part tariff; however, it is costly to keep track of the 
number of rides each person has taken. Sometimes two- 
part tariffs may also be justified by the possibility of 


15. Alternatively, one could look at price discrimination within a class identi-
fied by a signal.

decreasing marginal utility o
sentation of the utility funct
that varies across consumer
consumers

16. For more on two-part tariffs, see Oh


“marginal utility of income.” Hence, we can generate 
apparent differences in tastes simply from differences in 
income.¹⁷

Assume that there are two groups of consumers. Con-
sumers with taste parameter $\theta_1$ are in proportion $\lambda$; those
with taste parameter $\theta_2$ are in proportion $1 - \lambda$. (The
absolute number of consumers does not matter with con-
stant marginal cost, and can be normalized to 1.) Assume
that $\theta_2 > \theta_1$ and that the monopolist produces at con-
stant marginal cost $c < \theta_1 < \theta_2$.

To simplify the computations, assume that

$$V(q) = \frac{1 - (1 - q)^2}{2}$$

(so that $V'(q) = 1 - q$ is linear in quantity).

We shall consider, in sequence, the case of perfect 
discrimination, the case of uniform, nondiscriminatory 
monopoly pricing, and the case of a two-part tariff. (We 
are interested in the first two cases only for comparison 
with the third.) 

As a preliminary, let us compute the demand func-
tion for consumer $\theta_i$ facing marginal price $p$. (With this
demand function, the fixed premium affects only the
decision whether to buy the good. If he has decided
to buy, the consumer does not take it into account when
choosing how much of the good to buy.) The consumer
maximizes

$\{\theta_i V(q) - pq\}$,

which yields

$\theta_i V'(q) = p$.

For our specification of preferences, $\theta_i(1 - q) = p$. The
demand function is thus

$q = D_i(p) = 1 - p/\theta_i$.

The net consumer surplus is

$S_i(p) = \theta_i V(D_i(p)) - p D_i(p)$

(leaving out the fixed fee, if any). In this special case,
substituting $D_i(p)$ into $V$ gives

$S_i(p) = \frac{(\theta_i - p)^2}{2\theta_i}$.

Figure 3.4

(Note that $S_i(\theta_i) = 0$, and tha
for 0, types.) The demand «
are depicted in figure 3.4

Then the aggregate demand


3.3.1.1 Perfect Discrimination


3.1, he can charge a margin:
personalized fixed premium
surplus at price c. For con
premium is


17. The analysis in this subsection, as well as that for the fully nonlinear case,
could easily be extended to preferences $V(q, \theta) - T$ as long as $\partial V/\partial \theta > 0$ and
$\partial^2 V/\partial q\,\partial \theta > 0$.


$$A_i = S_i(c) = \frac{(\theta_i - c)^2}{2\theta_i}.$$

The fixed premium is naturally higher for the high-demand
consumer. The monopolist’s profit is

$$\Pi_1 = \lambda\,\frac{(\theta_1 - c)^2}{2\theta_1} + (1 - \lambda)\,\frac{(\theta_2 - c)^2}{2\theta_2}.$$

As we saw in section 3.1, welfare is optimal (as long 
as there is no redistributive concern). 

If the monopolist did not observe the consumers’ types, 
the perfect-discrimination allocation could not be imple- 
mented. The high-demand consumers, whose surplus is 
fully extracted, would have an incentive to claim that 
they are low-demand consumers. Doing so would give 
them a strictly positive utility, because for the low-demand 
bundle the low-demand consumers have zero utility.¹⁸
Hence, the high-demand consumers would exercise per- 
sonal arbitrage. (The low-demand consumers would not.) 
In subsection 3.3.2 it will be shown that preventing the 
high-demand consumers from buying the bundle directed 
to the low-demand ones is costly to the monopolist. 
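
The incentive problem can be made concrete with a short computation. The sketch below builds the two perfect-discrimination bundles (marginal price $c$, fixed premium $S_i(c)$) and evaluates what each type would obtain by choosing the other type's bundle. The parameter values ($\theta_1 = 2$, $\theta_2 = 3$, $\lambda = 0.8$, $c = 1$) and all function names are hypothetical choices for the illustration.

```python
def V(q):
    """V(q) = (1 - (1 - q)^2) / 2, as in the text."""
    return (1.0 - (1.0 - q) ** 2) / 2.0


def demand(theta, p):
    """D_i(p) = 1 - p/theta_i."""
    return max(1.0 - p / theta, 0.0)


def fixed_premium(theta, c):
    """A_i = S_i(c): the whole net surplus of type theta at marginal price c."""
    q = demand(theta, c)
    return theta * V(q) - c * q


def utility(theta, q, payment):
    """Net utility of type theta from the bundle (q, payment)."""
    return theta * V(q) - payment


# Hypothetical parameters.
theta1, theta2, lam, c = 2.0, 3.0, 0.8, 1.0

A1, A2 = fixed_premium(theta1, c), fixed_premium(theta2, c)
q1, q2 = demand(theta1, c), demand(theta2, c)
T1, T2 = A1 + c * q1, A2 + c * q2          # total payments for each bundle

profit_perfect = lam * A1 + (1.0 - lam) * A2

u_high_own = utility(theta2, q2, T2)        # surplus fully extracted
u_high_mimic = utility(theta2, q1, T1)      # high type takes the low bundle
u_low_mimic = utility(theta1, q2, T2)       # low type takes the high bundle

print("premiums:", A1, A2, "profit:", profit_perfect)
print("high type, own bundle:", u_high_own, "low bundle:", u_high_mimic)
print("low type, high bundle:", u_low_mimic)
```

In this example the high-demand type gains by taking the low-demand bundle, whereas the low-demand type would lose by taking the high-demand bundle, which is the asymmetry described above.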


3.3.1.2 Monopoly Price 


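Suppose that there is full arbitrage between consumers, so
that the monopolist is forced to charge a fully linear tariff:
$T(q) = pq$. The monopoly price, $p_2$ (or $p^m$), maximizes
$(p - c)D(p)$, where $D(p)$ is the aggregate demand:
$D(p) = \lambda D_1(p) + (1 - \lambda) D_2(p) = 1 - p/\theta$, with
$1/\theta \equiv \lambda/\theta_1 + (1 - \lambda)/\theta_2$. Thus, the monopoly price is

$$p_2 = \frac{c + \theta}{2},$$

and the monopoly profits are

$$\Pi_2 = \frac{(\theta - c)^2}{4\theta}.$$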

A caveat: These computations assume that the monopo-
list decides to serve the two types of consumers. Another
strategy might be to serve only type-$\theta_2$ consumers. Such
a strategy will be optimal if the monopoly price rela-
tive to this category (which is $(c + \theta_2)/2$) exceeds $\theta_1$ and


the fraction of type-0, cor 
reduce the number of ca 
assume either that 


$(c + \theta_2)/2 < \theta_1$


or that $\lambda$ is not too small
under a linear tariff.

3.3.1.4 Comparison

esting is the comparison o

18. More formally, they would get a utility
$(\theta_2 - \theta_1) V(D_1(c)) > 0$.

Figure 3.5
A two-part tariff (indifference curves $\theta_1 V(q) - T$ and $\theta_2 V(q) - T$)

all types are served),

$p_1 = c < p_3 < p_2 = p^m$.

Thus, the marginal price is intermediate between the com-
petitive price (which would be paid by the marginal con-
sumer under perfect discrimination) and the monopoly
price. The intuition is as follows: Starting from the monop-
oly price $p^m$, consider a small reduction of price ($dp < 0$).
By the definition of the monopoly price, the change in
price has only a second-order effect on the variable profit
$(p - c)D(p)$. However, the consumers’ surplus increases
by an amount proportional to the reduction in price, a
first-order effect. In particular, the monopolist can in-
crease the fixed premium by an amount proportional to
the reduction,

$\delta A = \delta S_1(p) = -D_1(p)\,dp > 0,$

and can therefore gain. Next, start from the competitive
price. Raising the price by $dp > 0$ and lowering the fixed
premium correspondingly to keep the type-$\theta_1$ consumers
indifferent between buying and not buying leads to the
same profit from these consumers, because perfect price
discrimination is realized for these consumers. Thus, a
small change in price has only a second-order effect on

19. Remember that aggregate welfare is

$\lambda S_1^g(p) + (1 - \lambda) S_2^g(p) - c[\lambda D_1(p) + (1 - \lambda) D_2(p)]$.

It decreases with $p$ when $p > c$. ($S_i^g$ is consumer $i$'s gross surplus.)
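
The ranking $p_1 = c < p_3 < p_2$ can be illustrated numerically. The sketch below assumes, as in the intuition above, that both types are served and that the fixed premium is set at $A = S_1(p)$, so that the type-$\theta_1$ consumers are just willing to buy; the two-part tariff profit is then $S_1(p) + (p - c)D(p)$. The parameter values and the grid search are assumptions made for the example.

```python
# Hypothetical parameters (both types are served at the resulting prices).
theta1, theta2, lam, c = 2.0, 3.0, 0.8, 1.0


def D(theta, p):
    """Individual demand D_i(p) = 1 - p/theta_i."""
    return max(1.0 - p / theta, 0.0)


def S(theta, p):
    """Net surplus S_i(p) = (theta_i - p)^2 / (2 theta_i) for p below theta_i."""
    return (theta - p) ** 2 / (2.0 * theta) if p < theta else 0.0


def aggregate_demand(p):
    return lam * D(theta1, p) + (1.0 - lam) * D(theta2, p)


def linear_profit(p):
    """Profit under a linear tariff T(q) = p*q."""
    return (p - c) * aggregate_demand(p)


def two_part_profit(p):
    """Fixed premium A = S_1(p) paid by every consumer, plus the unit margin."""
    return S(theta1, p) + linear_profit(p)


# Search over marginal prices at which the low-demand type still buys.
steps = 10_000
grid = [c + (theta1 - c) * k / steps for k in range(steps + 1)]
p3 = max(grid, key=two_part_profit)
p2 = max(grid, key=linear_profit)

profit1 = lam * S(theta1, c) + (1.0 - lam) * S(theta2, c)  # perfect discrimination
profit2 = linear_profit(p2)                                # linear tariff
profit3 = two_part_profit(p3)                              # two-part tariff

print("marginal prices: p1 =", c, " p3 =", round(p3, 4), " p2 =", round(p2, 4))
print("profits: linear =", round(profit2, 4),
      " two-part =", round(profit3, 4),
      " perfect discrimination =", round(profit1, 4))
```

With these values the two-part tariff's marginal price lies strictly between marginal cost and the linear monopoly price, and the profits are ranked in the same order as the pricing flexibility.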
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3.3.1.5 A Variant: Tie-in Si 
Tool 


Sometimes a manufacturer p 
is consumed in a fixed quant 
a complementary good—w 
competitive industry—is co 
For instance, consider the 


20. This exercise demonstrates a spec
(1978).


21. This exercise shows that everybc
linear tariff, even under the informati
This does not mean that the optimal n
dominates (or even is welfare-superior


computers. The manufacturer of the computer (the good 
consumed in fixed quantity) would generally gain by 
requiring that the customers purchase the complementary 
good (the punch cards) from him as well, if he could 
avoid arbitrage (i.e., if he could check that the customers 
did not purchase the complementary good on the compe- 
titive market). Such a practice is called a tie-in sale. Higher 
consumption of the complementary good signals higher 
valuations for the good (higher 0, in our model); the sale 
of the complementary product serves as a counting (meter-
ing) device.²² Hence, the manufacturer can use tie-in sales
to practice second-degree price discrimination. This may 
explain why IBM required its customers to purchase 
only IBM punch cards. Xerox practiced a similar policy by 
charging a per copy fee on its copying machines (another 
alternative is to require the tied purchase of Xerox paper, 
if arbitrage can be prevented).²³

To illustrate this within our model, suppose that a 
consumer, if he purchases at all, buys one unit of a good 
produced by a manufacturer and $q$ units of a complemen-
tary good. As before, his utility is $\theta V(q) - T(q)$ if he
purchases and 0 otherwise. $T(q)$ is the two-part tariff
charged by the manufacturer through a tie-in sale. The
producer produces the basic good at cost $c_0$ and the
complementary good at $c$ per unit. The competitive price
for the complementary good is also $c$, as there are many
firms willing to produce the complementary good at that
price. The manufacturer has a monopoly on the market for the
basic good. 

This set-up is similar to that in the above analysis 
of two-part pricing, except that the manufacturer must 
pay a fixed cost $c_0$ per customer served. Thus, if we make
the assumption that both types of consumers are served, 


22. See Bowman 1957 and Burstein 1960a,b. The tied good need not be com-
plementary with the tying good. One can also think of situations in which
the consumption of the tied good brings the monopolist information about the
consumer's willingness to pay for the tying good. On this, see Burstein 1960a,
Adams and Yellen 1976, and the discussion on commodity bundling in the
supplementary section.


23. See also Blackstone's (1975) study of the copying machine industry. SCM,
a copying-machine firm, used a process (electrofax) that required special coated
paper. As long as it had a monopoly position in the manufacture of this paper,
SCM charged high markups on paper (about 200 percent) and low ones on
machines (around 25 percent). Entry into the electrofax-paper industry, however,
led this firm to struggle to keep its discriminatory policy. In particular, SCM
used its monopoly power in the replenisher and service industries (where entry
was slower) to force the customers to use its paper. Replenishers and service
requirements were subsidized contingent on buying the paper. Similarly, SCM
tried to tie the rental of its machines to the use of its paper.

26, The manufacturer's profit is 5; (c)

consumers. The important caveat here is, of course, that
the prohibition of a tie-in sale makes it more likely that
the manufacturer serves only the high-demand consumers.
The result that tie-in sales are detrimental may then be
reversed, as the following exercise will show.


Exercise 3.5** Assume $c_0 = 0$.
(i) Show that the aggregate social surplus when tie-in
sales are prohibited is


$(1 - \lambda) S_2(c)$
if $(1 - \lambda) S_2(c) > S_1(c)$.


(ii) Let $c = 1$, $\theta_1 = 2$, $\theta_2 = 3$, $\lambda = 1/2$, and $D_i(p) = 1 - p/\theta_i$.
Show that prohibiting tie-in sales decreases aggre-
gate welfare.


Exercise 3.6** In the United States, the fast-food chain
Chicken Delight used to tie paper packaging items— 
“buckets” for large orders and “kits” for individual dinners 
—to the use of the franchise name. Can you think of a 
potential explanation for this practice? (Hint: The fran- 
chisees were serving differentiated, exclusive territories.) 


3.3.2 Fully Nonlinear Tariffs and Quantity
Discrimination²⁷


Two-part tariffs are simple and common. However, a 
manufacturer who can fully prevent commodity arbitrage 
(but not personal arbitrage) can increase his profit beyond 
that obtained with the optimal two-part tariff by adopting 
a more complex scheme.

Figure 3.6, which depicts the $(q, T)$ space, explains
why. The straight line represents the optimal two-part
tariff $T(q) = A + pq$. The two types of consumers’ indif-
ference curves are concave, because $V(q)$ is. Because
$\theta_2 > \theta_1$, the type-$\theta_2$ consumers’ indifference curve is steeper
than the type-$\theta_1$ consumers’ when the curves cross.²⁸
Under the optimal two-part tariff, the type-$\theta_1$ consumers
pick the bundle $B_1$ and the type-$\theta_2$ consumers pick $B_2$. By


27. This subsection is more abstract than the rest of the chapter. The reader
may nevertheless want to invest in the techniques developed here and in the
supplementary section, because virtually the same techniques are used in the
theories of optimal taxation, regulation under asymmetric information, labor
contracts, auctions, etc.


28. This condition, familiar in the literature on incentives, is called the

plus (their indifference curve through $B_1$ goes through the
origin), whereas the high-demand consumers have a posi-
tive net surplus. Figure 3.6 also shows the indifference
curves for the monopolist ($T - cq$ = constant). Because
$c < p$, these indifference curves are flatter than the op-
timal two-part tariff.

An important feature of a two-part tariff is that none of
the personal-arbitrage constraints are binding. In particu-
lar, high-demand consumers strictly prefer $B_2$ to $B_1$. In-
deed, when the indifference curve for these consumers is
drawn through $B_1$, any point on or below this curve is
accepted by them. It is then immediately apparent that
the monopolist can increase his profit by still offering
type-$\theta_2$ consumers a bundle under type-$\theta_2$'s indifference
curve through $B_1$ and above the monopolist’s indifference
curve through $B_2$. Actually, the optimal such point is $C_2$,
where the monopolist's indifference curve is tangent to
the type-$\theta_2$ consumers’ indifference curve through $B_1$.
Since the curve for type $\theta_2$ is steeper, it cannot contain the
origin, because type $\theta_1$'s curve does. Thus, type $\theta_2$ makes
a rent.²⁹ The profit from low-demand consumers is un-
changed, and that from high-demand consumers is in-
creased. Hence, the two-part tariff is not optimal.

An interesting feature of the construction of $C_2$ is that
the monopolist’s and the high-demand consumers’ indif-


“sorting condition,” or the “single-crossing condition,” or the “Spence-Mirrlees
condition.” In the present context it makes it possible to separate the
two types of consumers by offering a higher consumption to the type-$\theta_2$
consumers.


29. The type-$\theta_1$ consumers do not want to exercise personal arbitrage by
choosing $C_2$.


ference curves are tangent. Hence, at $C_2$, type $\theta_2$'s con-
sumption is socially optimal: $q_2 = D_2(c)$.

To derive the optimal nonlinear tariff, one must find
two bundles $(q_1, T_1)$ and $(q_2, T_2)$ that maximize the monop-
olist's profit under the constraints that personal arbitrage
does not occur and consumers are willing to buy. This
is done in the supplementary section, which basically
shows that the optimal nonlinear tariff resembles the cou-
ple $(B_1, C_2)$.


Conclusion 1. Low-demand consumers derive no net sur-
plus, while high-demand consumers derive a positive net
surplus.


Conclusion 2. The binding (or relevant) personal-arbitrage
constraint is to prevent high-demand consumers from
buying the low-demand consumers’ bundle.


Conclusion 3. The high-demand consumers purchase the
socially optimal quantity, $q_2 = D_2(c)$,³⁰ while the low-
demand consumers purchase a suboptimal quantity,
$q_1 < D_1(c)$.
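
The three conclusions can be illustrated with a numerical sketch. The code below takes conclusions 1 and 2 as given: the low-demand payment is $T_1 = \theta_1 V(q_1)$ (no net surplus) and the high-demand payment is $T_2 = \theta_2 V(q_2) - (\theta_2 - \theta_1) V(q_1)$ (indifference between the two bundles). It then searches over quantities for the profit-maximizing menu. The parameter values, function names, and grid search are assumptions made for the illustration; this is not the derivation of the supplementary section.

```python
# Hypothetical parameters.
theta1, theta2, lam, c = 2.0, 3.0, 0.8, 1.0


def V(q):
    """V(q) = (1 - (1 - q)^2) / 2, as in the two-part tariff example."""
    return (1.0 - (1.0 - q) ** 2) / 2.0


def menu(q1, q2):
    """Payments implied by conclusions 1 and 2 (used here as assumptions)."""
    T1 = theta1 * V(q1)                               # low type: zero net surplus
    T2 = theta2 * V(q2) - (theta2 - theta1) * V(q1)   # high type: indifferent
    return T1, T2


def profit(q1, q2):
    """Expected profit per consumer from the menu {(q1, T1), (q2, T2)}."""
    T1, T2 = menu(q1, q2)
    return lam * (T1 - c * q1) + (1.0 - lam) * (T2 - c * q2)


# Profit is additively separable in q1 and q2 (and V(0) = 0),
# so each quantity can be searched on its own grid.
steps = 1_000
grid = [k / steps for k in range(steps + 1)]
q1 = max(grid, key=lambda q: profit(q, 0.0))
q2 = max(grid, key=lambda q: profit(0.0, q))
T1, T2 = menu(q1, q2)

print("q1 =", q1, " D1(c) =", 1.0 - c / theta1)
print("q2 =", q2, " D2(c) =", round(1.0 - c / theta2, 4))
print("high type's net surplus:", round(theta2 * V(q2) - T2, 4))
print("low type's payoff from the high bundle:", round(theta1 * V(q2) - T2, 4))
print("profit:", round(profit(q1, q2), 4))
```

In this example the high-demand quantity coincides (up to grid precision) with $D_2(c)$, the low-demand quantity falls short of $D_1(c)$, and the low-demand consumers would lose by choosing the high-demand bundle.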


Conclusion 3 is the economically most significant one.
Relative to the social optimum, the monopolist enlarges
the spectrum of consumption patterns: $(D_1(c), D_2(c))$ becomes
$(q_1 < D_1(c), D_2(c))$. The intuition is as follows. The monop-
olist would like to extract the type-$\theta_2$ consumers’ large
surplus, but this policy faces the threat of personal
arbitrage: A high-demand consumer can consume the
low-demand consumers’ bundle if his own bundle does
not generate enough surplus. To relax this personal-
arbitrage constraint, the monopolist offers a relatively
low consumption to the low-demand consumers. Because
high-demand consumers suffer more from a reduction in
consumption than low-demand ones,³¹ this relaxes the
personal-arbitrage constraint. Hence, the monopolist re- 
duces the quantity consumed by the low-demand con- 
sumers so that the high-demand consumers will be less 
tempted to consume the low-demand consumers’ bundle. 
Conversely, low-demand consumers are not tempted to 
exercise personal arbitrage, and there is no point to intro- 
ducing a distortion in the high-demand consumers’ con- 


30. This conclusion is known in the literature of optimal taxation (Mirrlees
1971, Seade 1977) as “absence of distortion at the top.”


31. This is nothing but the sorting condition alluded to in footnote 28. Con-
sider a small decrease in consumption $\delta q_1 < 0$ for type-$\theta_1$ consumers compen-

sumption (any welfare gains stemming from a move of
the marginal price toward the marginal cost can be cap-
tured by the monopolist through an increase in $T_2$).

The supplementary section develops these ideas in more 
detail and shows how the analysis can be pursued for 
more than two types of consumers. It also draws the link 
between the optimal nonlinear prices employed for second- 
degree price discrimination and the Ramsey prices con- 
sidered in the study of third-degree price discrimination. 


3.3.2.1 Welfare 


The welfare analysis of nonlinear tariffs is ambiguous. 
There is no doubt that the nonlinear tariff chosen by 
a monopolist is socially suboptimal. However, we must
compare this tariff not with the socially optimal one but 
with that which would arise from a given government 
intervention. A much-studied policy intervention consists 
in forcing a monopolist to charge a linear price. (Such a 
limited policy is so eagerly considered because it does not 
require the government to have any information whatso- 
ever concerning the distribution of tastes among con- 
sumers, the firm's cost structure, etc.) The monopolist 
is made worse off by such an intervention because he 
loses flexibility in his pricing policy. The consumers, how- 
ever, do not necessarily gain. Under nonlinear pricing, the 
monopolist could extract some of the high-demand con- 
sumers’ net surplus while still selling to low-demand con- 
sumers. Under linear pricing, the monopolist may stop 
serving the low-demand consumers in order to extract the 
high-demand consumers’ surplus; hence, his output may 
well decrease. 

For more on the welfare analysis of nonlinear tariffs,
see Spence 1977, Roberts 1979, and Katz 1983.


3.3.3 Quality Discrimination 


In subsections 3.3.1 and 3.3.2, the monopolist discrimi-
nated among consumers by offering different quantities 
of the same good at different prices to consumers with 
different tastes for the good. A monopolist can also dis- 
criminate among consumers with different tastes for qual- 


sated by a reduction in price: $\delta T_1 = \theta_1 V'(q_1)\,\delta q_1 < 0$. By construction, this
change leaves type-$\theta_1$ consumers indifferent. However, it hurts type-$\theta_2$ con-
sumers if they consume $q_1$: their utility changes by


$\theta_2 V'(q_1)\,\delta q_1 - \delta T_1 = (\theta_2 - \theta_1) V'(q_1)\,\delta q_1 < 0.$

ity (or service) by offering an array of qualities. For
instance, railroads and airlines offer several classes. Airline 
tickets also differ in ease of cancellation, in existence of a 
waiting list, and in other respects. 

It turns out that quality discrimination is very similar to 
quantity discrimination.³² To see the analogy, consider
the vertical differentiation space studied in chapter 2. 
Consumers have unit demands for a good. The good 
comes in various qualities, indexed by s. Consumers have 
preferences $U = \theta s - p$ (if they purchase), where s is the
quality of the good purchased, p is the price paid, and $\theta$ is
the taste parameter. In general, the price depends on the 
quality: p(s). Suppose that producing one unit of the good 
with quality s costs the monopolist c(s), where c is increas- 
ing and convex. 

By a simple relabeling of variables, this quality model
can be transformed into the previous quantity model. To
this purpose, let $q \equiv c(s)$ denote the cost of quality s. Let
$s = V(q) \equiv c^{-1}(q)$ denote the inverse function, i.e., the
quality obtained for cost q (note that c increasing and
convex implies that V is increasing and concave). The con-
sumers have preferences


$U = \theta V(q) - p(V(q)) = \theta V(q) - \tilde{p}(q)$


(where $\tilde{p}(q) \equiv p(V(q))$) if they purchase. And, by con-
struction, the monopolist's cost function is linear in q
(with the coefficient of proportionality equal to 1). Hence,
at a formal level the two models are identical, and we can
transpose the conclusions of the quantity model to the
quality model. In particular, we have (in the case of two
types of consumers) the following conclusion:


Conclusion 3′ The consumers with a high valuation for
quality (type-$\theta_2$ consumers) purchase the socially optimal
quality; the consumers with a low valuation for quality
(type-$\theta_1$ consumers) purchase a suboptimal quality.³³


In other words, the monopolist enlarges the quality spec- 
trum, as was shown by Mussa and Rosen (1978). (See also 
O'Keefe 1981.) The monopolist uses lower-quality goods 
as a market-segmentation technique. 
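
The relabeling of quality as quantity can be checked numerically. The following is a minimal sketch that assumes a quadratic cost of quality, c(s) = s²/2; this functional form, the parameter values, and the function names are illustrative assumptions and are not taken from the text.

```python
import math

def cost(s):
    """Assumed cost of quality: c(s) = s**2 / 2 (increasing and convex for s >= 0)."""
    return 0.5 * s * s

def V(q):
    """Inverse of the assumed cost function: the quality obtained for cost q."""
    return math.sqrt(2.0 * q)

def utility_quality(theta, s, p):
    """Utility in the quality model: theta * s - p."""
    return theta * s - p

def utility_quantity(theta, q, p_tilde):
    """Utility in the relabeled quantity model: theta * V(q) - p_tilde."""
    return theta * V(q) - p_tilde

# Hypothetical consumer and purchase, chosen only for illustration.
theta, s, price = 1.5, 2.0, 1.0
q = cost(s)  # relabel quality s as the "quantity" q = c(s)

# Both formulations assign the same utility to the same purchase.
same = math.isclose(utility_quality(theta, s, price),
                    utility_quantity(theta, q, price))

# V is concave: equal increments in q yield shrinking increments in quality.
concave = (V(2.0) - V(1.0)) < (V(1.0) - V(0.0))
```

Under this assumed cost function, the monopolist's cost of supplying "quantity" q is q itself, which is the linear cost structure of the quantity model.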

For a real-world example of such a behavior, consider
Dupuit's (1849) discussion of railroad tariffs for passenger
traffic:

It is not because of the few thousand francs which would have 
to be spent to put a roof over the third-class carriages or to 
upholster the third-class seats that some company or other has 
open carriages with wooden benches... What the company is 
trying to do is prevent the passengers who can pay the second- 
class fare from traveling third-class; it hits the poor, not be- 
cause it wants to hurt them, but to frighten the rich... And it 
is again for the same reason that the companies, having proved 
almost cruel to third-class passengers and mean to second-class 
ones, become lavish in dealing with first-class passengers. Hav- 
ing refused the poor what is necessary, they give the rich what 
is superfluous.³⁴

(Note, however, that the optimal policy is to give the rich 
not what is superfluous but the socially optimal quality. 
The high quality may, however, seem exaggerated in 
comparison with the other, suboptimal qualities.) 


3.3.3.1 Application 1: Discrimination in Insurance
Policies


In the insurance market, the type of a consumer refers 
to the probability that the consumer has an “accident” 
(is robbed, etc.). The quality of service corresponds to 
the amount of reimbursement in case of accident. If con- 
sumers are divided into two classes (high and low prob- 
ability of accident), it can be shown that the high-risk 
consumers value the reimbursement more than the low- 
risk consumers (in the same sense that type-$\theta_2$ consumers
value quantity or quality more than type-$\theta_1$ consumers in
the previous interpretations). The analogue of conclusion 
3 for the insurance market is the following: A risk-neutral 
monopoly insurance company optimally discriminates 
between the consumers by giving full insurance (the so- 
cially optimal policy) to high-risk consumers and sub- 
optimal insurance (i.e., reimbursing less than the damage) 
to low-risk consumers. Again, introducing a distortion
for type-$\theta_1$ (low-risk) consumers is meant to make type-$\theta_2$
(high-risk) consumers less tempted to choose the
“wrong” insurance policy (Rothschild and Stiglitz 1976; 
Stiglitz 1977; Wilson 1977). 


32. This subsection is inspired by part of Maskin and Riley's (1984) analysis.


33. The socially optimal qualities $s_1$ and $s_2$ are given by $c'(s_1) = \theta_1$ and
$c'(s_2) = \theta_2$.

34. Quoted by Ekelund (1970, p. 275) and Phlips (1983, p. 216).

Optimal discrimination in the insurance market is
studied in the supplementary section. 


3.3.3.2 Application 2: Discrimination through Waiting 
Time or Price Dispersion 


Sometimes the variable s refers to some disamenity asso- 
ciated with the purchase of the good. For instance, con- 
sumers usually do not like to travel to faraway stores, to 
bargain at length, or to buy from stores with few services. 
Or, they incur a cost searching for the lowest price for a 
given good. In more general terms, the purchased good is 
tied to another good (a “bad,” rather), which represents 
the time spent (or, more generally, the disamenities in- 
curred) in obtaining the good. 

At first, it may seem that a firm ought to reduce the 
costs incurred in purchasing its goods as much as is pos- 
sible in the context of its own cost structure. This is 
because cost savings for the consumer can be appropriated 
by the firm through a higher price for the good. For 
instance, if the firm incurs no cost in charging a uniform 
price in different locations, it might as well do so in order 
to lower consumers’ search cost. This makes consumers 
more willing to purchase the product. Similarly, con- 
sumers are willing to pay more for the convenience of 
nearby outlets and quick service. 

The previous reasoning, however, implicitly assumes 
that consumers are homogeneous. Under consumer het- 
erogeneity, the monopolist may use the consumption of 
the bad as a signal of the consumer's willingness to pay 
for the good (valuation, demand).³⁵ For this to work, the
consumers with the highest willingness to pay for the
good (which the firm tries to determine) must also be the
ones with the highest distaste for the bad.³⁶ If this is the
case, the firm may offer several bundles. A high (low)
price for the good is associated with a low (high) con- 


35. The monopolist would generally prefer to bundle a positive amenity that
is related to the consumer's willingness to pay, but such an amenity may not
exist.


36. For instance, Chiang and Spatt (1982) suppose that the consumer, who has
unit demands, has utility $v(\theta) - \theta t - p$, where v is the valuation for the good,
t is the waiting time ($s = -t$ in our previous notation), and p is the price of the
good. The assumption is that $v'(\theta) > 0$.


37. Here the search cost represents the cost of sampling store prices and, more
broadly, the costs of consulting with friends and sales personnel and reading
Consumer Reports and newspaper advertisements.


38. Since Diamond's (1971) pathbreaking work on the integration of individual

sumption of the bad. The high consumption of the bad for
a low price for the good is meant to prevent the high- 
demand consumers from exercising personal arbitrage, in 
the same way that Dupuit’s railroad monopolist refused 
to put a roof over the third-class carriages in order to 
frighten the rich. 

Salop (1977) presents a model in which consumers
differ in search cost³⁷ as well as in demand for the good
and shows that discrimination through price dispersion
may be a profitable strategy for a monopolist.³⁸ Similar in
spirit is the model of Chiang and Spatt (1982), whose
monopolist discriminates through waiting times.

On the welfare side, the policy of tying more of the 
bad to the good than is socially optimal need not be 
detrimental to welfare if the alternative is to force the 
monopolist to treat all consumers uniformly. As in the 
quantity and quality models, discrimination allows the 
monopolist to appropriate some of the surplus of high- 
demand consumers while still serving low-demand con- 
sumers. Preventing discrimination may eliminate some 
excessive consumption of the bad (i.e., may lead the
monopolist to put a roof on the third-class carriages), but 
may also lead the monopolist to stop serving low-demand 
consumers (discontinue third-class services). Hence, the 
welfare effects of the prohibition of discrimination are 
ambiguous. Indeed, Chiang and Spatt (1982) show that 
waiting-time discrimination by a monopolist can be Pareto 
superior (preferred by all consumers and the firm) to the 
situation in which the monopolist is forced to charge a 
uniform price to consumers and, therefore, offers the same 
waiting time to all consumers. 

The above analysis is a little inexact in that it ignores 
the issue of the individual rationality constraint—i.e., the 
fact that the monopolist cannot coerce the consumers to 
buy if they will receive a negative net surplus from his 


search with market equilibrium, a sizeable literature has formalized the link
between price dispersion and consumer or producer heterogeneity. Reinganum
(1979) generates price dispersion from differences in production costs among
firms. Price dispersion can also be obtained when consumers have different
search costs (Salop 1977; Salop and Stiglitz 1982; Axell 1977; von zur Muehlen
1980; Rob 1985; Stiglitz 1985), receive a different number of price offers
(Butters 1977; Burdett and Judd 1983), or have different valuations for the good
(Diamond 1987). Rosenthal (1980), Shilony (1977), and Varian (1980) look
at price dispersion in oligopoly with heterogeneous consumers; Png and
Hirshleifer (1987) also allow price matching. In Bénabou 1986 the price
dispersion results from the inflationary environment and from costly price
adjustments (under consumer homogeneity or heterogeneity).

product. This may create problems. In the quantity and
quality models, it suffices to impose this condition for 
the lowest-demand consumers. It is then automatically 
satisfied for consumers with a higher demand, because 
they derive a higher net surplus than the low-demand 
consumers from a given consumption bundle and they 
can exercise personal arbitrage. However, in the case 
of waiting-time discrimination or search discrimination a 
consumer with a higher willingness to pay also has a 
higher distaste for the bad. Hence, it is not clear that the 
high-demand consumer derives more net surplus from a 
given bundle than a low-demand consumer. This makes 
the analysis more complex. 


3.4 Concluding Remarks 


Both second-degree and third-degree price discrimina- 
tions are widespread. Their importance has triggered a 
vast and useful literature in economics. Much work re- 
mains to be done, however. 

Although we have assumed a single firm, most price 
discrimination takes place in oligopolistic markets. As 
long as the competitors are differentiated along some 
dimension (brand loyalty, location, consumer information, 
etc.), each competitor is tempted to price-discriminate
on his residual-demand curve. It is then interesting to 
study the competitive aspects of price-discrimination 
instruments, such as sales, coupons, frequent-flyer pro- 
grams, bundling, and offers to match one’s rival's price. 
The extension of the theory toward oligopolistic markets 
is thus an important area of research.³⁹

Furthermore, in our study of second-degree price dis- 
crimination we have assumed that the monopolist dis- 
criminates along a single dimension (either quantity or 
quality). In many interesting situations (e.g., computer sys-
tems, WATS lines, industrial equipment), the consumer 
chooses both a quantity variable and a quality variable 
(for instance, the time pattern of usage of the units pur- 
chased). Panzar and Sibley (1978) and Oren, Smith, and 


39. For pioneering work in this area, see Borenstein 1985; Champsaur and
Rochet 1986; Katz 1984a,b; Oren, Smith, and Wilson 1983; Thisse and Vives
1986.


40. Panzar and Sibley assume that the tariff is linear in both capacity and usage


Wilson (1985) have studied the issue of pricing for capac-
ity and usage.⁴⁰

Another important assumption of our study of second- 
degree price discrimination is the independence of con- 
sumers’ demands from those of other consumers. The 
quality offered to a consumer may, however, depend on
the choices of other consumers. This is the case when 
service is ranked by priorities of access because of limited 
capacity (as in the telephone, electric power, and air travel 
industries). Suppose that an electric utility, with a fixed 
capacity for a given period of time, faces uncertain de- 
mand or supply. During peak load, electricity must be 
rationed to some consumers. Random rationing gener- 
ally is not efficient, as some consumers suffer more than 
others from the interruption of service. A spot market, 
on the other hand, would allocate the limited capacity
efficiently among consumers in each state of nature, as 
those who would pay the spot price are those who suffer 
most from an interruption. Such a spot market, however, 
is generally infeasible because of the high transaction 
costs of setting it up. As an imperfect substitute for spot 
markets, the firm may offer priority of service, i.e., prob- 
ability of being interrupted. In practice, this takes the 
form of a few priority ranks. Here the interdependence of 
demands is apparent: if all consumers choose the top 
priority, no one has any priority. Thus, the firm must 
adequately compute the tariff for each priority class to 
match capacity and demand. Wilson (1986) makes impor- 
tant progress in this direction in his study of priority 
servicing by a monopolist, an oligopolist, and a social 
planner.⁴¹


(a generalization of the two-part tariff studied in subsection 3.3.1); Oren et al.
consider general nonlinear tariffs.


41. For instance, Wilson concludes that in many cases priority servicing
allows a social planner to realize most of the efficiency gains attributed to spot
markets. See also Reitman 1986 and the references in these two papers.
3.5 Supplementary Section: Nonlinear Pricing


This supplementary section develops the study of non- 
linear tariffs begun in subsections 3.3.2 and 3.3.3. We 
derive explicitly the optimal tariff and study whether 
this tariff exhibits quantity (or quality) discounts. The 
first part of this supplementary section emphasizes tech- 
niques that can be applied to a variety of problems in 
economics, including topics relevant to industrial organi-
zation, such as the theories of optimal regulation⁴² and
optimal auctions.⁴³


3.5.1 Nonlinear Prices⁴⁴


Let us take up the model of subsection 3.3.1. A firm
produces a single good.⁴⁵ Consumers receive utility $\theta V(q) - T(q)$
if they purchase a quantity q and utility 0 other-
wise. The unit cost of producing the good is constant and
equal to c. We consider first the case in which $\theta$ takes two
values and second the case in which $\theta$ takes a continuum
of values in an interval.


3.5.1.1 The Two-Type Case 


This is the case considered in the text above. A monopo-
list offers two bundles: $(q_1, T_1)$, which is directed at type-$\theta_1$
consumers (in proportion $\lambda$), and $(q_2, T_2)$, which is
directed at type-$\theta_2$ consumers (in proportion $1 - \lambda$). It is
assumed that the monopolist serves both types of con-
sumers (which will occur if $\lambda$ is "sufficiently large").

The monopolist's profit is


$\Pi^m = \lambda (T_1 - c q_1) + (1 - \lambda)(T_2 - c q_2).$


The monopolist faces two kinds of constraints. The con- 
straints of the first kind require that consumers be willing 
to purchase. (These are “individual rationality constraints,” 
in the jargon of the incentives literature.) In particular, 


42. See, e.g., Baron and Myerson 1982, Sappington 1982, and Laffont and
Tirole 1986.


43. See, e.g., Maskin and Riley 1980, Milgrom and Weber 1982, Myerson
1979, and Riley and Samuelson 1981.


44. For expositions of optimal nonlinear pricing, see Goldman, Leland, and
Sibley 1984; Maskin and Riley 1984; Oren, Smith, and Wilson 1984.


45. Nonlinear pricing for a multiproduct firm (but still assuming a one-dimen-

the net surplus of the low-demand consumers must be
nonnegative:


$\theta_1 V(q_1) - T_1 \ge 0.$ (3.3)


If this condition is satisfied, the high-demand consumers
are automatically willing to purchase (because they can
choose to buy $q_1$ at price $T_1$ and get net surplus
$\theta_2 V(q_1) - T_1 \ge 0$). The constraints of the second kind require
that the consumers not exercise personal arbitrage. (These
are known as "incentive-compatibility constraints.") In
particular, the high-demand consumers should not want
to consume the low-demand consumers' bundle:


$\theta_2 V(q_2) - T_2 \ge \theta_2 V(q_1) - T_1.$ (3.4)


Our previous analysis strongly suggests that the other 
incentive-compatibility constraint is not relevant—the 
idea is to induce high-demand consumers to “reveal” that 
they have a high demand, not the reverse. Thus, we will 
ignore the second incentive-compatibility constraint and 
later check that it is indeed satisfied in the solution of 
the “subconstrained problem.” (See figure 3.6 for the 
intuition.) 

Thus, the monopolist maximizes $\Pi^m$ subject to the
constraints 3.3 and 3.4. Because the monopolist benefits
from high prices, constraint 3.3 implies that $T_1 = \theta_1 V(q_1)$.
Constraint 3.4 then implies that


$T_2 = \theta_2 V(q_2) - \theta_2 V(q_1) + T_1 = \theta_2 V(q_2) - (\theta_2 - \theta_1) V(q_1).$


Notice the economic content of these equations. The
transfer $T_1$ can be chosen so as to appropriate the type-$\theta_1$
consumers' surplus entirely. $T_2$ must leave some net sur-
plus to the type-$\theta_2$ consumers, because they can always
buy the bundle $(q_1, T_1)$ and have net surplus


$\theta_2 V(q_1) - T_1 = (\theta_2 - \theta_1) V(q_1).$


sional parameter for the consumer's type) is considered in Mirman and Sibley
1980 and in Oren et al. 1982. For an exposition of techniques involved in the
study of discrimination with several goods, see Guesnerie and Laffont 1984. For
an examination of the quality-discrimination problem when the monopoly can
also offer warranties, see Matthews and Moore 1987.

The study of multidimensional type spaces is somewhat complex. For a start
on this in related models, see Laffont et al. 1982, Kohlleppel 1983, Quinzii and
Rochet 1985, and Engers 1987.

Substituting into the objective function, the monopolist
solves the following unconstrained problem:


$\max_{\{q_1, q_2\}} \{ \lambda [\theta_1 V(q_1) - c q_1] + (1 - \lambda) [\theta_2 V(q_2) - c q_2 - (\theta_2 - \theta_1) V(q_1)] \}.$


The first-order conditions are


$\theta_1 V'(q_1) = \dfrac{c}{1 - \dfrac{1 - \lambda}{\lambda}\,\dfrac{\theta_2 - \theta_1}{\theta_1}}$ (3.5)

and

$\theta_2 V'(q_2) = c.$ (3.6)


From equation 3.6, the quantity purchased by the high-
demand consumers is socially optimal (the marginal util-
ity of consumption of the good is equal to the marginal
cost). From equation 3.5 and the assumption that the
monopolist serves both types of consumers, the quantity
purchased by the low-demand consumers is suboptimal
($\theta_1 V'(q_1) > c$). (These two properties also imply that $q_2 > q_1$.)
This proves conclusion 3 in the text.
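
The two-type solution can be illustrated numerically. The sketch below assumes the gross surplus function V(q) = (1 − (1 − q)²)/2, so that V′(q) = 1 − q; this functional form, the parameter values, and the function names are assumptions made for illustration only. It solves the first-order conditions 3.5 and 3.6 in closed form and recovers the transfers from the two binding constraints.

```python
def V(q):
    """Assumed gross surplus: V(q) = (1 - (1 - q)**2) / 2, so V'(q) = 1 - q."""
    return 0.5 * (1.0 - (1.0 - q) ** 2)

def solve_two_types(theta1, theta2, lam, c):
    """Solve first-order conditions 3.5 and 3.6 for the assumed V.

    lam is the proportion of type-theta1 (low-demand) consumers.
    """
    # Denominator on the right-hand side of equation 3.5.
    denom = 1.0 - ((1.0 - lam) / lam) * (theta2 - theta1) / theta1
    if denom <= 0.0 or c / denom >= theta1:
        raise ValueError("low-demand consumers are not served for these parameters")
    q1 = 1.0 - c / (theta1 * denom)   # from theta1 * V'(q1) = c / denom
    q2 = 1.0 - c / theta2             # from theta2 * V'(q2) = c
    T1 = theta1 * V(q1)               # constraint 3.3 binds
    T2 = theta2 * V(q2) - (theta2 - theta1) * V(q1)  # constraint 3.4 binds
    return q1, T1, q2, T2

# Hypothetical parameter values.
theta1, theta2, lam, c = 1.0, 1.5, 0.75, 0.2
q1, T1, q2, T2 = solve_two_types(theta1, theta2, lam, c)
q1_first_best = 1.0 - c / theta1      # solves theta1 * V'(q) = c

print(f"q1 = {q1:.4f} (first best {q1_first_best:.4f}), T1 = {T1:.4f}")
print(f"q2 = {q2:.4f}, T2 = {T2:.4f}")
```

With these assumed values the low-demand quantity lies below its first-best level, while the high-demand quantity satisfies the efficiency condition exactly. The function raises an error when the proportion of low-demand consumers is too small for them to be served.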

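Last, let us check that the low-demand consumers do
not want to choose the high-demand consumers' bundle.
Because they do not realize a net surplus, we require that


$0 \ge \theta_1 V(q_2) - T_2.$

But, using $T_2 = \theta_2 V(q_2) - (\theta_2 - \theta_1) V(q_1)$, this condition is equivalent to

$0 \ge -(\theta_2 - \theta_1)[V(q_2) - V(q_1)],$

which is satisfied because $\theta_2 > \theta_1$ and $q_2 > q_1$.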

Upward-Binding Individual Rationality Constraints 


A monopolist faces two types of constraints: The con- 
sumer can exert personal arbitrage among various bun- 
dles offered by the firm (incentive compatibility), and he 
can refuse to buy from the firm (individual rationality). 
We assumed that if the consumer does not buy from the 
firm, he does not buy at all. This type of individual ra-
tionality is downward binding, in that consumers with a
low valuation for the good (low $\theta$) are more tempted not
to buy. A similar situation would generally occur if there
existed an inferior substitute. Sometimes the substitute
may be superior. For instance, it could, in the context
of the previous model, yield utility $k\theta V(\tilde{q})$, where $\tilde{q}$
is the consumption of this substitute (mutually incom-
patible with the consumption of the monopolist's good)
and $k > 1$. Assume further that this substitute is sold
competitively at marginal cost $\tilde{c}$ (where $\tilde{c} > kc$). In this
situation, the individual rationality constraint facing the
monopolist may be "upward binding." The problem may
be to prevent the high-valuation consumers from buying
the "superior" good.⁴⁶ If this is the case, the spectrum of
quantities may again be enlarged relative to the first-best
(perfect-discrimination) case, but in the opposite way:
The consumption of the high-valuation consumers may
exceed the socially optimal one. For more on this, see
Champsaur and Rochet 1986.⁴⁷


3.5.1.2 The Continuum-of-Types Case 


To further develop the techniques of nonlinear pricing
and obtain a few additional results, let us assume that the
taste parameter $\theta$ is distributed across the population of
consumers according to the density $f(\theta)$ (with cumulative
distribution function $F(\theta)$) on an interval $[\underline{\theta}, \bar{\theta}]$ (where
$0 \le \underline{\theta} < \bar{\theta}$).

The monopolist offers a nonlinear tariff, T(q). A con-
sumer with taste parameter $\theta$ purchases $q(\theta)$ units and
pays $T(q(\theta))$. The monopolist's profit is then


$\Pi^m = \int_{\underline{\theta}}^{\bar{\theta}} [T(q(\theta)) - c\,q(\theta)]\, f(\theta)\, d\theta.$


The monopolist maximizes this profit subject to two 
constraints. 

First, all consumers must be willing to purchase: For
all $\theta$,


$\theta V(q(\theta)) - T(q(\theta)) \ge 0.$ (3.7)


(The monopolist may want to exclude some consumers 


46. The constraint can then be written
$\theta_2 V(q_2) - T_2 \ge \tilde{U}(\theta_2) \equiv \max_{\tilde{q}} \{ k\theta_2 V(\tilde{q}) - \tilde{c}\tilde{q} \}.$

47. An alternative application of this idea would pertain to vertical integration.
If the customer were a downstream firm, he would have the choice between
buying one of the upstream monopolist's bundles and producing the inter-
mediate good himself (i.e., vertically integrating). In the latter case, he would
pay a fixed investment cost K to have access to the technology of production
at marginal cost $\tilde{c}$ (and k might be equal to 1 in this context). The high-demand
customers would have a higher incentive to vertically integrate (as in Katz's
model of third-degree price discrimination mentioned in the text), so the
individual-rationality constraint might be upward binding.

from consumption; at a formal level, this can be repre-
sented by $q(\theta) = T(q(\theta)) = 0$ for these consumers.) As in
the two-type case, it actually suffices to require that the
individual-rationality constraint 3.7 hold for the lowest-
demand consumer:


$\underline{\theta} V(q(\underline{\theta})) - T(q(\underline{\theta})) \ge 0.$ (3.8)


If constraint 3.8 is satisfied, a type-$\theta$ consumer also
realizes a non-negative net surplus, because he can always
choose to consume the type-$\underline{\theta}$ consumer's bundle and
obtain utility


$\theta V(q(\underline{\theta})) - T(q(\underline{\theta})) \ge (\theta - \underline{\theta}) V(q(\underline{\theta})) \ge 0.$


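Second, a type-$\theta$ consumer must not choose the bun-
dle chosen by type-$\tilde{\theta}$ consumers (where $\tilde{\theta} \ne \theta$). The
incentive-compatibility constraints are, for all $\theta$ and $\tilde{\theta}$,


$U(\theta) \equiv \theta V(q(\theta)) - T(q(\theta)) \ge \theta V(q(\tilde{\theta})) - T(q(\tilde{\theta})).$ (3.9)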


The constraints represented by 3.9 are not very tractable.
Fortunately, in this problem it will suffice to require that
the incentive constraints are satisfied "locally," i.e., that
for $\tilde{\theta} = \theta - d\theta$ close to $\theta$ (see footnote 52)


$\theta V(q(\theta)) - T(q(\theta)) \ge \theta V(q(\theta - d\theta)) - T(q(\theta - d\theta)).$


Assuming that $q(\cdot)$ and $T(\cdot)$ are strictly increasing and
differentiable,⁴⁸ we obtain for all $\theta$


$\theta V'(q(\theta)) - T'(q(\theta)) = 0,$ (3.10)


which expresses the fact that a small increase in the quan-
tity consumed by the type-$\theta$ consumer generates a mar-
ginal surplus $\theta V'(q(\theta))$ equal to the marginal payment
$T'(q(\theta))$. Hence, the consumer does not want to modify
the quantity at the margin. Equation 3.10 can be used to
obtain the payment function once the quantity function
$q(\theta)$ is known: Assuming that the optimal $q(\theta)$ is strictly
monotonic in $\theta$ (which will be derived shortly), one has


$T'(q) = \chi(q) V'(q),$ (3.11)


where $\chi(\cdot)$ is the inverse of the quantity function, i.e.,
$\chi(q)$ is the type that consumes quantity q ($\chi(q(\theta)) = \theta$).

48. It can indeed be shown that these two functions necessarily are differen-
tiable almost everywhere: From equation 3.9, q must be nondecreasing in $\theta$,
using the same proof as the one in chapter 1 showing that the monopoly price
increases with marginal cost (T must obviously be nondecreasing in q; otherwise
some bundles would not be chosen); and a monotonic function is almost
everywhere differentiable.

To derive the optimal quantity function, it is actually
convenient to use a perhaps less natural representation of
the incentive constraint.⁴⁹ Let $U(\theta)$ denote (as before) the
utility, or net surplus, of the type-$\theta$ consumer. From the
incentive-compatibility constraint,


$U(\theta) = \theta V(q(\theta)) - T(q(\theta)) = \max_{\tilde{\theta}} \{ \theta V(q(\tilde{\theta})) - T(q(\tilde{\theta})) \}.$


From the envelope theorem, the derivative of U with
respect to $\theta$ takes into account only the direct effect of $\theta$,
and not the indirect effect stemming from the adjustment
in quantity:


$\dfrac{dU}{d\theta} \equiv U'(\theta) = V(q(\theta)).$ (3.12)

Integrating equation 3.12, we can express the utility of
the type-$\theta$ consumer as⁵⁰


$U(\theta) = \int_{\underline{\theta}}^{\theta} V(q(u))\, du + U(\underline{\theta}) = \int_{\underline{\theta}}^{\theta} V(q(u))\, du,$ (3.13)


where use is made of the individual rationality constraint
($U(\underline{\theta}) = 0$).
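
Equations 3.12 and 3.13 can be verified numerically for a given tariff. The sketch below is only an illustration: the surplus function V(q) = (1 − (1 − q)²)/2 and the tariff T(q) = 0.6q − 0.2q² are hypothetical choices, not taken from the text, and the tariff is not claimed to be optimal. Under them the consumer's first-order condition θ(1 − q) = 0.6 − 0.4q gives q(θ) = (θ − 0.6)/(θ − 0.4), so the lowest type that buys is θ = 0.6, with zero net surplus.

```python
def V(q):
    """Assumed gross surplus: V(q) = (1 - (1 - q)**2) / 2."""
    return 0.5 * (1.0 - (1.0 - q) ** 2)

def T(q):
    """Hypothetical tariff, chosen only for illustration."""
    return 0.6 * q - 0.2 * q * q

def q_of(theta):
    """Quantity solving theta * V'(q) = T'(q), i.e. theta*(1 - q) = 0.6 - 0.4*q.

    Intended for theta > 0.4; types between 0.4 and 0.6 buy nothing.
    """
    return max(0.0, (theta - 0.6) / (theta - 0.4))

def U(theta):
    """Net surplus of a type-theta consumer at his chosen quantity."""
    q = q_of(theta)
    return theta * V(q) - T(q)

def integral_of_V(theta_low, theta, n=2000):
    """Trapezoidal approximation of the integral of V(q(u)) du on [theta_low, theta]."""
    h = (theta - theta_low) / n
    total = 0.5 * (V(q_of(theta_low)) + V(q_of(theta)))
    for i in range(1, n):
        total += V(q_of(theta_low + i * h))
    return total * h

theta_low, theta = 0.6, 1.0

# Equation 3.13: utility equals the integral of V(q(u)) plus the lowest type's utility.
lhs = U(theta)
rhs = integral_of_V(theta_low, theta) + U(theta_low)

# Equation 3.12: the slope of U equals V(q(theta)); central finite difference.
eps = 1e-6
slope = (U(theta + eps) - U(theta - eps)) / (2.0 * eps)
```

The check uses a tariff that need not be profit maximizing because the envelope argument relies only on the consumer's own optimization.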

The fact that the consumer's utility as a function of $\theta$
grows at a rate that increases with $q(\theta)$ will turn out to be
crucial. Higher quantities "differentiate" different types
more, in that the utility differentials are higher. Because
leaving a surplus to the consumer is costly to the monop-
olist (recall that the transfer $T(q(\theta))$ is equal to $\theta V(q(\theta)) - U(\theta)$),
the monopolist will tend to reduce U and, to
do so, to induce the consumer to consume a suboptimal
quantity. Equation 3.13 suggests that it is more desirable
to reduce the quantity (relative to the socially optimal
quantity defined by $\theta V'(q(\theta)) = c$) for low-$\theta$ con-
sumers (an increase of $\delta q > 0$ in the quantity purchased
by a type-$\theta$ consumer raises the utility of all types $\theta' > \theta$
by $V'(q(\theta))\,\delta q > 0$ but does not affect the utility of types
$\theta' < \theta$). Hence, we expect the low-demand consumers
to consume much less than their socially optimal quan-
tity and the highest-demand consumer ($\theta = \bar{\theta}$) to con-
sume exactly his socially optimal quantity, a conclusion

49. This trick was first used by Mirrlees (1971).

50. Equation 3.13 is the analogue of the equation

$U(\theta_2) = (\theta_2 - \theta_1) V(q_1)$


in the two-type case.

analogous to conclusion 3 in the text. If this is indeed so,
the monopolist enlarges the quantity spectrum toward
low quantities (i.e., low qualities in the quality interpreta-
tion). Let us now prove these results formally.

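Since $T(q(\theta)) = \theta V(q(\theta)) - U(\theta)$, the monopolist's pro-
fit can be written as


$\Pi^m = \int_{\underline{\theta}}^{\bar{\theta}} \left( \theta V(q(\theta)) - \int_{\underline{\theta}}^{\theta} V(q(u))\, du - c\,q(\theta) \right) f(\theta)\, d\theta.$


Integrating by parts⁵¹ yields


$\Pi^m = \int_{\underline{\theta}}^{\bar{\theta}} \{ [\theta V(q(\theta)) - c\,q(\theta)]\, f(\theta) - V(q(\theta)) [1 - F(\theta)] \}\, d\theta.$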


The maximization of $\Pi^m$ with respect to the schedule $q(\cdot)$
requires that the term under the integral be maximized
with respect to $q(\theta)$ for all $\theta$:


$\theta V'(q(\theta)) = c + \dfrac{1 - F(\theta)}{f(\theta)}\, V'(q(\theta)).$ (3.14)


We thus obtain the conclusion that the marginal willing-
ness to pay for the good exceeds the marginal cost, ex-
cept for the highest-demand consumer ($\theta = \bar{\theta}$). Hence,
the monopolist induces consumers to purchase a sub-
optimal quantity.⁵²
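
As an illustration of equation 3.14, the following sketch computes the monopoly quantity schedule under assumptions that are not in the text: types uniformly distributed on [1, 2], V′(q) = 1 − q, and marginal cost c = 0.4. The function names are hypothetical. Equation 3.14 can then be solved in closed form and compared with the efficient quantity defined by θV′(q) = c.

```python
THETA_LOW, THETA_HIGH = 1.0, 2.0   # assumed support of the uniform distribution
C = 0.4                            # assumed constant marginal cost

def F(theta):
    """Uniform cumulative distribution function on [THETA_LOW, THETA_HIGH]."""
    return (theta - THETA_LOW) / (THETA_HIGH - THETA_LOW)

def f(theta):
    """Uniform density on [THETA_LOW, THETA_HIGH]."""
    return 1.0 / (THETA_HIGH - THETA_LOW)

def adjusted_type(theta):
    """theta - (1 - F(theta)) / f(theta): the coefficient on V'(q) once 3.14 is rearranged."""
    return theta - (1.0 - F(theta)) / f(theta)

def q_monopoly(theta):
    """Quantity solving equation 3.14 when V'(q) = 1 - q; zero if the type is excluded."""
    a = adjusted_type(theta)
    if a <= C:
        return 0.0
    return 1.0 - C / a

def q_efficient(theta):
    """Quantity solving theta * V'(q) = c when V'(q) = 1 - q."""
    return max(0.0, 1.0 - C / theta)

for theta in (1.0, 1.25, 1.5, 1.75, 2.0):
    print(theta, round(q_monopoly(theta), 4), round(q_efficient(theta), 4))
```

In this assumed example the two schedules coincide only at the top of the distribution, the distortion is larger for lower types, and the lowest types are excluded altogether.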

Next, for this type of consumer preference, one can get 
a simple expression for the “price-cost margin.” Let T'(q) 


51. In this integration by parts, it is convenient to take $-[1 - F(\theta)]$ as the
integral of $f(\theta)$.


52. We represented the incentive-compatibility constraints by the first-order
condition 3.10. To be certain that $q(\theta)$ is indeed the optimal choice of a type-$\theta$
consumer, we must check the second-order condition associated with the con-
sumer's optimization over quantities, both locally and globally. Let $U(\theta, \tilde{\theta})$
denote the utility of a consumer with taste $\theta$ when he consumes the quantity of
a consumer with taste $\tilde{\theta}$:


$U(\theta, \tilde{\theta}) \equiv \theta V(q(\tilde{\theta})) - T(q(\tilde{\theta})).$

The first-order condition is, for all $\theta$,

$U_2(\theta, \theta) = 0,$


where subscripts denote partial derivatives; in other words, choosing $q(\theta)$ is
optimal for a type-$\theta$ consumer. Differentiating the first-order condition with
respect to $\theta$ gives


$U_{22}(\theta, \theta) = -U_{12}(\theta, \theta).$

The local second-order condition is thus equivalent to


$U_{12}(\theta, \theta) \ge 0.$
$\equiv p(q)$ denote the price of an extra unit when the con-
sumer already consumes q units. We know from consumer
optimization that


$T'(q(\theta)) = \theta V'(q(\theta)).$

Substituting into equation 3.14, we get


$\dfrac{p - c}{p} = \dfrac{1 - F(\theta)}{\theta f(\theta)},$ (3.15)


where $p = p(q(\theta))$.


An assumption commonly made in the literature is that
the "hazard rate" of the distribution of types


$\dfrac{f(\theta)}{1 - F(\theta)}$


increases with $\theta$.⁵³ This property is satisfied by many
distributions, including the uniform, the normal, the Pareto,
the logistic, the exponential, and any distribution with
nondecreasing density. On the assumption that the hazard
rate is increasing,

$\theta - \dfrac{1 - F(\theta)}{f(\theta)}$

increases with $\theta$. From equation 3.14 and the fact that V
is concave, $q(\theta)$ increases with $\theta$. Furthermore,


$\dfrac{1 - F(\theta)}{\theta f(\theta)}$


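But


$U_{12}(\theta, \tilde{\theta}) = V'(q(\tilde{\theta}))\,\dfrac{dq}{d\tilde{\theta}} \ge 0$, as $\dfrac{dq}{d\tilde{\theta}} \ge 0.$


To check the global second-order condition, suppose that

$U(\theta_1, \theta_2) > U(\theta_1, \theta_1)$

for some $\theta_1$ and $\theta_2$. This implies

$\int_{\theta_1}^{\theta_2} U_2(\theta_1, x)\, dx > 0.$

Suppose, for instance, that $\theta_2 > \theta_1$. Because $U_{12}(\theta, \tilde{\theta}) \ge 0$, we have

$U_2(\theta_1, x) \le U_2(x, x) = 0$


for $x \ge \theta_1$ (where use is made of the first-order condition). We thus obtain a
contradiction. And similarly for $\theta_2 < \theta_1$.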


53. To understand why this is called the hazard rate, suppose that one moves
along the $\theta$ axis from $\underline{\theta}$ toward $\bar{\theta}$ and eliminates types that are "passed by."
Arriving at $\theta$ and moving by $d\theta$ to the right, one finds that the conditional
probability that the consumer's type belongs to $[\theta, \theta + d\theta]$ and is thus elimi-
nated is $f(\theta)\,d\theta / [1 - F(\theta)]$.

decreases with $\theta$, so that the price-cost margin decreases with
the consumer's type and, therefore, decreases with output.
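
The behavior of the hazard rate and of the price-cost margin can be checked in a simple case. The sketch below assumes, for illustration only, types uniformly distributed on [1, 2], V′(q) = 1 − q, and marginal cost c = 0.4; none of these choices comes from the text. It computes the margin in two ways: from the right-hand side of equation 3.15, and directly from the marginal price p = θV′(q(θ)) implied by equation 3.14.

```python
THETA_LOW, THETA_HIGH = 1.0, 2.0   # assumed support of the uniform distribution
C = 0.4                            # assumed constant marginal cost

def F(theta):
    """Uniform cumulative distribution function."""
    return (theta - THETA_LOW) / (THETA_HIGH - THETA_LOW)

def f(theta):
    """Uniform density."""
    return 1.0 / (THETA_HIGH - THETA_LOW)

def hazard_rate(theta):
    """f / (1 - F); defined for theta strictly below THETA_HIGH."""
    return f(theta) / (1.0 - F(theta))

def margin_from_315(theta):
    """Right-hand side of equation 3.15: (1 - F) / (theta * f)."""
    return (1.0 - F(theta)) / (theta * f(theta))

def marginal_price(theta):
    """p = theta * V'(q(theta)), with V'(q(theta)) = c / (theta - (1 - F)/f) from 3.14."""
    return theta * C / (theta - (1.0 - F(theta)) / f(theta))

def margin_direct(theta):
    """Price-cost margin (p - c) / p computed from the marginal price."""
    p = marginal_price(theta)
    return (p - C) / p

# Types that are served in this assumed example (those above 1 + C/2).
TYPES = [1.3, 1.5, 1.7, 1.9]
for t in TYPES:
    print(t, round(hazard_rate(t), 4), round(margin_from_315(t), 4), round(margin_direct(t), 4))
```

For the uniform distribution the hazard rate is strictly increasing, and the margin falls to zero at the top of the distribution, where the marginal price equals marginal cost.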

Let us derive some further properties of the optimal
payment function $T(\cdot)$. Recall that $T'(q) = p(q)$. Hence,


$T''(q) = \dfrac{dp}{dq} = \dfrac{dp/d\theta}{dq/d\theta}.$

But $dq/d\theta > 0$, and, from equation 3.15, $dp/d\theta < 0$. Hence,
the payment function is concave. It is represented in fig-
ure 3.7.

Two properties result from figure 3.7: 


• The average price per unit, T(q)/q, decreases with q.⁵⁴ (This is the Maskin-Riley quantity-discount result.)

• Because a concave function is the lower envelope of its tangents, the optimal nonlinear payment schedule can also be implemented by offering a menu of two-part tariffs (where the monopolist lets the consumer choose

Figure 3.7 (the payment schedule T(q) as a function of q)


54. The slope of T at q(θ) is indeed smaller than that of the ray through the origin and the point (q(θ), T(q(θ))):


Vigid) 
ovigii<e nat 


ad 
because V is concave.


55. This point is made in a different context in Laffont and Tirole (1986). There, a government offers a menu of contracts to a regulated firm having private information about its technology. The optimal contracts can actually be chosen linear in cost overruns. That is, the government discriminates on the basis of the
among the continuum of two-part tariffs). This can be seen in the figure, where a consumer of type θ indeed chooses the two-part tariff with slope p(q(θ)).⁵⁵,⁵⁶
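The lower-envelope property can be checked numerically. The sketch below assumes a hypothetical concave schedule T(q) = aq − bq²/2 (it is not the optimal schedule of the text) and builds a menu of two-part tariffs from its tangents. The parameter values and the grid of tangency points are arbitrary.

```python
def tariff(q, a=1.0, b=0.5):
    """Hypothetical concave payment schedule T(q) = a*q - b*q**2/2."""
    return a * q - 0.5 * b * q * q


def marginal_price(q, a=1.0, b=0.5):
    """p(q) = T'(q) for the schedule above."""
    return a - b * q


def two_part_tariff(q0):
    """Tangent to T at q0, returned as (fixed fee A, unit price p)."""
    p = marginal_price(q0)
    fee = tariff(q0) - p * q0
    return fee, p


# Menu of two-part tariffs, one per tangency point q0 = 0.00, 0.01, ..., 1.50.
menu = [two_part_tariff(k / 100) for k in range(0, 151)]


def cheapest_payment(q):
    """Payment for q units when the consumer picks the best tariff in the menu."""
    return min(fee + p * q for fee, p in menu)


print(tariff(0.75), cheapest_payment(0.75))
```

On the grid of tangency points the cheapest two-part tariff reproduces T(q). Between grid points it lies slightly above T(q), and the gap vanishes as the menu becomes a continuum.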


More General Demand Functions 


As we earlier noted, the above theory holds for more general gross surplus functions V(q, θ) as long as ∂V/∂θ > 0 (which is just a normalization) and ∂²V/∂q∂θ > 0 (the single-crossing or sorting condition). Equation 3.14 then becomes

\[ \frac{\partial V}{\partial q}(q(\theta), \theta) = c + \frac{1 - F(\theta)}{f(\theta)} \frac{\partial^2 V}{\partial q\,\partial\theta}(q(\theta), \theta). \qquad (3.14') \]


The following exercise, inspired by Spulber (1981) and Wilson (1985), makes use of equation 3.14'.

Exercise 3.7** A monopolist with marginal cost c sells to heterogeneous consumers. The latter differ in their transportation cost tq for purchasing quantity q. The parameter t is distributed according to a c.d.f. G(t) on [0, +∞), with density g(t). The monopolist does not supply transportation. (Assume, further, that the consumers cannot arbitrage among themselves.) The consumer's utility function is

\[ \frac{1 - (1 - q)^2}{2} - tq - T(q) \]

when purchasing q units.

(i) How would you define θ, V(q, θ), F(θ), and f(θ) so as to be able to apply the general theory?

(ii) Let p(q) ≡ T'(q). Use equation 3.14' to compute the optimal p(q).

(iii) Assume that G(t) = t^α on [0, 1] (where α > 0). Compute the optimal nonlinear tariff and show that it exhibits quantity discounts.


fraction of cost overruns reimbursed to the firm. The use of linear (i.e., two-part) tariffs is more interesting in this incentive context because of the presence of uncertainty. Under risk neutrality, the linear contracts are still robust (optimal) under any kind of cost uncertainty or measurement error.

56. With two types of consumers, it turns out that two two-part tariffs do not allow for the implementation of the optimal nonlinear tariff. This looks bizarre, as it would seem that it would be more difficult to use two-part tariffs with more types. However, think of a two-point distribution as a continuous distribution with two atoms. This distribution is not well behaved, in that its hazard rate is not monotonic (which was a sufficient condition for the implementation with two-part tariffs).
Quantity Discounts and Quantity Premia


Maskin and Riley's result on quantity discounts relies on 
their plausible assumptions about the shape and distribu- 
tion of consumers’ preferences. In general, however, the 
optimal nonlinear tariff may involve quantity premia in- 
stead of quantity discounts. Suppose that a good can be 
consumed only in integer numbers and that there are two 
types of consumers. A low-demand consumer wants to 
consume exactly one unit of the good. His gross surplus 
is 1 if he consumes one unit or more and 0 otherwise. A 
high-demand consumer wants to consume exactly two 
units of the good. His gross surplus is 4 if he consumes 
two units or more and 0 otherwise. The optimal tariff for the monopolist is clearly T(q) = 1 for q = 1 and T(q) = 4 for q = 2. In this special case, the monopolist can discriminate perfectly; and, more important, the average price is 1 for a purchase of one unit and 2 for a purchase of two units.

Under the quality interpretation of the model, a good 
example of a premium involves automobile manufacturers, 
who typically try to extract surplus from high-valuation 
consumers (i.e., consumers who highly value luxury and prestige). The profit margins on top-of-the-line cars and optional equipment are generally higher than those on basic cars and equipment,⁵⁷ and this suggests the existence of quality premia.

Thus, we cannot determine on a priori grounds whether 
quantity (or quality) discounts or premia are optimal. 
Only a careful consideration of the likely shape and dis- 
tribution of consumer preferences can determine the op- 
timal business strategy. 


Welfare Aspects 


Katz (1983) shows that nonlinear pricing may yield too 
little or too much output in comparison with the social 
optimum. (Furthermore, the distribution of this given out- 
put between consumers is not welfare maximizing, be- 
cause the efficient means of rationing a fixed output is 


57. Scherer (1980, p. 394) quotes a memorandum on the 1966 Ford Galaxie four-door sedan revealing that, whereas the wholesale price of the basic car with standard equipment exceeded standard accounting costs by 17 percent, the markups on optional equipment were characteristically much higher: "e.g., 293 percent for a more powerful V-8 engine, 123 percent for power steering, and 58 percent for air conditioning."
uniform pricing.) However, if the single-crossing condition holds (as is assumed here), the monopolist generally produces too little output.


3.5.1.3 Optimal Nonlinear Tariffs as Ramsey Prices⁵⁸


Let us draw the analogy between the optimal non- 
linear tariff (given by equation 3.15, say) and the inverse- 
elasticity rule, in order to unify the theories of second- 
degree and third-degree price discrimination. 

To this purpose, let us decompose the aggregate-demand function into independent demands for marginal units of consumption. Fix a quantity q and consider the demand for the qth unit of consumption. This unit has, by definition, price p. The proportion of consumers willing to buy this unit is

\[ D_q(p) \equiv 1 - F(\theta_q^*(p)), \]

where θ*_q(p) denotes the type of consumer who is indifferent between buying and not buying the qth unit at price p:

\[ \theta_q^*(p) V'(q) = p. \]

The demand for the qth unit is independent of the demand for the q̃th unit for q̃ ≠ q.⁵⁹ We can thus apply the inverse-elasticity rule. The optimal price for the qth unit is given by

\[ \frac{p - c}{p} = -\frac{D_q}{(dD_q/dp)\,p}. \]

However,

\[ \frac{dD_q}{dp} = -f(\theta_q^*(p)) \frac{d\theta_q^*}{dp} = -f(\theta_q^*(p)) \frac{\theta_q^*(p)}{p}. \]

We thus obtain

\[ \frac{p - c}{p} = \frac{1 - F(\theta_q^*(p))}{\theta_q^*(p) f(\theta_q^*(p))}, \]

which is nothing but equation 3.15.
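The equivalence between the inverse-elasticity rule and equation 3.15 can be verified numerically for a particular case. The sketch below assumes types uniform on [0, 1] and a hypothetical value V'(q) = 2 at the fixed quantity q. It computes the elasticity of the demand for the qth unit by finite differences.

```python
V_PRIME_Q = 2.0  # hypothetical value of V'(q) at the fixed quantity q


def F(theta):
    """C.d.f. of types, assumed uniform on [0, 1] for this illustration."""
    return theta


def f(theta):
    """Density of the uniform distribution on [0, 1]."""
    return 1.0


def marginal_type(p):
    """Type indifferent about the qth unit: theta* V'(q) = p."""
    return p / V_PRIME_Q


def unit_demand(p):
    """D_q(p) = 1 - F(theta*_q(p))."""
    return 1.0 - F(marginal_type(p))


def inverse_elasticity(p, h=1e-6):
    """-D_q / (p * dD_q/dp), with the derivative taken by central differences."""
    slope = (unit_demand(p + h) - unit_demand(p - h)) / (2 * h)
    return -unit_demand(p) / (slope * p)


def rhs_315(p):
    """(1 - F(theta*)) / (theta* f(theta*)), the right-hand side of equation 3.15."""
    theta = marginal_type(p)
    return (1.0 - F(theta)) / (theta * f(theta))


for p in (0.4, 1.0, 1.6):
    print(p, inverse_elasticity(p), rhs_315(p))
```

The two columns printed for each price agree up to the error of the finite-difference approximation.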


58. This subsection is inspired by Brown and Sibley 1986 and by Goldman et al. 1984.

59. This independence is due to the absence of income effects. The price charged for the inframarginal units has no influence on the demand for the marginal unit.
3.5.2 Commodity Bundling
3.5.2.1 Homogeneous Good 


We saw that a discriminating monopolist may want to 
practice quantity discounts. The price of two units of a 
good is then lower than twice the price of one unit. In a 
sense, the two units purchased by a consumer are bundled 
(cannot be bought separately at no extra cost). However, 
we cannot conclude that the monopolist induces con- 
sumers to consume more by offering quantity discounts. 
We must give a precise definition of “consuming more.” 
If we take the efficient solution as a reference, we actually 
see that discrimination by the monopolist may lead to 
suboptimal consumption. 

The monopolist's offering the choice between one and 
two units of the good is an instance of what Adams and 
Yellen (1976) call “mixed commodity bundling” (for in- 
stance, airline companies offer one-way as well as round- 
trip tickets). Pure commodity bundling occurs when the 
monopolist offers only the two-unit bundle. Although such an all-or-nothing policy is not, in general, optimal,⁶⁰ it is easily justified in circumstances in which there are economies of scale in production or in distribution. It may be cheaper to manufacture or sell two units together rather than twice one unit.

To understand when pure bundling may occur in the ab- 
sence of economies of scale, recall the price-discrimination 
model. Suppose that there are two types of consumers, 
with low and high demands. Suppose further that the 
efficient allocation is for the low-demand consumers to 
consume one unit of a good and for the high-demand 
consumers to consume two units (to simplify, the good 
must be consumed in integer numbers). As we saw in 
section 3.3, if the proportion of high-demand consumers 
is high, the monopolist does not want to serve the low- 
demand consumers in order to extract the high-demand 
consumers’ surplus; thus, he induces them to consume 


60. As Adams and Yellen show, mixed bundling always dominates pure bundling. Intuitively, the monopolist has more "instruments." Suppose that the pure-bundling price is p_b. The monopolist can still sell the bundle at price p_b and sell each unit at price p_b − c (where c is the unit cost). This policy cannot generate a lower profit than the pure-bundling one. The reasoning easily extends to the bundling of heterogeneous goods.

61. We still rule out economies of scale. The cost of producing two units is the same whether the monopolist has chosen the one-unit or the two-unit package size. We only assume that the two sizes are mutually incompatible.


zero units rather than one. The optimal policy is then to 
offer the two-unit package only (for a payment equal to 
the high-demand consumers’ gross surplus for two units). 
We thus have an instance of pure commodity bundling. 

Pure commodity bundling becomes even more likely 
when, for technological or marketing reasons, a good 
must be sold in either one-unit or two-unit packages, but 
not both. (Think of a firm having to choose one size, either size 1 or size 2, for its product.⁶¹) In the previous example, the same profit was obtained by selling two units to the high-demand consumers and zero to the low-demand consumers, and the monopolist chose the size-2 technology.⁶² In this example, however, the profitability of serving the two types of consumers is reduced, because the single-size constraint prevents price
discrimination. This makes the policy of serving high- 
demand consumers only (and, hence, offering a two-unit 
consumption deal only) more appealing. The following 
exercise illustrates this reasoning. 


Exercise 3.8** Consumers have preferences U = θV(q) − T. The consumption q can take a value of 0, 1, or 2. V(0) = 0, V(1) = 1, and V(2) = 7/4. The unit production cost is c = 3/4 whatever the size of the bundle. There are two types of consumers: θ₁ = 1 (in proportion λ) and θ₂ = 2 (in proportion 1 − λ). The consumers can engage in personal arbitrage.

(i) Show that, in the absence of a technological constraint (i.e., in a case where the monopolist can produce both sizes), the monopolist uses pure commodity bundling if and only if λ < 4/5.

(ii) Suppose that, for technological reasons, the monopolist must choose to produce the good in either size 1 or size 2. Show that the monopolist chooses the size-2 technology if and only if λ < 6/7.


Exercise 3.9* Buying a season ticket is sometimes the only means of attending a certain sports or cultural event.


62. This allocation cannot be reached using the size-1 package. To extract the high-demand consumers' gross surplus for two units, the monopolist would have to charge half this gross surplus. But because the gross surplus function is strictly concave, the high-demand consumers would only consume one
unit—mathematically 




Often, however, tickets can also be bought for specific 
events as well. Discuss this in light of the above argu- 
ments (and, possibly, others that you may want to bring 
in). 


3.5.2.2 Heterogeneous Goods


The previous discussion focused on bundling of several 
units of a single commodity. Commodity bundling can 
also affect several commodities. For instance, a restaurant 
ties the consumption of several dishes into a menu, a bank 
offers an indivisible array of services (e.g., checking, safe deposit, and travelers' checks), and a tour operator sells comprehensive vacation plans.

Formally, the tying of units of several goods is similar to that of several units of the same good. However, while one can make reasonable assumptions about the cross-distribution of marginal utilities for the good in the single-commodity framework,⁶³ it is harder to formulate a priori restrictions on the cross-distribution of the utilities for the various goods. It is, therefore, not surprising that the theory of multi-commodity bundling has focused on examples. The exercise below gives one example; other examples and theoretical developments can be found in Adams and Yellen 1976, Telser 1979, Schmalensee 1984, and Lewbel 1985.

Exercise 3.10* A common practice in the U.S. film industry (before it was outlawed) was the bundling of several films at the distribution level. Stigler (1963) offered the following simple model to formalize this practice, which was also called "block booking": There are two downstream units (theaters), two films, and one monopoly film producer. The first downstream unit values film 1 at 4 and film 2 at 1. The second values film 1 at 3 and film 2 at 2. The value of a bundle to each downstream unit is equal to the sum of its valuations for the two films (there is no interdependence effect). Show that the film producer wants to bundle the two films.


63. For example, one can make the Spence-Mirrlees assumption that the marginal utility of the good at any quantity increases with the consumer's "type."

64. Let c denote the unit cost and price of the other product. If the monopolist charges p_b for the bundle, he can do the same by selling his good separately at price p_b − c. This result does not extend to correlated valuations. Schmalensee


It was mentioned above that mixed bundling dominates 
pure bundling (weakly, at least). It also dominates unbun- 
dled sales (for the same reason: the monopolist has more 
instruments). But, as Schmalensee (1982b) shows, it need 
not strictly dominate unbundled sales; a monopolist does 
not gain from bundling his product to another product 
that is produced competitively if the consumers' reservation values (valuations) for the two products are independent.⁶⁴ McAfee, McMillan, and Whinston (1987) provide very general conditions under which, even for independent reservation prices, the monopolist strictly prefers mixed bundling over unbundled sales.⁶⁵


3.5.3. The Insurance Market 


Finally, consider the analogy between an insurance market, in which consumers differ in their risk, and the quality-differentiation model. A consumer with initial income I can face either of two states of nature. In state of nature 2 ("accident"), which occurs with probability θ, he incurs a loss with monetary equivalent L. In state of nature 1 ("no accident"), which occurs with probability 1 − θ, he incurs no loss. The probability θ is exogenous (there is no "moral hazard"). The consumer has a von Neumann-Morgenstern utility function U.

A risk-neutral firm, which (to simplify) has monopoly power, can supply insurance to the consumer. For a fixed premium p, to be paid in all states of nature, the firm reimburses an amount s in case of an accident. s can be seen as an insurance service. The consumer's expected utility for the insurance policy (p, s) is

\[ \theta U(I - p - L + s) + (1 - \theta) U(I - p), \]

and the firm's expected profit is p − θs.

Assume first that everything (except the realization of 
the state of nature) is known to both parties. The firm 
offers an insurance contract to the consumer before the 
realization of the state of nature. The contract is accepted 
by the consumer only if it gives him an expected utility 
exceeding that in the absence of insurance contract. The 


shows that if the consumers' reservation prices for the two goods are negatively correlated (the above exercise yields an example of negative correlation), the monopolist may strictly gain by pursuing mixed bundling.

65. See also Chae 1987 for an analysis of bundling for a specific (independent) distribution of reservation prices.
individual rationality constraint can be written as

\[ \theta U(I - p - L + s) + (1 - \theta) U(I - p) \geq \theta U(I - L) + (1 - \theta) U(I). \qquad (3.16) \]

The maximization of the expected profit (p − θs) over the contract (p, s) subject to constraint 3.16 implies that s = L. The firm fully reimburses the consumer in case of accident (full insurance). This is a classic result in the theory of insurance. Just as efficiency requires that marginal rates of substitution be equated across goods and consumers, Borch's (1968) rule says that the consumer's rate of substitution between income in the two states of nature,

\[ \frac{U'(I - p - L + s)}{U'(I - p)}, \]

must equal the firm's (1/1), or

\[ I - p - L + s = I - p; \]

that is, s = L.⁶⁶
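The full-insurance result can be illustrated numerically. The sketch below assumes logarithmic utility and hypothetical values I = 10, L = 5, and θ = 0.2. For each coverage level s on a grid, it finds by bisection the highest premium that satisfies constraint 3.16 and then compares the resulting expected profits.

```python
import math

INCOME, LOSS, THETA = 10.0, 5.0, 0.2  # hypothetical parameter values


def expected_utility(p, s, theta=THETA):
    """theta*U(I - p - L + s) + (1 - theta)*U(I - p), with U = log (an assumption)."""
    return (theta * math.log(INCOME - p - LOSS + s)
            + (1 - theta) * math.log(INCOME - p))


RESERVATION = expected_utility(0.0, 0.0)  # expected utility without insurance


def max_premium(s):
    """Highest premium satisfying constraint 3.16 for coverage s (bisection)."""
    lo, hi = 0.0, min(INCOME, INCOME - LOSS + s) - 1e-9
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if expected_utility(mid, s) >= RESERVATION:
            lo = mid
        else:
            hi = mid
    return lo


def profit(s):
    """Expected profit p - theta*s when the premium makes constraint 3.16 bind."""
    return max_premium(s) - THETA * s


grid = [0.5 * k for k in range(0, 21)]  # coverage levels 0, 0.5, ..., 10
best_coverage = max(grid, key=profit)
print(best_coverage, profit(best_coverage))
```

On this grid, expected profit is highest at s = L; both partial coverage and overinsurance yield less.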

Second, assume that the consumer knows his risk parameter θ, but that the firm does not. This is the Rothschild-Stiglitz (1976) and Wilson (1977) model of discrimination in the insurance market.⁶⁷ Assume for simplicity that there are only two potential probabilities of accident: θ₁ < θ₂. (The case with a continuum of types can be treated as in subsection 3.5.1.2.) The insurance company then offers two contracts: {p₁, s₁} and {p₂, s₂}.

Let u(p, s, θ) denote the utility of a consumer with probability of accident θ for the contract (p, s). This consumer's marginal rate of substitution between insurance services and income is, by definition, equal to

\[ \frac{\partial u}{\partial s} \Big/ \left( -\frac{\partial u}{\partial p} \right). \]

A straightforward computation shows that

\[ \frac{\partial}{\partial\theta} \left[ \frac{\partial u}{\partial s} \Big/ \left( -\frac{\partial u}{\partial p} \right) \right] > 0. \qquad (3.17) \]

This inequality means that when the probability of accident increases, the consumer is more eager to buy insurance services. In other words, he is willing to pay a higher marginal premium for a given marginal increase in services. Hence, in our discrimination problem, type-θ₂ ("high-risk") consumers are more eager to buy insurance than type-θ₁ ("low-risk") consumers.
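Inequality 3.17 can be checked for a particular utility function. The sketch below assumes logarithmic utility (so that U'(w) = 1/w) and a hypothetical contract and income. It evaluates the marginal rate of substitution for several accident probabilities.

```python
def marginal_utility(wealth):
    """U'(w) for U = log, an assumption made for this illustration."""
    return 1.0 / wealth


def willingness_to_pay(theta, p, s, income=10.0, loss=5.0):
    """Marginal rate of substitution (du/ds) / (-du/dp) at the contract (p, s)."""
    mu_accident = marginal_utility(income - p - loss + s)
    mu_no_accident = marginal_utility(income - p)
    du_ds = theta * mu_accident
    du_dp = -(theta * mu_accident + (1 - theta) * mu_no_accident)
    return du_ds / (-du_dp)


# Hypothetical contract (p, s) = (1, 2) and a few accident probabilities.
probabilities = (0.1, 0.2, 0.4, 0.8)
rates = [willingness_to_pay(theta, p=1.0, s=2.0) for theta in probabilities]

for theta, rate in zip(probabilities, rates):
    print(f"theta={theta:.1f}  marginal rate of substitution={rate:.3f}")
```

The marginal premium that the consumer is willing to pay per unit of extra coverage rises with θ, as the sorting condition requires.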

Inequality 3.17 is the sorting (or Spence-Mirrlees) con- 
dition for the insurance market. It implies that the only 
way to discriminate between the two types of buyers 
is to induce high-risk consumers to get more insurance: 
s₂ > s₁. There is a strong analogy between this model
and the quality-discrimination model. 

Although the model does not quite fit the consumers' objective function of subsection 3.5.1, the previous techniques and conclusions can be extended.⁶⁸ In particular, conclusion 3 becomes as follows: The high-risk consumers get full insurance (s₂ = L), while the low-risk consumers get suboptimal insurance (s₁ < L). As before, the intuition is that the firm wants to prevent the high-risk consumers from consuming the low-risk consumers' bundle.⁶⁹ It reduces the services offered to the low-risk consumers to
discourage the high-risk consumers from exercising per- 
sonal arbitrage. This policy is indeed profitable, because a 
reduction in services is relatively less costly to the low- 
risk than to the high-risk consumers. Figure 3.8 illustrates 
this result. 

The derivation of the optimal insurance contracts fol- 
lows a few simple geometrical considerations (see Stiglitz 
1977 for a formal argument). The figures are drawn in the 
consumption space rather than in the (p,s) space. 

In figure 3.8a, E depicts the bundle of consumptions 
in the two states of nature in the absence of insurance 
(p =s =0). This bundle is the same for both types of 
consumers. The consumers’ indifference curves through 


66. The different states of nature can be thought of as different goods. The theory of uncertainty considers the same physical good available in two different states of nature as two different economic goods. See Debreu 1959, chapter 7.

67. The monopoly case is actually considered in Stiglitz 1977. Rothschild and Stiglitz (1976) and Wilson (1977) couch the problem in a competitive insurance market.

68. The Maskin-Riley framework includes the insurance market as a special case. One similarity with the quality-discrimination model is that the type who is most eager to buy the services benefits from the asymmetry of information if and only if the other type is served by the monopolist. See below.

69. In the quality or the quantity model, the low type would pay less than the high types for the right to buy at marginal cost if the firm could tell different types apart (i.e., if the firm had full information). Here, under full information, the firm would offer the low type a lower premium for the service s = L, because the low type does better without insurance than the high type.
Figure 3.8
(a) Complete information. Vertical axis represents income (accident) (= I − p − L + s); horizontal axis represents income (no accident) (= I − p). Labels in the panels: the 45° line, the type-θ₁ and type-θ₂ consumers' indifference curves, and the isoprofit curves for each type (slope: −(1 − θ)/θ). (b) Incomplete information: screening possibilities. (c) Incomplete information: λ small. Low-risk consumers do not buy insurance. (d) Incomplete information: λ large. Low-risk consumers buy a suboptimal amount of insurance.


E are represented. The low-risk consumers' indifference curve is always steeper than the high-risk consumers'. Under complete information about θ, the monopolist offers full insurance at the minimal level at which consumers buy (point A_i for type θ_i). Thus, the individual-rationality constraint is binding. Note also that the high-risk consumers would like to be able to purchase the low-risk consumers' full insurance policy.

Figure 3.8b depicts how the two types of consumers can be separated when the monopolist does not observe θ. For instance, if B is the bundle directed at low-risk consumers, any allocation for the high-risk consumers must lie in region R₂ (it must be preferred to B by type-θ₂ consumers, but it must not induce type-θ₁ consumers to abandon B). Conversely, if B is the allocation targeted for type-θ₂ consumers, that for type-θ₁ consumers must belong to region R₁.

Figure 3.8c depicts the optimal allocation when the market is composed mainly of high-risk consumers. High-risk consumers obtain full insurance but do not derive any net surplus from the insurance policy (point C₂); their utility, as in the full-information case, is the same as the one they would obtain in the absence of insurance. Low-risk consumers do not buy insurance (point C₁). To
understand why this allocation is not easily upset by offer- 
ing some insurance to low-risk customers, one can draw 
the latter's indifference curve through E. Any contract 
that is accepted by them lies to the northeast of this 
indifference curve. Furthermore, to generate gains from 
trade between the risk-neutral insurance company and the 
risk-averse low-risk customers, this point must represent a 
move from E toward the diagonal (i.e., must involve posi- 
tive insurance). However, the monopolist must then leave 
a positive net surplus to the high-risk consumers in order 
for them not to buy the low-risk consumers’ bundle. This 
is too costly to the monopolist if the proportion of high- 
risk consumers is large. The new trade generated by the 
new contract is not worth its cost in terms of incentives. 

When the proportion of low-risk consumers is larger, the trade-off operates the opposite way (figure 3.8d). The monopolist prefers to sell to both types of consumers. The low-risk consumers, who choose D₁, do not derive a net surplus from the possibility of insurance; the high-risk ones do by choosing D₂. The high-risk consumers' net surplus is the minimum one consistent with their not choosing D₁.

In both cases, the high-risk consumers are efficiently (i.e., fully) insured and the low-risk consumers are suboptimally insured.
Answers and Hints
Exercise 3.1 


(i) The consumer maximizes gross surplus minus the tariff. But this objective function is equal to zero for any level of consumption. Thus, the consumer is willing to consume the competitive level q = q^c/n. Because the consumers' surplus is completely captured, the monopolist's profit is equal to the maximum social surplus. The reader who worries about the consumer's indifference among all levels of consumption can imagine that the tariff is equal to the gross consumer surplus for q ≠ q^c/n and equal to the gross consumer surplus minus ε for q = q^c/n. The consumer then strictly prefers the competitive quantity.

(ii) The new (residual) inverse-demand curve for the monopolist is

\[ \tilde P(q) = \begin{cases} p_0 & \text{for } q \leq D(p_0), \\ P(q) & \text{for } q \geq D(p_0). \end{cases} \]

Assuming for simplicity a constant marginal cost c and a single consumer, the optimal two-part tariff is p = c and A = ∫_0^{q^c} [P̃(q) − c] dq. Alternatively, the monopolist can charge the fully nonlinear tariff T(q) = ∫_0^q P̃(x) dx. These results are depicted in figure 3.9.


Exercise 3.2 


Let q = a − b(p + tx) denote the demand function, where p is now the f.o.b. price for a consumer located at x.

(i) Under nonuniform f.o.b. prices, the producer maximizes

\[ (p - c)(a - bp - btx) \]

for all x. This yields a delivered price of

\[ p(x) + tx = \frac{a}{2b} + \frac{c}{2} + \frac{tx}{2}. \]

Note that there is a 50 percent freight absorption.

(ii) Under a uniform f.o.b. price p, the producer maximizes

\[ \int_0^1 (p - c)(a - bp - btx)\,dx. \]

This yields


Under discrimination, the delivered price is higher for x < 1/2 and lower for x > 1/2. Hence, uniform pricing involves a cross-subsidization from distant customers to local ones.

(iii) A uniform f.o.b. price tends to reduce the market, since the delivered price for the consumer located at x = 1 is higher.
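The comparison between the two pricing policies can be reproduced with exact arithmetic. The sketch below assumes that consumers are located uniformly on [0, 1] (as the limits of the integral in part ii suggest) and uses hypothetical values a = 10, b = 1, c = 2, and t = 2. The helper `monopoly_price` is introduced only for this illustration.

```python
from fractions import Fraction as Fr


def monopoly_price(intercept, slope, cost):
    """Maximizer of (p - cost) * (intercept - slope * p), from the first-order condition."""
    return (intercept / slope + cost) / 2


# Hypothetical parameter values for the demand q = a - b(p + t x).
A, B, C, T_COST = Fr(10), Fr(1), Fr(2), Fr(2)


def delivered_discriminatory(x):
    """Delivered price when the f.o.b. price is tailored to location x."""
    return monopoly_price(A - B * T_COST * x, B, C) + T_COST * x


# With x uniform on [0, 1], aggregate demand is a - b p - b t / 2.
UNIFORM_FOB = monopoly_price(A - B * T_COST / 2, B, C)


def delivered_uniform(x):
    """Delivered price under a single f.o.b. price for all locations."""
    return UNIFORM_FOB + T_COST * x


for x in (Fr(0), Fr(1, 4), Fr(1, 2), Fr(3, 4), Fr(1)):
    print(x, delivered_discriminatory(x), delivered_uniform(x))
```

With these numbers the discriminatory delivered price rises by t/2 between x = 0 and x = 1 (50 percent freight absorption), and the two delivered prices cross at x = 1/2.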


Exercise 3.3 
Serving both types yields

\[ \Pi_1 = (\theta - c)^2 / 4\theta, \]

as is seen in the text. Serving only the high type (at monopoly price (c + θ₂)/2) yields

\[ \Pi_2 = (1 - \lambda)(\theta_2 - c)^2 / 4\theta_2. \]

Π₁ − Π₂ grows with λ. (Warning: θ depends on λ.) For λ = 1,

\[ \Pi_1 - \Pi_2 = \Pi_1 > 0, \]

and for λ = 0,

\[ \Pi_1 - \Pi_2 < 0. \]

Hence, Π₁ > Π₂ only if λ is "large enough." Alternatively, if θ₁ > (c + θ₂)/2, monopoly pricing for the type-θ₂ consumers yields consumption by the type-θ₁ consumers.
Figure 3.10 (horizontal axis: q, with D₁(p), D₂(p), and D₂(p̃) marked; curves shown: the linear tariff, the two-part tariff, and the type-θ₁ and type-θ₂ consumers' indifference curves)

Exercise 3.4 


Figure 3.10 depicts the issue. 

Suppose that the monopolist offers, along with the linear tariff T(q) = pq, the two-part tariff T(q) = A + p̃q, where

\[ c < \tilde p < p \]

and

\[ p D_2(p) = A + \tilde p D_2(p). \]

The second condition says that the type-θ₂ consumers can also afford their initial bundle with the two-part tariff.

It is clear from figure 3.10 that the type-θ₁ consumers stick to their old consumption pattern (i.e., they use the linear tariff). The type-θ₂ consumers are made better off because their opportunity set (the lower envelope of the linear and two-part tariffs) is improved. They increase their consumption by

\[ D_2(\tilde p) - D_2(p). \]

The monopolist enjoys this increase in consumption, because his profit margin on these extra units is (p̃ − c) > 0.

More formally, the increase in the monopolist's profit from type-θ₂ consumers is

\[ (1 - \lambda)[A + (\tilde p - c) D_2(\tilde p) - (p - c) D_2(p)] = (1 - \lambda)(\tilde p - c)[D_2(\tilde p) - D_2(p)] > 0. \]
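The identity above can be checked with exact arithmetic. The sketch below assumes a hypothetical linear demand D₂(p) = 2 − p for a type-θ₂ consumer and hypothetical values of c, p, p̃, and λ. None of these numbers come from the exercise.

```python
from fractions import Fraction as Fr

C = Fr(1, 2)      # marginal cost (hypothetical)
P = Fr(5, 4)      # initial linear price p (hypothetical)
P_TILDE = Fr(1)   # unit price of the two-part tariff, chosen so that c < p~ < p
LAM = Fr(1, 3)    # proportion of type-theta_1 consumers (hypothetical)


def demand_high(price):
    """Hypothetical linear demand of a type-theta_2 consumer."""
    return 2 - price


# Fixed fee chosen so that the old bundle costs the same under both tariffs:
# p * D2(p) = A + p~ * D2(p).
FEE = (P - P_TILDE) * demand_high(P)

# Left-hand side of the profit identity.
gain = (1 - LAM) * (FEE
                    + (P_TILDE - C) * demand_high(P_TILDE)
                    - (P - C) * demand_high(P))

# Right-hand side: margin on the extra units times the increase in consumption.
extra_units_margin = (1 - LAM) * (P_TILDE - C) * (demand_high(P_TILDE) - demand_high(P))

print(gain, extra_units_margin)
```

Both expressions coincide and are positive for these values, since the fixed fee exactly offsets the lower unit price on the units that were already being bought.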


70. Here we follow the analysis of Klein and Saft (1984).




The picture changes dramatically when the consumers’ 
demands are interrelated (as is the case when consumers 
are firms competing on the same product market). Pur- 
chases by low-demand consumers may be reduced by the 
discount offered to high-demand consumers, and there 
may exist no nonlinear tariff that is Pareto superior to a 
uniform price (as has been shown by Ordover and Panzar 
[1980]). 


Exercise 3.5 


(i) In the absence of tie-ins, the monopolist can either sell to both types and capture only the low-demand type's surplus (profit S₁(c)) or sell to high-demand consumers only (profit (1 − λ)S₂(c)).

(ii) For these numerical values: Under prohibition,

\[ S_1(c) < (1 - \lambda) S_2(c). \]

Aggregate welfare is equal to

\[ (1 - \lambda) S_2(c). \]

With tie-ins, the manufacturer prefers to sell to both types, as

\[ S_1(p) + (p - c) D(p) > (1 - \lambda) S_2(c), \]


where p is the optimal price computed in the text (p = 3); 
welfare is equal to 


\[ \lambda S_1(p) + (1 - \lambda) S_2(p) + (p - c) D(p) > (1 - \lambda) S_2(c), \]

as S₂(p) > S₁(p).

Exercise 3.6⁷⁰


Chicken Delight claimed that the tie was motivated by 
quality-control considerations. However, it was not tying 
the purchase of chicken, a more likely target of quality 
control; furthermore, repeat sales mattered a lot to franchisees, so their incentive to buy leaky buckets was weak.

Suppose that a franchisee can serve either of two types 
of geographical area. The first type is composed of con- 
sumers of single dinners with a fairly high reservation 
price; the second type is composed of families purchas- 
ing bulk orders and having a lower reservation price 
(because home meal preparation is more common). Chicken Delight may have wanted to extract the extra profit from franchisees serving the first type of area. (It could,
alternatively, have imposed franchise fees tailored to the 
geographical site, but such “discrimination” between re- 
tailers is often considered illegal.) Chicken Delight could 
have used the relative price of buckets and kits to price- 
discriminate. 

In favor of the price-discrimination hypothesis, note 
that “single dinners were, on a per piece of chicken basis, 
priced higher than bucket sales” and that “the implied 
royalty payment of a typical Chicken Delight store sell- 
ing mostly chicken dinners was significantly higher than 
a typical store selling mostly bulk orders” (Klein and Saft 
1984, p. 11). 


Exercise 3.7 
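(i) Define

\[ \theta \equiv -t, \]

\[ V(q, \theta) \equiv \frac{1 - (1 - q)^2}{2} + \theta q, \]

\[ \frac{\partial V}{\partial\theta} = q > 0, \quad \frac{\partial^2 V}{\partial q\,\partial\theta} = 1 > 0, \quad \text{and} \quad \frac{\partial^2 V}{\partial q^2} = -1 < 0. \]

Also define F(θ) ≡ 1 − G(−θ) and f(θ) ≡ g(−θ) for θ in (−∞, 0].

(ii) p(q) = 1 − q + θ results from consumer optimization. Furthermore, equation 3.14' yields

\[ p(q) = c + \frac{1 - F(\theta)}{f(\theta)}. \]

(iii) Using the previous two equations and the specific functional form, we obtain

\[ p(q) = 1 - q - t, \qquad p(q) = c + t/\alpha. \]

Eliminating t yields

\[ p(q) = \left( \frac{\alpha}{1 + \alpha} \right) c + \left( \frac{1}{1 + \alpha} \right)(1 - q). \]

Thus, p(q) is decreasing in q.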
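The elimination of t in part iii can be checked with exact arithmetic. The sketch below uses hypothetical values c = 1/4 and α = 2, for which the quantities between 0 and 3/4 correspond to transportation costs t in [0, 1].

```python
from fractions import Fraction as Fr


def marginal_price(q, c, alpha):
    """p(q) = [alpha / (1 + alpha)] c + [1 / (1 + alpha)] (1 - q)."""
    return (alpha * c + (1 - q)) / (1 + alpha)


def implied_transport_cost(q, c, alpha):
    """t recovered from consumer optimization, p(q) = 1 - q - t."""
    return 1 - q - marginal_price(q, c, alpha)


COST, ALPHA = Fr(1, 4), Fr(2)  # hypothetical parameter values
quantities = [Fr(k, 8) for k in range(0, 7)]  # q = 0, 1/8, ..., 3/4

for q in quantities:
    t = implied_transport_cost(q, COST, ALPHA)
    # Second equation of part (iii): p(q) = c + t / alpha.
    print(q, marginal_price(q, COST, ALPHA), COST + t / ALPHA)
```

For every quantity in the list, the two expressions for p(q) agree, and the marginal price falls as q rises.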


Exercise 3.8 


(i) The efficient quantity for high-demand consumers is
$q = 2$, because the marginal utility of the second unit is

$\theta_2 (V(2) - V(1)) = 3/2 > c = 3/4$.

That for low-demand consumers is $q = 1$ (the second unit
actually yields a zero social surplus). Pure commodity
bundling involves offering only the two-unit package at
price $\theta_2 V(2) = 7/2$. This yields a profit equal to

$(1 - \lambda)(7/2 - 3/2) = 2(1 - \lambda)$.

To serve both consumers, the monopolist charges
$T(1) = \theta_1 V(1) = 1$ and $T(2)$ so as to satisfy

$\theta_2 V(2) - T(2) = \theta_2 V(1) - T(1)$.

Thus, $T(2) = 5/2$. Total profit is

$\lambda(1 - 3/4) + (1 - \lambda)(5/2 - 3/2) = \lambda/4 + (1 - \lambda)$.

Hence, pure commodity bundling is preferred to mixed
commodity bundling if and only if

$2(1 - \lambda) > \lambda/4 + (1 - \lambda)$,

or $\lambda < 4/5$.

(ii) If the monopolist chooses the size-2 technology, he
obtains $2(1 - \lambda)$ (unless he charges a low price so as to
serve the low-demand consumers, but this policy, which
yields 1/4, is easily seen not to affect our result). If the
monopolist chooses the size-1 technology, he has the
choice between four prices $p$ per unit, which are the
four marginal utilities of units (two per type of buyer). If
$p = 1$, the profit is

$\lambda(1 - 3/4) + (1 - \lambda)(2 - 3/2) = \lambda/4 + (1 - \lambda)/2$.

If $p = 3/4$, the low-demand buyers buy two units, but, as
$p = c$, the monopolist makes no profit. If $p = 2$, only the
high-valuation buyer buys, and still just one unit. The
profit is

$(1 - \lambda)(2 - 3/4) = 5(1 - \lambda)/4$,

which is clearly dominated by the profit for a size-2
package. If $p = 3/2$, only the high-valuation consumers
buy, and they buy two units. The profit is

$(1 - \lambda)(3 - 3/2) = 3(1 - \lambda)/2$,

which is also dominated. Hence, the size-2 technology is
chosen if and only if

$2(1 - \lambda) > \lambda/4 + (1 - \lambda)/2$,

or $\lambda < 6/7$.
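The profit comparisons in this answer can be checked numerically. The sketch below is only illustrative: the gross utilities (1 and 7/4 for the low-demand type, 2 and 7/2 for the high-demand type), the unit cost 3/4, and all function names are assumptions inferred from the profit expressions in the answer, not data quoted from the exercise statement.

```python
from fractions import Fraction as F

C = F(3, 4)  # assumed unit cost
# Assumed gross utilities theta_i * V(q) for q = 0, 1, 2.
GROSS = {
    "low": {0: F(0), 1: F(1), 2: F(7, 4)},
    "high": {0: F(0), 1: F(2), 2: F(7, 2)},
}
PRICES = [F(1), F(3, 4), F(2), F(3, 2)]  # the four marginal utilities


def pure_bundling(lam):
    """Two-unit package only, priced at the high type's gross utility."""
    return (1 - lam) * (GROSS["high"][2] - 2 * C)


def mixed_bundling(lam):
    """One-unit and two-unit packages; lam is the share of low-demand buyers."""
    t1 = GROSS["low"][1]
    # The high type must be indifferent between the two packages.
    t2 = GROSS["high"][2] - (GROSS["high"][1] - t1)
    return lam * (t1 - C) + (1 - lam) * (t2 - 2 * C)


def units_bought(kind, p):
    """Units bought at per-unit price p (ties broken toward buying more)."""
    u = GROSS[kind]
    return max((0, 1, 2), key=lambda q: (u[q] - p * q, q))


def per_unit_profit(lam, p):
    """Profit with the size-1 technology and a per-unit price p."""
    sold = lam * units_bought("low", p) + (1 - lam) * units_bought("high", p)
    return (p - C) * sold


def size1_profit(lam):
    """Best profit over the four candidate per-unit prices."""
    return max(per_unit_profit(lam, p) for p in PRICES)
```

Under these assumed values, `pure_bundling` and `mixed_bundling` coincide at a low-demand share of 4/5, and `pure_bundling` and `size1_profit` coincide at 6/7, which are the two thresholds of the answer.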


Exercise 3.10 


By selling film 1 separately, the monopolist gets max(4,
2 × 3) = 6. By selling film 2 separately, he gets max(2,
2 × 1) = 2. Thus, the total profit is 8. By bundling the
two, he gets 2 × 5 = 10, because the total willingness to
pay for each unit is equal to 5.
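The arithmetic above can be reproduced with a short sketch. The valuation table is an assumption inferred from the numbers in the answer (one buyer values the films at 4 and 1, the other at 3 and 2); the buyer and film labels and the function names are hypothetical.

```python
# Assumed willingness to pay, inferred from the answer's arithmetic.
VALUATIONS = {
    "buyer_A": {"film_1": 4, "film_2": 1},
    "buyer_B": {"film_1": 3, "film_2": 2},
}


def best_uniform_revenue(values):
    """Best revenue from a single posted price: try each valuation as the price."""
    return max(p * sum(v >= p for v in values) for p in values)


def separate_revenue(valuations):
    """Revenue when each film is sold on its own at one price per film."""
    films = next(iter(valuations.values())).keys()
    return sum(
        best_uniform_revenue([buyer[film] for buyer in valuations.values()])
        for film in films
    )


def bundle_revenue(valuations):
    """Revenue when the films are sold only as a package at one price."""
    totals = [sum(buyer.values()) for buyer in valuations.values()]
    return best_uniform_revenue(totals)


print(separate_revenue(VALUATIONS), bundle_revenue(VALUATIONS))
```

Bundling helps here because the two buyers' valuations are negatively correlated across films, so their totals are identical even though their valuations of each film differ.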


References 


Adams, W., and J. Yellen. 1976. Commodity Bundling and the
Burden of Monopoly. Quarterly Journal of Economics 90: 475-498.

Axell, B. 1977. Search Market Equilibrium. Scandinavian Journal of
Economics 79: 20-40.

Baron, D., and R. Myerson. 1982. Regulating a Monopolist with
Unknown Costs. Econometrica 50: 911-930.

Bénabou, R. 1986. Optimal Price Dynamics: Speculation and Search
under Inflation. Ph.D. thesis, Massachusetts Institute of Technology.

Blackstone, E. 1975. Restrictive Practices in the Marketing of
Electrofax Copying Machines and Supplies: The SCM Corporation Case.
Journal of Industrial Economics 23: 189-202.

Borch, K. 1968. The Economics of Uncertainty. Princeton University
Press.

Borenstein, S. 1985. Price Discrimination in Free-Entry Markets. Rand
Journal of Economics 16: 380-397.

Bowman, W. 1957. Tying Arrangements and the Leverage Problem.
Yale Law Journal 67: 19-36.

Brown, S., and D. Sibley. 1986. The Theory of Public Utility Pricing.
Cambridge University Press.

Burdett, K., and K. Judd. 1983. Equilibrium Price Dispersion.
Econometrica 51: 955-990.

Burstein, M. 1960a. The Economics of Tie-in Sales. Review of Economics
and Statistics 42: 68-73.

Burstein, M. 1960b. A Theory of Full-Line Forcing. Northwestern
University Law Review 55: 62-95.

Butters, G. 1977. Equilibrium Distributions of Sales and Advertising
Prices. Review of Economic Studies 44: 465-491.

Chae, S. 1987. Subscription Television. Mimeo, Rice University.

Champsaur, P., and J.-C. Rochet. 1986. Multiproduct Duopolists.
Mimeo, ENSAE.

Chiang, R., and C. Spatt. 1982. Imperfect Price Discrimination and
Welfare. Review of Economic Studies 49: 155-181.

Crandall, R. 1968. The Decline of the Franchised Dealer in the
Automobile Repair Market. Journal of Business 43: 19-30.

Debreu, G. 1959. The Theory of Value. New York: Wiley.

Diamond, P. 1971. A Model of Price Adjustment. Journal of Economic
Theory 3: 156-168.

Diamond, P. 1987. Consumer Differences and Prices in a Search
Model. Quarterly Journal of Economics 102: 429-436.

Dupuit, J. 1849. On Tolls and Transport Charges. Translated in
International Economic Papers (London: Macmillan, 1952); original version
in Annales des Ponts et Chaussées 17.


Chapter 3 


Ekelund, R. 1970. Price Discrimination and Product Differentiation in
Economic Theory: An Early Analysis. Quarterly Journal of Economics
84: 268-278.

Engers, M. 1987. Signalling with Many Signals. Econometrica 55:
663-674.

Goldman, M., H. Leland, and D. Sibley. 1984. Optimal Nonuniform
Pricing. Review of Economic Studies 51: 305-320.

Guesnerie, R., and J.-J. Laffont. 1984. A Complete Solution to a Class
of Principal-Agent Problems with an Application to the Control of a
Self-Managed Firm. Journal of Public Economics 25: 329-369.

Hausman, J., and J. Mackie-Mason. 1986. Price Discrimination and
Patent Policy. Working paper, University of Michigan.

Joskow, P. 1985. Mixing Regulatory and Antitrust Policies in the
Electric Power Industry: The Price Squeeze and Retail Market
Competition. In Antitrust and Regulation: Essays in Memory of John J.
McGowan, ed. F. Fisher. Cambridge, Mass.: MIT Press.

Katz, M. 1983. Nonuniform Pricing, Output and Welfare under
Monopoly. Review of Economic Studies 50: 37-56.

Katz, M. 1984a. Firm-Specific Differentiation and Competition among
Multiproduct Firms. Journal of Business 57: 149.

Katz, M. 1984b. Price Discrimination and Monopolistic Competition.
Econometrica 52: 1453-1471.

Katz, M. 1987. The Welfare Effects of Third Degree Price Discrimination
in Intermediate Goods Markets. American Economic Review 77:
154-167.

Kohlleppel, L. 1983. Multidimensional “Market Signalling.” Mimeo,
Universität Bonn.

Klein, B., and L. Saft. 1984. Tie-in Contracts as Franchising Quality
Control Mechanisms. Mimeo, University of California, Los Angeles.

Laffont, J.-J., and J. Tirole. 1986. Using Cost Observation to Regulate
Firms. Journal of Political Economy 94: 614-641.

Laffont, J.-J., E. Maskin, and J.-C. Rochet. 1982. Optimal Nonlinear
Pricing with Two-Dimensional Characteristics. Mimeo.

Lewbel, A. 1985. Bundling of Substitutes or Complements.
International Journal of Industrial Organization 3: 101-108.

McAfee, P., J. McMillan, and M. Whinston. 1987. Multiproduct
Monopoly, Commodity Bundling and Correlation of Values. Discussion
Paper 1296, HIER, Harvard University.

Maskin, E., and J. Riley. 1980. Auctioning an Indivisible Object.
Working Paper 87D, Kennedy School of Government, Harvard
University.

Maskin, E., and J. Riley. 1984. Monopoly with Incomplete Information.
Rand Journal of Economics 15: 171-196.

Matthews, S., and J. Moore. 1987. Monopoly Provision of Quality
and Warranties: An Exploration in the Theory of Multidimensional
Screening. Econometrica 55: 441-468.




Milgrom, P., and R. Weber. 1982. A Theory of Auctions and
Competitive Bidding. Econometrica 50: 1089-1122.

Mirman, L., and D. Sibley. 1980. Optimal Nonlinear Prices for
Multiproduct Monopolies. Bell Journal of Economics 11: 659-670.

Mirrlees, J. 1971. An Exploration in the Theory of Optimum Income
Taxation. Review of Economic Studies 38: 175-208.

Mussa, M., and S. Rosen. 1978. Monopoly and Product Quality.
Journal of Economic Theory 18: 301-317.

Myerson, R. 1979. Optimal Auction Design. Mathematics of Operations
Research 6: 58-73.

Oi, W. Y. 1971. A Disneyland Dilemma: Two-Part Tariffs for a Mickey
Mouse Monopoly. Quarterly Journal of Economics 85: 77-90.

O'Keefe, M. 1981. Quality and Price Discrimination. Ph.D. thesis,
Harvard University.

Ordover, J., and J. Panzar. 1980. On the Nonexistence of Pareto
Superior Outlay Schedules. Bell Journal of Economics 11: 351-354.

Oren, S., S. Smith, and R. Wilson. 1982. Linear Tariffs with Quality
Discrimination. Bell Journal of Economics 13: 455-471.

Oren, S., S. Smith, and R. Wilson. 1983. Competitive Nonlinear Tariffs.
Journal of Economic Theory 29: 49-71.

Oren, S., S. Smith, and R. Wilson. 1984. Pricing a Product Line. Journal
of Business 57, no. 1: S73-S100.

Oren, S., S. Smith, and R. Wilson. 1985. Capacity Pricing. Econometrica
53: 545-566.

Panzar, J., and D. Sibley. 1978. Public Utility Pricing under Risk: The
Case of Self Rationing. American Economic Review 68: 888-895.

Perry, M. 1978. Price Discrimination and Forward Integration. Bell
Journal of Economics 9: 209-217.

Phlips, L. 1983. The Economics of Price Discrimination. Cambridge
University Press.

Pigou, A. C. 1920. The Economics of Welfare, fourth edition. London:
Macmillan.

Png, I., and J. Hirshleifer. 1987. Price Discrimination through Offers
to Match Price. Journal of Business 60: 363-384.

Posner, R. 1976. The Robinson-Patman Act: Federal Regulation of Price
Differences. City: American Enterprise Institute.

Quinzii, M., and J.-C. Rochet. 1985. Multidimensional Signalling.
Journal of Mathematical Economics 14: 261-284.

Reinganum, J. 1979. A Simple Model of Equilibrium Price. Journal of
Political Economy 87: 851-858.

Reitman, D. 1986. Competition in Congested Markets. Ph.D.
dissertation, Graduate School of Business, Stanford University.

Riley, J., and W. Samuelson. 1981. Optimal Auctions. American
Economic Review 71: 381-392.


Price Discrimination 


Rob, R. 1985. Equilibrium Price Distributions. Review of Economic
Studies 52: 487-504.

Roberts, K. 1979. Welfare Implications of Nonlinear Prices. Economic
Journal 89: 66-83.

Robinson, J. 1933. Economics of Imperfect Competition. London:
Macmillan.

Rosenthal, R. 1980. A Model in which Increase in the Number of
Sellers Leads to a Higher Price. Econometrica 40: 1575-1579.

Rothschild, M., and J. Stiglitz. 1976. Equilibrium in Competitive
Insurance Markets: An Essay on the Economics of Imperfect
Information. Quarterly Journal of Economics 90: 629-650.

Salop, S. 1977. The Noisy Monopolist: Imperfect Information, Price
Dispersion, and Price Discrimination. Review of Economic Studies 44:
393-406.

Salop, S., and J. Stiglitz. 1982. A Theory of Sales: A Simple Model of
Equilibrium Price Dispersion with Identical Agents. American Economic
Review 72: 1121-1130.

Sappington, D. 1982. Optimal Regulation of Research and Development
under Imperfect Information. Bell Journal of Economics 13:
354-368.

Schmalensee, R. 1981. Output and Welfare Implications of
Monopolistic Third-Degree Price Discrimination. American Economic Review
71: 242-247.

Schmalensee, R. 1982a. Monopolistic Two-Part Pricing Arrangements.
Bell Journal of Economics 12: 445-466.

Schmalensee, R. 1982b. Commodity Bundling by a Single Product
Monopolist. Journal of Law and Economics 25: 67-71.

Schmalensee, R. 1984. Gaussian Demand and Commodity Bundling.
Journal of Business 57: S211-S230.

Seade, J. 1977. On the Shape of Optimal Tax Schedules. Journal of
Public Economics 7: 203-235.

Shilony, Y. 1977. Mixed Pricing in Oligopoly. Journal of Economic
Theory 14: 373-388.

Spence, A. 1977. Nonlinear Prices and Welfare. Journal of Public
Economics 8: 1-18.

Spulber, D. 1981. Spatial Nonlinear Pricing. American Economic Review
71: 923-933.

Stigler, G. 1963. A Note on Block Booking. Supreme Court Review.
Reprinted in The Organization of Industry (University of Chicago Press,
1983).

Stiglitz, J. 1977. Monopoly, Nonlinear Pricing, and Imperfect
Information: The Insurance Market. Review of Economic Studies 44: 407-430.

Stiglitz, J. 1985. Competitivity and the Number of Firms in a Market:
Are Duopolies More Competitive than Atomistic Markets? Technical
Report 478, IMSSS, Stanford University.




Telser, L. 1979. A Theory of Monopoly of Complementary Goods.
Journal of Business 52: 211-230.

Thisse, J.-F., and X. Vives. 1986. On the Strategic Choice of Spatial
Price Policy. Mimeo, University of Pennsylvania.

Varian, H. 1980. A Model of Sales. American Economic Review 70:
651-659.

Varian, H. 1985. Price Discrimination and Social Welfare. American
Economic Review 75: 870-875.

Varian, H. 1987. Price Discrimination. In Handbook of Industrial
Organization, ed. R. Schmalensee and R. Willig (Amsterdam:
North-Holland, forthcoming).

von zur Muehlen, P. 1980. Monopolistic Competition and Sequential
Search. Journal of Economic Dynamics and Control 2: 257-281.

Willig, R. 1978. Pareto-Superior Nonlinear Outlay Schedule. Bell
Journal of Economics 9: 56-69.

Wilson, C. 1977. A Model of Insurance Markets with Incomplete
Information. Journal of Economic Theory 16: 167-207.

Wilson, R. 1985. Economic Theories of Price Discrimination and Product
Differentiation: A Survey. Parts I, II, III. Mimeo, Graduate School of
Business, Stanford University.

Wilson, R. 1986. Efficient and Competitive Rationing via Priority
Service. Mimeo, Graduate School of Business, Stanford University.




Vertical Control 


In chapters 1 through 3 we assumed that the firm
under discussion supplied final consumers directly, and we 
focused on monopoly pricing and product selection. We 
will now study the relationships between an “upstream 
firm” possessing monopoly power in an intermediate- 
good market and the users of that good: the “downstream 
firms.” In these situations, the monopolist does not supply
the final users himself. Examples of downstream firms
include manufacturing or service companies using an in- 
termediate input, wholesalers, and retailers (in the latter 
two cases, the final good is often close or identical to 
the intermediate one). Because downstream firms are cus- 
tomers of the upstream firms, many of the features 
studied previously are also relevant to the study of ver- 
tical relationships. For instance, the upstream firm may 
want to discriminate among the downstream firms, either 
with regard to the sectors or areas they serve or with 
regard to their cost structures. 

However, vertical relationships among firms are often 
much richer and more complex than those between a firm 
and its consumers. Ordinary consumers often just con- 
sume the good, but industrial consumers (downstream 
firms) transform the good and/or market it. In other 
words, some further decisions (technological use, deter- 
mination of final price, promotional effort, and so forth) 
are made after the intermediate good is sold by the up- 
stream firm. Because these decisions affect its profit, the 
upstream firm has an incentive to control them. Beyond 
the pricing policy and the product specification for its 
good, it will exert further vertical control on downstream 
operations to the extent that such control is feasible. For 
instance, it may fix the final (retail) price for the good, 
delineate the area of distribution of each retailer, or im- 
pose tie-in purchases of other goods. 

Vertical control was touched on in the chapter on the 
theory of the firm. In this chapter, we abstract from 
the issues of transaction costs, incomplete contracts, and 
ownership, and other issues that are crucial even in a 
world of perfect product-market competition. We focus 


on the monopoly reasons for vertical control; that is, 
on the factors that create an incentive for vertical con- 
trol only when the market for the intermediate good is 
noncompetitive. 

A school of thought sometimes associated with the 
University of Chicago holds that there is no monopoly 
reason for vertical control, and that observed vertical 
controls are meant only to improve the efficiency of real- 
world vertical relationships and not to exercise monopoly 
power on the intermediate-good market. A firm that has 
monopoly power in its market, so runs the argument in its 
simplest form, can always exhaust this monopoly power 
by raising the linear price it charges its customers. Hence, 
vertical control is purely an internal, welfare-enhancing 
matter within the vertical structure (i.e., it does not hurt
third parties, such as consumers). We will consider the 
validity of this argument below. 

To conform to the industrial-organization tradition, we 
will say that the upstream firm is vertically integrated if it 
controls (directly or indirectly) all the decisions made by 
the vertical structure. The “vertically integrated profit” is
the maximum aggregate (manufacturer's plus retailers’) 
profit that the vertical structure can obtain—that is, the 
aggregate profit that the structure would get if all the 
decision variables were costless to observe, verify, and 
specify in the contract. The vertically integrated solution 
is a particularly useful benchmark because it demonstrates 
which decisions the monopolist would like the down- 
stream firm to make concerning matters he cannot directly 
control. Readers who have skipped the chapter on the 
theory of the firm are warned that the term vertical inte- 
gration as used in this chapter may be misleading. On the 
one hand, a monopolist who absorbs the downstream 
firms (the usual meaning of vertical integration) may not 
be able to exert total control over these firms (because 
decisions must be delegated within a vertical structure, be 
it integrated or not). On the other hand, full vertical 
control may sometimes be achieved in the absence of 
vertical integration through adequate contracts specifying 
“vertical restraints,” as we will see below. 

The chapter proceeds as follows. Section 4.1 defines a 
basic vertical framework, which is used to introduce the 
most commonly observed vertical restraints. Section 4.2 
then presents the control problem. The need for control is
traced to the existence of externalities between down- 
stream firms and the upstream firms, or among down- 
stream firms themselves. (The use of vertical control to 
implement price discrimination has been studied in chap- 
ter 3, and will not be developed here.) The emphasis here 
is on the way vertical restraints are used to cope with 
these externalities.¹ Sections 4.3 and 4.4 deal with
intra-brand and inter-brand competition. The supplementary
section treats the effect of uncertainty on vertical relation- 
ships and the virtues of downstream competition in pro- 
moting economic efficiency and offers a more general 
analysis of market foreclosure. 


4.1 Linear Prices versus Vertical Restraints 


Much of economic theory is concerned with the case 
of linear prices, in which the buyer pays the seller an 
amount proportional to the quantity bought. Vertical
relationships, however, often involve more complex
contracting arrangements, broadly named vertical restraints.
They range from simple nonlinear prices (for example, the 
imposition of franchise fees, as considered in chapter 3) to 
instruments that restrict intra-brand or inter-brand com- 
petition (such as exclusive territories and exclusive deal- 
ing). To introduce some common vertical restraints in a 
natural way, we start with a basic model, which we enrich
progressively. 


4.1.1 Basic Framework 


A single supplier, called the monopolist or the manufac- 
turer, produces an intermediate good at a constant unit 
cost, c. He is the only producer of this good, and he sells 
it to a single downstream firm, called the retailer. (The 
downstream firm could equally well be a wholesaler or an 
industrial user of the intermediate good.) The retailer re- 
sells the product; for simplicity, he has no retailing cost. 
Formally, after signing the contract, the retailer has a 
monopoly on a technology that transforms one unit of 
the intermediate good into one unit of the final good. 
$p_w$ denotes the wholesale (intermediate) price and p the
consumer (or retail or final) price. q denotes the quantity
bought by the retailer; it also denotes the final
consumption if the retailer does not throw away any of the
intermediate good. The consumers’ downward-sloping
demand function is denoted q = D(p). (We will later assume
that demand also depends on a promotional service s
exerted by the retailer: q = D(p, s).)

1. Sections 4.1 and 4.2 draw from Rey and Tirole 1986a. Blair and Kaserman
(1983) and Caves (1984) survey a different selection of topics.

Some of the most common forms of contracting be- 
tween manufacturers and retailers are defined as follows: 

A linear price is a contract specifying only a payment,
$T(q) = p_w q$, from the retailer to the manufacturer. q is the
retailer's choice (see figure 4.1).

A franchise fee, A, gives rise to the simplest example of 
a nonlinear price (or payment function). The retailer then 
pays $T(q) = A + p_w q$. As in chapter 3, the manufacturer
may more generally be able to charge a more complicated 
(fully nonlinear, i.e. non-affine) payment. 

Resale-price maintenance (RPM) is a provision in the
contract dictating the choice of the final price, p, to the
retailer. Variants of this restraint are a price ceiling
($p \le \bar{p}$) and a price floor ($p \ge \underline{p}$). (RPM is thus a
price ceiling plus a price floor, such that $\bar{p} = \underline{p}$.)

Quantity fixing specifies the amount, q, to be bought by
the retailer. Variants of this restraint are quantity forcing
($q \ge \underline{q}$) and quantity rationing ($q \le \bar{q}$). If demand is
known and depends on the final price only, and if the retailer
cannot throw the good away, quantity forcing is equivalent
to a price ceiling and quantity rationing to a price
floor (and quantity fixing to RPM).
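As a hedged illustration of the definitions above, the sketch below writes the linear and franchise-fee payments as functions and checks, on a grid of retail prices, that a price ceiling and a quantity-forcing clause select the same retail choices when demand is known and nothing is thrown away. The linear demand D(p) = 1 - p, the function names, and the numerical values are illustrative assumptions, not part of the text.

```python
from fractions import Fraction as F


def demand(p):
    """Assumed demand curve D(p) = 1 - p (illustrative only)."""
    return 1 - p


def linear_payment(q, p_w):
    """Linear price: T(q) = p_w * q."""
    return p_w * q


def franchise_payment(q, p_w, fee):
    """Franchise fee plus wholesale price: T(q) = A + p_w * q."""
    return fee + p_w * q


def satisfies_price_ceiling(p, p_bar):
    """Price ceiling: the retail price may not exceed p_bar."""
    return p <= p_bar


def satisfies_quantity_forcing(p, q_floor):
    """Quantity forcing: the retailer must buy (and sell) at least q_floor.

    Because the good cannot be thrown away, the quantity bought is D(p).
    """
    return demand(p) >= q_floor


p_bar = F(3, 5)
q_floor = demand(p_bar)  # the forced quantity that mirrors the ceiling
grid = [F(i, 20) for i in range(21)]
equivalent = all(
    satisfies_price_ceiling(p, p_bar) == satisfies_quantity_forcing(p, q_floor)
    for p in grid
)
print(equivalent)
```

The equivalence relies on demand being strictly decreasing and deterministic; with uncertain demand, or with demand that also depends on promotional effort, the two clauses would no longer constrain the same choices.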


Why have industrial-organization theorists focused on 
such primitive restraints, and when can these restraints be 
imposed? The most obvious cause of the focus is that 
these restraints are simple and commonly used. But also, 
they may not be as primitive as they look in the environ- 
ments in which they have been studied. 

Figure 4.1

Consider first a deterministic environment (see section
4.2 for more details). The manufacturer's concern
is to ensure that the retailer picks the “right actions”
(for instance, final price or promotional effort), which,
because of the absence of uncertainty, are known. The
retailer's decision is, as we will see, generally dictated
by the marginal price he pays for the intermediate good.
In a deterministic environment, however, the amount of
the intermediate good consumed, and thus its marginal
price, can be foreseen perfectly. Thus, there is no loss
in adopting a two-part tariff (i.e., a franchise fee plus a
fixed marginal price), at least if the retailer's objective
function is concave, so there is no
point in considering more complex nonlinear prices. This
vindicates the focus on franchise fees.
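A small numerical sketch may help fix ideas about why a franchise fee plus a fixed marginal price can suffice when nothing is uncertain. It assumes the basic framework of section 4.1.1 with an illustrative linear demand D(p) = 1 - p and unit cost c = 1/4; the closed-form prices come from the first-order conditions for that assumed demand, and all names are hypothetical.

```python
from fractions import Fraction as F


def demand(p):
    """Assumed demand curve D(p) = 1 - p (illustrative only)."""
    return 1 - p


def retail_price(p_w):
    """Retailer's best price for marginal input price p_w.

    Maximizes (p - p_w) * (1 - p), which gives p = (1 + p_w) / 2.
    """
    return (1 + p_w) / 2


def aggregate_profit(p, c):
    """Manufacturer's plus retailer's profit at retail price p."""
    return (p - c) * demand(p)


c = F(1, 4)

# Vertically integrated benchmark: choose p to maximize (p - c)(1 - p).
p_integrated = (1 + c) / 2
integrated = aggregate_profit(p_integrated, c)

# Two-part tariff: marginal price equal to cost, profit collected by the fee.
p_two_part = retail_price(c)
fee = aggregate_profit(p_two_part, c)

# Linear price only: the manufacturer maximizes (p_w - c) * D(retail_price(p_w)),
# which for this demand gives p_w = (1 + c) / 2.
p_w_linear = (1 + c) / 2
linear_total = aggregate_profit(retail_price(p_w_linear), c)

print(integrated, fee, linear_total)
```

With these assumed numbers the fee equals the integrated profit, while the aggregate profit under a linear price alone is lower, because the retailer adds his own margin on top of a wholesale price that already exceeds cost.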

The above justification of two-part tariffs does not
hold in a stochastic, asymmetric-information environment.
As is well known in regard to adverse selection²
and moral hazard,³ a constant marginal price is not generally
desirable. Thus, the manufacturer may wish to use
more complex nonlinear prices. However, arbitrage may 
prevent him from doing so (see chapter 3). Although it 
is easy to control the quantity bought directly by the 
retailer, it is much harder to observe the quantity he 
actually sells. Consequently, if there are several retailers 
(e.g., in different geographical markets), some retailers
may engage in “bootlegging” to other retailers, thereby 
preventing total price discrimination by the monopolist. 
The conventional result is that, with many arbitraging 
buyers, the upstream unit can only charge linear prices. 
In the present context, however, the manufacturer is gen- 
erally assumed to observe whether the retailer carries his 
product; thus, he can demand the payment of a franchise 
fee (as long as the courts confirm this right). Thus, two- 
part tariffs may be used despite arbitrage. 


2. See chapter 3 of the present book. See Baron and Myerson 1982, Sappington
1982, and Laffont and Tirole 1986 for examples in a regulatory context.

3. See, e.g., Holmstrom 1979, Shavell 1979, and Grossman and Hart 1983.

This brings us to a seemingly trivial but actually important
point: The set of vertical restraints that can be
used in practice depends on the informational environ- 
ment—i.e., on what can be observed and enforced by the 
manufacturer. (If the enforcement mechanism is associated 
with the legal system, the courts must also be able to 
verify the manufacturer's information.) Thus, for exam- 
ple, RPM is not possible if the retailer can give hidden 
discounts to his customers.⁴ Similarly, quantity fixing is
essentially meaningless in an environment in which the 
retailers engage in arbitrage. 


4.1.2 Intrabrand Competition 


Let us now introduce the possibility of competition among 
several retailers on the same market. The new type of 
restraint that can be used by the manufacturer is that of 
exclusive territories, which divides the final market among 
the retailers (figure 4.2). (A similar restraint is a limit on 
the density of retailers.) 

Territories can be understood in a spatial sense, but 
also (more broadly) in a market-segmentation sense (for 
instance, public versus private markets). The informa- 
tional requirements for such a restraint to be feasible are 
strong. For example, in the spatial interpretation of the
model, the manufacturer must be able to trace customers⁵
and to prove (in case of cheating) that the retailer was
aware of their place of origin (or, if he was not cheating,
that he was negligent in not obtaining that information).
Thus, exclusive territories are more commonly used when 
the downstream units are wholesalers. However, note 
that the allocation of a retail monopoly situation (in an 
isolated territory) serves the purpose of exclusive terri- 
tories. A similar remark can be made about refusals to 
deal. 


4. Discounts may affect nonmonetary (less observable) dimensions of exchange
between retailers and customers, as well. For instance, they can take the form
of extra services or free delivery. Also, even if discounts can be observed by the
manufacturer, such price controls may be prohibitively costly. Suppose that one
of the roles of the retailers is to analyze customers so as to price discriminate
among them. (The manufacturer knows only the distribution of tastes in the
population of customers.) A full control of the retailer's pricing policy then
requires knowing, ex post, the whole distribution of prices that he charged, which
is very costly for the manufacturer or a court to assess. (In other words,
monitoring costs cannot be saved by inspecting randomly.)


5. Methods that can be employed to do so include the use of private investigators
and the existence of warranty cards or discount coupons that must be sent

Figure 4.2
Exclusive territories.

4.1.3 Several Inputs 


Let us assume that the downstream unit uses several 
inputs to produce the final good. Here, the downstream 
unit can be a producer. It can also be a retailer who sells 
complementary products to the customer. A new restraint 
specific to this feature is a tie-in, in which one of the 
input suppliers forces the downstream unit to purchase 
the other inputs from him. (To be precise, we should 
distinguish between “bundling,” which fixes the quan- 
tities of other inputs per unit of manufacturer input, and 
“requirements contracting,” in which the manufacturer 
simply requires that the retailer buy the other inputs 
from him. The distinction matters under uncertainty.) The
intermediate products are thus tied. In particular, the supplier can
charge prices for the other inputs that differ from their
market prices.⁶

The manufacturer can use another instrument if the 
retailer's level of sales is observable and verifiable: He can 
impose a payment, called a royalty, proportional to the 
number of units sold downstream.⁷


back to the manufacturer with the customer's address written in. The same
methods may be used to enforce RPM.


6. Making a product incompatible with complementary products manufactured
by other firms (but not with one’s own complementary products, of course) is
similar to a tie-in.


7. A royalty could be imposed in the single-input case. However, this instrument
is redundant with the wholesale price if, as we have assumed here, one
unit of input is transformed into one unit of output. If $p_w$ and r denote the
wholesale price and the royalty rate, the effective marginal cost for the retailer
is $p_w + r$. Hence, anything that can be done with a royalty rate can also be done
without a royalty but with a higher wholesale price. In general, nonlinear
royalty payments are superior to linear ones, as was suggested in chapter 3.
Figure 4.3
Example of a tie-in 


4.1.4 Interbrand Competition 


The retailer may sell goods that are close substitutes for 
the good supplied by the manufacturer. The manufacturer 
may then impose exclusive dealing on the retailer, which 
prevents him from selling goods that compete directly 
with the manufacturer's product.⁸


This does not exhaust the list of possible contractual 
provisions between manufacturer and retailer, which are 
often dependent on the environment. For instance, if the 
manufacturer is in charge of national advertising for the 
product, the contract may include a provision concerning 
the expenses of the advertising. 


4.1.5 Legal Status of Restraints 


A few words about the always-changing legal status of
these restraints in the United States are necessary. Roughly,
franchise fees are legal; indeed, their legality might
lead one to reserve the term vertical restraint for other 
restraints. RPM is currently illegal per se. Exclusive terri- 
tories, after having been forbidden per se, are now judged 
according to a rule of reason. Tie-ins are, in principle, 
illegal per se, but their actual status is closer to being 
determined by a rule of reason. 


8. Another (and a very different) degree of freedom concerning interbrand 
competition is the length of contracts, or the level of penalties for breach of 
contract. See the supplementary section. 


9. They directly affect only “internal” transfers. They may indirectly affect
targets through incentives, but this is irrelevant to the present classification.


10. Furthermore, this assumption is innocuous as long as the contract can
specify a franchise fee and there is no uncertainty. To see this, define a
“constrained efficient contract” as a contract that maximizes the aggregate profit
subject to the incentive constraints (decentralization of those decision variables
that are targets, but not of instruments). In other words, a “constrained efficient
contract” yields the maximum feasible aggregate profit (which, if the set of
4.2 Externalities and Vertical Control
4.2.1 Methodology 


The vertical structure, as a whole, determines a number of 
(possibly dependent) decision variables: wholesale price, 
franchise fee, quantity purchased by the retailer, consumer 
price, promotional effort, retail location, and so forth. In
practice, only a few of these variables are observable and 
verifiable (in the sense of section 4.1); these variables are 
called instruments. They can serve as the basis for the
monetary transfers in the contract between the manufac- 
turer and the retailer(s). Next we define targets. To this 
purpose, let us call the sum of the manufacturer’s and the 
retailers’ profits the aggregate profit. The targets form an- 
other subset of decision variables, which are those direct- 
ly affecting the aggregate profit. The promotional effort 
and the retail price are targets. The franchise fee and 
the wholesale price are not targets, because they do not 
directly affect the aggregate profit.⁹ The control problem
consists in knowing how to use the instruments to
reach, or come close to, the desired values of the targets,
that is, the values that maximize the vertical structure's
aggregate (vertically integrated) profit. The literature
actually often looks at situations where there are
“enough” instruments to obtain the vertically integrated 
profit; Mathewson and Winter (1984, 1986) say that the 
set of instruments is then sufficient.

In what follows we will, for simplicity, assume that the 
manufacturer chooses the contract. The retailers accept 
the contract only if it guarantees them at least what they 
would get by refusing it. We normalize this “outside 
opportunity” to be zero; thus the retailers accept the 
contract only if it gives them a non-negative profit. The 
assumption that the manufacturer chooses the contract 
makes sense when there exists a competitive supply of
potential retailers.¹⁰


instruments is sufficient, coincides with the vertically integrated profit). The
bargaining between the manufacturer and the retailers over the contract can
then be decomposed into two steps: the design of a constrained efficient
contract (which yields the greatest possible “pie”) and the division of the pie
through the franchise fee. Because the franchise fee is a lump-sum transfer, it
does not interact with the determination of the targets. Thus, even if we have
little information about the parties’ relative bargaining power (which determines
the franchise fee), we can characterize the optimal contract (up to the
franchise component). In the absence of franchise fees, the maximum feasible
aggregate profit may not be reached, and the complete outcome (and not only
the division of profits) is, in general, sensitive to the parties’ bargaining power.
4.2.2 The Basic Vertical Externality


In chapter 1 we saw that under linear pricing a monopolist
charges a price above marginal cost. Doing so allows
him to realize a positive profit margin (at the expense of
a contraction in demand). Similarly, in the context of a
vertical structure, the monopoly producer of an intermediate
good who uses linear pricing charges $p_w > c$. The
retailers then face a marginal cost for their input (the
intermediate good) equal to $p_w$, and they make their
pricing, promotional, and technological decisions on this
basis. The vertical externality is that any decision made
by a retailer that increases his demand for the intermediate
good by one unit generates an incremental profit
of $p_w - c$ for the manufacturer. However, the retailer,
who maximizes his own profit, does not take the manufacturer's
incremental profit into account, and therefore
tends to make decisions that lead to too low a consumption
of the intermediate good. The problem is that the retailer's
cost ($p_w$) for the good differs from the vertical
structure's ($c$). The aggregate profit is then lower than the
vertically integrated one, which gives the manufacturer
an incentive to impose vertical restraints that eliminate
this externality.

Three famous illustrations of this basic externality in 
the simple manufacturer-and-retailer context deal with the 
downstream unit's choice of price, promotional effort, and 
production technology respectively. 


Example 1: Double Marginalization (Spengler 1950) 


Suppose that the retailer's only decision (and, therefore,
the vertical structure's only target) is the retail price. The
vertically integrated quantity $q^m$ and retail price $p^m$ are
determined by

$$q^m = D(p^m),$$

and $p^m$ maximizes $(p - c)D(p)$, where $D(\cdot)$ is the demand
curve.

Consider the decentralized structure and the retailer's
choice of the consumer price $p$ under a linear wholesale
tariff $T(q) = p_w q$. Assume that the manufacturer chooses
the linear tariff first and the retailer chooses the consumer
price second, and that the retailer is himself a monopolist
in his retail market.

The retailer maximizes his own profit, $(p - p_w)D(p)$. In
chapter 1 we saw that the monopoly price is an increasing
function of marginal cost. Because the retailer is a monopolist
and because his marginal cost is equal to $p_w$, we
have $p > p^m$ as long as the manufacturer charges above
the marginal cost ($p_w > c$). The retail price is higher in
the decentralized structure than in the integrated one,
because of two successive mark-ups (marginalizations). As
mentioned earlier, the externality arises because the retailer
does not take the manufacturer's marginal profit,
$(p_w - c)D'(p)$, into account when choosing a retail price.

[Figure 4.4: Double marginalization. The manufacturer charges the retailer $p_w > c$; the retailer charges consumers $p = p^m(p_w) > p^m(c)$.]

To see this, assume that the final-demand function is
$D(p) = 1 - p$ and that $c < 1$. Let $\Pi_m$ and $\Pi_r$ denote the
manufacturer's and the retailer's profits. First determine
the equilibrium for the nonintegrated industry. The retailer
solves

$$\max_p \; [(p - p_w)(1 - p)],$$

from which it follows that

$$p = \frac{1 + p_w}{2}.$$

Then, the demand for the final good (and, therefore, that
for the intermediate good) is

$$q = 1 - p = \frac{1 - p_w}{2}.$$

The manufacturer solves


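$$\max_{p_w} \left[ (p_w - c)\,\frac{1 - p_w}{2} \right],$$

from which it follows that

$$p_w = \frac{1 + c}{2}.$$

Notice that

$$\Pi_m = \frac{(1 - c)^2}{8}, \qquad \Pi_r = \frac{(1 - c)^2}{16},$$

$$\Pi^{ni} = \Pi_m + \Pi_r = \frac{3}{16}(1 - c)^2,$$

and that

$$p = \frac{3 + c}{4}.$$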

Now consider the integrated industry, which pays $c$ per
unit for its input. It maximizes

$$\max_p \; [(p - c)(1 - p)],$$

from which it follows that

$$p = \frac{1 + c}{2}.$$

Then the total profit is

$$\Pi^{i} = \frac{(1 - c)^2}{4} > \Pi^{ni}.$$

Therefore, the integrated industry makes more profit than
the nonintegrated industry, and the consumer price is
lower in the case of the integrated industry. These two
properties are very general, as we have seen. The objective
of vertical integration is to avoid the double price
distortion that occurs when each firm adds its own price-cost
margin at each stage of production. (“What is worse
than a monopoly? A chain of monopolies.”)
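
The comparison between the nonintegrated and the integrated industry can be checked numerically. The following Python sketch is only an illustration under stated assumptions: it uses the linear demand $D(p) = 1 - p$ of the text, picks $c = 1/5$ arbitrarily, takes the retailer's reaction $p = (1 + p_w)/2$ from the first-order condition above, and searches a price grid for the manufacturer's and the integrated firm's optimal prices. The function names and the grid are hypothetical choices of this sketch.

```python
from fractions import Fraction

# Candidate prices in [0, 1]; the grid spacing is an assumption of this sketch.
GRID = [Fraction(i, 400) for i in range(401)]

def demand(p):
    """Linear final demand D(p) = 1 - p, as in the text."""
    return 1 - p

def retailer_price(p_w):
    """Retailer's reaction, from the first-order condition of max (p - p_w)(1 - p)."""
    return (1 + p_w) / 2

def nonintegrated(c):
    """Manufacturer moves first and anticipates the retailer's reaction."""
    p_w = max(GRID, key=lambda w: (w - c) * demand(retailer_price(w)))
    p = retailer_price(p_w)
    q = demand(p)
    return p_w, p, (p_w - c) * q, (p - p_w) * q

def integrated(c):
    """Integrated firm pays c per unit and picks the retail price directly."""
    p = max(GRID, key=lambda price: (price - c) * demand(price))
    return p, (p - c) * demand(p)

c = Fraction(1, 5)
p_w, p_ni, profit_m, profit_r = nonintegrated(c)
p_i, profit_i = integrated(c)
print("nonintegrated:", p_w, p_ni, profit_m + profit_r)
print("integrated:   ", p_i, profit_i)
```

With exact rational arithmetic the two outcomes can be compared directly against the closed forms $3(1 - c)^2/16$ and $(1 - c)^2/4$.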


Exercise 4.1** Show that in the double-marginalization
set-up the ratio of the retailer's margin over the manufacturer's
margin,

$$\frac{p - p_w}{p_w - c},$$

is equal to (greater than, lower than) 1/2 if the demand
function is linear (convex, concave).¹¹ (Hint: Look at the
retailer's and the manufacturer's first-order conditions. To
obtain the sensitivity of the retail price to the wholesale
price, differentiate the retailer's first-order condition; use
this first-order condition again to reach the conclusion.)

[Figure 4.5]


If (as in figure 4.5) one of the two firms is competitive 
in the sense that it sells at marginal cost, then vertical 
integration does not increase the profit of the monopoly 
firm. The intuition behind this result is that the competi- 
tive sector does not introduce a price distortion. Thus, the 
monopoly sector does not exercise an externality on the 
competitive sector, whose price-cost margin is zero. (As 
we will see in example 3, the result does not hold if, for 
instance, the downstream competitive sector uses several 
inputs.) 


The double-marginalization (or chain-of-monopolies) 
problem is very similar to that of two monopoly producers 
of perfectly complementary goods. After all, production 
and retailing are complements, and consumers often con- 
sume both of them in fixed proportions. The following 
exercise shows, similarly, that the monopoly producers 
of complementary goods have an incentive to integrate 
(horizontally) in order to avoid double marginalization 
and an excessive demand contraction. 
11. This exercise follows Bresnahan and Reiss 1985. See that article for a test
of the relationship in the case of automobile dealerships.
Exercise 4.2** Two firms ($i = 1, 2$) produce one good
each, at marginal cost $c_i$ ($i = 1, 2$). Each firm has monopoly
power in the production of its good. The goods are
perfect complements. The demand curve is $q = D(p)$,
where $p = p_1 + p_2$ is the price of the composite good and
$p_i$ is the price of good $i$ ($i = 1, 2$). Let $c = c_1 + c_2$.

(i) Reinterpret the variables to show that the case of a
single good produced by a manufacturer and distributed
by a retailer fits into this model.

Assume that the elasticity of demand, $\varepsilon = -D'p/D$, is
constant in order to simplify computations.

(ii) What is the optimal $p$ for the horizontally integrated
structure?

(iii) Consider the nonintegrated structure. Suppose that
firm 1 chooses its price first and takes into account the
effect of its choice on firm 2's price. Show that the Lerner
index is higher than under integration. More precisely,
show that

$$p = \frac{c}{(1 - 1/\varepsilon)^2}.$$

(iv) Suppose now that the two firms choose their prices
simultaneously. Assume that each firm maximizes its profit
given the other firm's price; that is, a firm does not try
to influence the other firm's price (see chapter 5). Show
that the Lerner index is even higher than in the case of
sequential choice of prices. More precisely, show that

$$p = \frac{c}{1 - 2/\varepsilon}.$$

Interpret this result. (Hint: Start from the simultaneous-choice
equilibrium and change firm 1's price slightly.)
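
The ranking of the three prices in this exercise can be illustrated numerically. The sketch below is not part of the original exercise: it assumes the constant-elasticity demand $D(p) = p^{-\varepsilon}$ with $\varepsilon > 2$, uses the best response $p_i = (p_j + \varepsilon c_i)/(\varepsilon - 1)$ implied by firm $i$'s first-order condition, and all function names, the grid, and the parameter values are hypothetical.

```python
def best_response(p_other, c_own, eps):
    """Best response of a firm facing demand (p_own + p_other) ** (-eps).

    Follows from the first-order condition of
    max (p_own - c_own) * (p_own + p_other) ** (-eps).
    """
    return (p_other + eps * c_own) / (eps - 1)

def integrated_price(c, eps):
    """Monopoly price of the composite good: Lerner index equal to 1/eps."""
    return c / (1 - 1 / eps)

def simultaneous_price(c1, c2, eps, rounds=200):
    """Iterate best responses to approximate the simultaneous-move equilibrium."""
    p1, p2 = c1, c2
    for _ in range(rounds):
        p1 = best_response(p2, c1, eps)
        p2 = best_response(p1, c2, eps)
    return p1 + p2

def sequential_price(c1, c2, eps, steps=20000, p_max=10.0):
    """Firm 1 leads: grid search over p1, with firm 2 reacting optimally."""
    def leader_profit(p1):
        p2 = best_response(p1, c2, eps)
        return (p1 - c1) * (p1 + p2) ** (-eps)
    grid = [c1 + (p_max - c1) * i / steps for i in range(1, steps + 1)]
    p1 = max(grid, key=leader_profit)
    return p1 + best_response(p1, c2, eps)

eps, c1, c2 = 3.0, 0.5, 0.5
print(integrated_price(c1 + c2, eps))
print(sequential_price(c1, c2, eps))
print(simultaneous_price(c1, c2, eps))
```

For these illustrative values the composite price rises from the integrated case to the sequential case and again to the simultaneous case, in line with the formulas stated in parts (iii) and (iv).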


Sufficient Vertical Restraints 


Franchise fee The benefit of vertical integration is related
to the extreme simplicity of the linear-price contract.
Indeed, the manufacturer can realize the integrated profit
without integration by using a two-part tariff:
$T(q) = A + p_w q$. With a little thought it will be evident which
marginal price $p_w$ he ought to choose. Recall that the
issue with the linear price is that the downstream unit's
marginal cost is not equal to the vertical structure's marginal
cost. To eliminate this distortion, take $p_w = c$. The
downstream unit does not affect the upstream unit by its
choice of final price, so there is no externality. The retailer
maximizes

$$(p - c)D(p) - A$$

and thus chooses

$$p = p^m.$$

His profit is equal to

$$\Pi^m - A,$$

where $\Pi^m = (p^m - c)D(p^m)$. The manufacturer can then
appropriate the retailer's profit by imposing a franchise
fee equal to the vertical structure’s profit ($A = \Pi^m$).
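
As a rough check of this argument, the sketch below applies the two-part tariff $T(q) = A + p_w q$ with $p_w = c$ to the linear demand $D(p) = 1 - p$ used in example 1. The demand function, the value $c = 1/5$, the price grid, and the function names are assumptions made for illustration only.

```python
from fractions import Fraction

# Candidate retail prices in [0, 1]; the grid is an assumption of this sketch.
GRID = [Fraction(i, 400) for i in range(401)]

def demand(p):
    """Linear final demand D(p) = 1 - p (illustrative assumption)."""
    return 1 - p

def retailer_choice(p_w, fee):
    """Retail price maximizing (p - p_w) D(p) - fee, and the resulting profit."""
    p = max(GRID, key=lambda price: (price - p_w) * demand(price) - fee)
    return p, (p - p_w) * demand(p) - fee

def two_part_tariff(c):
    """Set p_w = c and A equal to the retailer's variable profit."""
    p_w = c
    _, variable_profit = retailer_choice(p_w, fee=0)
    fee = variable_profit                      # A = Pi^m
    p, retailer_profit = retailer_choice(p_w, fee)
    manufacturer_profit = fee + (p_w - c) * demand(p)
    return p, fee, retailer_profit, manufacturer_profit

print(two_part_tariff(Fraction(1, 5)))
```

Because the fee is a lump sum, it does not change the retailer's pricing decision; it only transfers the profit to the manufacturer.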

The idea of charging a marginal price equal to the
upstream marginal cost in order to avoid downstream
distortion (an idea already encountered in chapter 3) is
very general. This policy amounts to “selling the vertical
structure” (at price $A$) to the downstream monopolist,
who is made the “residual claimant” (the receiver of any
marginal profit). The downstream monopolist therefore
has all the incentive to make the “right decisions” (here,
to choose the monopoly price). The two-part tariff also
solves the externalities in promotional effort and input
choices considered below.


Drawbacks of franchise fees A franchise fee is a simple and
powerful instrument in this environment. However, in
more complex environments a franchise fee can also have
drawbacks. First, when the retailer is risk-averse and the
retail cost or the final demand is random, the retailer,
because he claims all the residual profits, bears too much
risk. A reduction in the franchise fee, together with an
increase in the wholesale price above marginal cost, gives
the retailer some insurance and is called for.¹² (See section
4.6 for an analysis of risk sharing.) Second, suppose that
at the contracting date the retailer possesses private infor- 
mation about the retail cost or the (local) final demand 
that the manufacturer does not have. Because the retailer's 
profit is not known to the manufacturer, it is difficult for 
him to tailor the franchise fee so as to appropriate the 


12. If the uncertainty is about final demand, the retailer clearly faces less risk
when the wholesale price increases, because his margin per unit sold at a given
price decreases with $p_w$. The reason this is true for retail-cost uncertainty is
slightly more subtle. An increase in the wholesale price leads to an increase in
the final price, that is, to a contraction in demand. As total retail cost is equal
to unit retail cost times demand, the variability of total retail cost is reduced for
a given variability in unit retail cost.
retailer's profit. The manufacturer must then use screening
devices similar to those analyzed in chapter 3. By
analogy, we can derive the optimal discriminatory policy
for the manufacturer: A low-retail-cost or a high-final-demand
retailer here corresponds to a high-demand consumer
in the theory of price discrimination; hence, the
optimal two-part tariff has a wholesale price higher than
the marginal cost ($p_w > c$) and a franchise fee equal to the
retailer's profit when the demand is low or the retail cost
is high.¹³ Third, when there are several retailers, a retailer
is not in general made the residual claimant for the
vertical structure by buying the input at marginal cost.
Franchise fees, in general, will not suffice to realize the
vertically integrated profit, as we will see later.


Resale-price maintenance Rather than using a franchise
fee, the manufacturer can sell the intermediate good at
price $p_w = p^m$ and then impose resale-price maintenance
at $p = p^m$. The retailer then makes a zero profit, and the
vertical structure's aggregate profit (which is equal to the
manufacturer's profit) equals $\Pi^m$. Hence, RPM is a sufficient
instrument here. Actually, a price ceiling ($p \le p^m$)
or, equivalently, quantity forcing ($q \ge q^m$) would
also allow the manufacturer to realize the integrated profit.
The more commonly encountered version of price
maintenance, a price floor, would not allow this (which
suggests that there are other explanations for price maintenance
than the double-marginalization problem).

Like the franchise system, RPM is not sufficient once 
one introduces uncertainty. For one thing, its insurance 
properties are poor when the retailer is risk-averse and 
faces retail-cost uncertainty; he is not able to pass retail- 
cost variations into the final price, so he (and not the 
manufacturer) bears all the risk of such variations. 


Welfare The welfare analysis of vertical integration (or, 
equivalently, of sufficient vertical restraints) is simple. The 
vertical structure (manufacturer plus retailer) makes more 


13. Conversely, if, at the contract date, the manufacturer has private information
about the aggregate demand for his product, he charges $p_w = c$ when the
demand is “low” and $p_w > c$ when the demand is “high.” The intuition is that
to “prove” that demand is high, the manufacturer accepts a cut in the franchise
fee in exchange for a higher wholesale price. (The Spence-Mirrlees condition
alluded to in chapter 2 says that, because a manufacturer with a high demand is
more interested in the variable profit than one facing a low demand, higher
demand can be signaled only through a higher wholesale price.) See the "fran- 


money under vertical integration than under a linear price 
because it realizes the vertical structure's monopoly pro- 
fit. Consumers are better off under vertical integration 
because they face a lower price. Thus, welfare is un- 
ambiguously increased by the elimination of the double 
marginalization. The same conclusion holds for the two 
additional illustrations of the basic vertical externality 
considered in examples 2 and 3 below. 


Example 2: Downstream Moral Hazard 


Retailers often provide services that make the manufac- 
turer's good more attractive to consumers: trading stamps, 
free alterations, free delivery, credit, pre-sale information, 
elaborate premises, excess sales help to keep waiting lines 
short, and so on. We can gather all these under the head- 
ing “promotional effort” or “services.” To the extent that 
promotional effort affects the demand for the good, the 
manufacturer wants to encourage the retailer to supply it. 
The simplest way to do so would be to specify the level 
of promotional services in the contract. But such a con- 
tract would generally not be enforceable, as courts (and 
even the parties) cannot measure such services precisely. 
Thus, incentives must be given to the retailer to overcome
the associated moral-hazard problem.¹⁴

Promotional services can be formalized as a real number,
$s$. As in chapter 2, $s$ is a parameter measuring the
good’s position in a vertical product space. The consumers’
demand is $q = D(p, s)$ (see section 2.1 for an
example of the derivation of such a demand function). $D$
decreases with $p$ and increases with $s$. Assume that supplying
a level $s$ of services costs the retailer $\Phi(s)$ per unit
of output, and that this cost can be observed only by the
retailer. $\Phi$ increases with $s$. The total service cost is thus
$q\Phi(s)$.

The vertically integrated consumer price ($p^m$) and services
($s^m$) maximize

$$[p - c - \Phi(s)]\,D(p, s).$$


chising game” in the supplementary section of the Game Theory’ User's Manual 
(a similar point is made by Gallini and Wright 1987).


14. Formally, the double-marginalization and input-substitution problems also
relate to moral hazard. For an example in which a promotional effort is equivalent
to (the opposite of) a second marginalization, see below. Here, we limit the
extent of moral hazard to the provision of effort and services.
Let

$$\Pi^m = [p^m - c - \Phi(s^m)]\,D(p^m, s^m).$$

In the decentralized structure, and for a linear price $p_w$,
the profits are

$$(p_w - c)\,D(p, s)$$

for the manufacturer and

$$[p - p_w - \Phi(s)]\,D(p, s)$$

for the retailer. The manufacturer charges price $p_w > c$ in
order to maximize his profit. The retailer then maximizes
his own profit with respect to $p$ and $s$. The distortion
in retail price (the second marginalization) is familiar
from example 1. The distortion in services resembles the
distortion in retail price: The retailer does not take into
account the extra profit for the manufacturer associated
with an increase in services,

$$(p_w - c)\,\frac{\partial D}{\partial s}.$$

Again, this is because the manufacturer's mark-up makes
the retailer's profit margin smaller than that of the vertical
structure. Thus, for any retail price, the retailer provides
too few services and so causes the demand to be too
small.
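
The claim that the retailer under-provides services at any given retail price can be illustrated with a small numerical sketch. The functional forms $D(p, s) = 1 - p + s$ and $\Phi(s) = s^2$, the parameter values, and the function name are hypothetical choices made for this illustration; they do not come from the text. At a fixed price $p$, maximizing $(p - k - s^2)(1 - p + s)$ over $s$, where $k$ is the decision maker's marginal cost of the good, gives the first-order condition $3s^2 + 2(1 - p)s - (p - k) = 0$.

```python
import math

def service_choice(p, marginal_cost):
    """Service level maximizing (p - marginal_cost - s**2) * (1 - p + s) at fixed p.

    Assumes D(p, s) = 1 - p + s and Phi(s) = s**2 (illustrative only).
    The first-order condition is 3 s^2 + b s - margin = 0 with b = 2 (1 - p);
    the objective is concave for s >= 0, so the positive root is the maximizer.
    """
    margin = p - marginal_cost
    b = 2 * (1 - p)
    return (-b + math.sqrt(b * b + 12 * margin)) / 6

p, c, p_w = 0.8, 0.2, 0.5
s_retailer = service_choice(p, p_w)     # retailer's perceived marginal cost is p_w
s_integrated = service_choice(p, c)     # vertical structure's marginal cost is c
print(s_retailer, s_integrated)
```

Because the retailer's margin $p - p_w - \Phi(s)$ is smaller than the vertical structure's margin $p - c - \Phi(s)$, the retailer's chosen service level is lower at the same retail price.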

The analogy between services and retail prices is not
fortuitous. As in chapter 2, it is useful to consider the
extreme case in which services are perfect substitutes for
price discounts (i.e., $q = D(p - s)$ and $\Phi(s) = s$), and
high prices thus correspond to low services. Of course,
the interesting and general case arises when services and
price discounts are imperfect substitutes.

To encourage more promotional effort and obtain the
vertically integrated profit, the manufacturer can (as above)
make the retailer a residual claimant by choosing $p_w = c$,
and can appropriate the retailer's residual profit through a
franchise fee of $A = \Pi^m$.

RPM alone is no longer sufficient. To realize the ver- 
tically integrated profit, the manufacturer should set the 


15. In the same way that the manufacturer may try to control the retail price
in order to eliminate the price externality, he may try to control services to
eliminate the service externality. Direct supervision of the provision of services
is often used as a complement to monetary incentives to induce a retailer to
provide these services. The manufacturer can then use termination of the
retail price at $p = p^m$. However, the service externality
remains.¹⁵


Exercise 4.3** Show that quantity forcing is a sufficient
instrument (i.e., that, together with quantity forcing, a
linear price yields the vertically integrated profit).


Bilateral moral hazard arises when the manufacturer 
provides services that are hard to measure precisely (brand 
advertising, product quality, and so forth). In such cases 
the manufacturer generally exerts an externality on the 
retailer, because his services affect the retailer's demand 
and profit. This externality would not matter if the re- 
tailer’s profit margin were zero; however, as we saw, the 
manufacturer may want to leave the retailer a positive 
profit margin in order to encourage him to supply promo- 
tional services. The vertical structure must then design an 
incentive scheme that alleviates the moral-hazard problem 
on both sides. For this purpose, a simple two-part tariff is 
no longer sufficient. As we saw, to induce the retailer to 
provide the correct amount of promotional effort, one 
must make him the residual claimant. The manufacturer's
profit margin, $p_w - c$, is then zero, which means that
the manufacturer has no incentive to expand demand
(and, thus, no incentive to supply services). Conversely,
if the manufacturer is made a residual claimant, the re- 
tailer has no incentive to supply promotional effort; more 
sophisticated schemes must then be designed. The follow- 
ing exercise shows that both parties can simultaneously 
be made residual claimants through the use of a third 
party (Holmström 1982). However, the corresponding
scheme is not immune to collusion between the retailer 
and the manufacturer. 


Exercise 4.4** Let the demand function be $q = D(p, s, S)$,
where $S$ denotes the services offered by the manufacturer.
($D$ increases with $S$.) To simplify, assume that, to supply
$S$, the manufacturer incurs a cost $\varphi(S)$ that is independent
of the quantity sold ($\varphi' > 0$). Let $(p^m, s^m, S^m)$ maximize
the vertically integrated profit,

$$[p - c - \Phi(s)]\,D(p, s, S) - \varphi(S).$$


contract as a threat against cuts in services. (In order for termination to impose
a cost on the retailer, it must be the case that the retailer enjoys a rent from his
relationship with the manufacturer, i.e., that he obtains more than his reservation
profit. On this, see the chapter on the theory of the firm.)
In the decentralized structure, $p$ and $s$ are chosen by the
retailer, and $S$ by the manufacturer. The manufacturer and
the retailer can contract jointly with a third party. This
third party, called a marginal source, is willing to sign any
contract that gives him a non-negative profit.

(i) Show that the vertically integrated profit can be
realized in the following way: The intermediate good is
transferred through the third party. The third party pays
the manufacturer according to a linear tariff $T_1(q) = p_1 q$,
with $p_1 = p^m - \Phi(s^m)$, and is paid by the retailer according
to a two-part tariff $T_2(q) = A + cq$. How would you
choose $A$?

(ii) Explain the term “marginal source.” Is the three-way
contract immune to a manufacturer-retailer coalition?


Example 3: Input Substitution


Now suppose that the downstream unit is an industry
that produces the final good from several inputs. To simplify,
assume that the downstream unit uses two inputs:
the manufacturer's good and a second intermediate good
produced competitively at cost (and sold at price) $c'$.
Aside from the final price, the downstream unit must
choose inputs $x$ and $x'$ to produce output $q = f(x, x')$. The
two inputs are substitutes in the production function.
Assume, further, that the technology exhibits constant
returns to scale ($f$ is homogeneous of degree 1). The
demand function is $q = D(p)$ (we abstract from promotional
effort, for simplicity). The vertically integrated profit
is

$$\Pi^m = \max_{x,\,x'} \left[ P(f(x, x'))\,f(x, x') - cx - c'x' \right],$$

where $P(\cdot) = D^{-1}(\cdot)$ is the inverse demand function. Let
$x^m$ and $x'^m$ denote the optimal inputs.

Consider the decentralized structure in which the manu- 
facturer of the first input x and (for the moment) the 
downstream industry have monopoly power. See figure 
4.6. (Obtaining the basic externality does not require the 
monopolization of the downstream industry when there 
are several inputs. The differences introduced by down- 
stream competition will be discussed below.) 

Under linear pricing, the monopoly manufacturer charges
a wholesale price $p_w > c$, while the competitive manufacturers
charge $p'_w = c'$ for the second input. Hence, the
relative price of inputs for the downstream unit,
$p_w/p'_w = p_w/c'$, exceeds their true relative price, $c/c'$, for the
vertical structure. The downstream unit thus substitutes
toward the second input and consumes too little of the
manufacturer's intermediate good (Vernon and Graham
1971; Schmalensee 1973; Warren-Boulton 1974). The
downstream unit, when substituting toward the second
input, does not take the upstream monopolist’s marginal
profit ($p_w - c$) into account (whereas it does not exert
any externality on the second input industry, whose margin
is zero).
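
The input distortion can be made concrete with a short sketch. It assumes a Cobb-Douglas technology $f(x, x') = x^{a}(x')^{1-a}$ (the constant $k$ is normalized to 1) and arbitrary illustrative prices; the function names and numbers are hypothetical and are not taken from the text. For this technology, cost minimization implies $x/x' = [a/(1-a)](p'_w/p_w)$.

```python
def input_demands_per_unit(p_x, p_xp, a):
    """Cost-minimizing inputs (x, x') for one unit of output, f = x**a * xp**(1 - a)."""
    ratio = (a / (1 - a)) * (p_xp / p_x)   # cost-minimizing x / x'
    xp = ratio ** (-a)                     # solves (ratio * xp)**a * xp**(1 - a) = 1
    x = ratio * xp
    return x, xp

def resource_cost(x, xp, c, c_prime):
    """True cost of the input bundle to the vertical structure."""
    return c * x + c_prime * xp

a, c, c_prime = 0.5, 1.0, 1.0
p_w = 4.0                                  # illustrative wholesale price above c

x_eff, xp_eff = input_demands_per_unit(c, c_prime, a)        # inputs priced at true costs
x_dist, xp_dist = input_demands_per_unit(p_w, c_prime, a)    # first input priced at p_w

print(x_eff / xp_eff, x_dist / xp_dist)
print(resource_cost(x_eff, xp_eff, c, c_prime),
      resource_cost(x_dist, xp_dist, c, c_prime))
```

When the first input is priced above its cost, the downstream unit uses less of it per unit of output, and the true resource cost of producing that unit rises.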

Of course, the input-substitution problem arises only
when substitution is possible. It cannot occur in a fixed-proportion
situation, such as the consumption of left and right shoes
or of bolts and nuts. (As Bowman [1957] notes, “a monopoly
of bolts if nuts are competitive is as good as a
monopoly of bolts and nuts.”)

To realize the vertically integrated profit, the upstream
monopolist need not integrate toward the second input
industry, because the latter exerts no externality (it contents
itself with selling at marginal cost). Thus, it suffices
to integrate vertically, as shown in figure 4.7.

[Figure 4.7: Vertical integration with several inputs.]

Alternatively, several vertical restraints can be substituted for
vertical integration:


Franchise fee The monopolist can make the downstream
unit the residual claimant:

$$p_w = c,$$

$$A = P(f(x^m, x'^m))\,f(x^m, x'^m) - c x^m - c' x'^m.$$

As in examples 1 and 2, vertical control increases welfare.
The downstream monopolist pays a lower marginal price
for the first input and therefore charges a lower price to
consumers. Furthermore, the input mix is now efficient.


Tie-in together with RPM As was shown by Blair and
Kaserman (1978), tie-ins have a very desirable property in
the presence of input substitution: They allow the relative
price of inputs to be the “correct” one. To see this, let the
upstream monopolist force the downstream unit to buy
the second intermediate good from him at price $p'_w$ (figure
4.8). Suppose that the upstream monopolist chooses
prices for the intermediate goods proportional to their
marginal cost:

$$\frac{p_w}{p'_w} = \frac{c}{c'}. \qquad (4.1)$$


The downstream unit minimizes cost over the two inputs.
A well-known condition for cost minimization is that the
marginal rate of substitution between inputs be equal to
the ratio of their prices. Hence,

$$\frac{\partial f/\partial x}{\partial f/\partial x'} = \frac{p_w}{p'_w} = \frac{c}{c'}. \qquad (4.2)$$

[Figure 4.8: Tie-in.]

Equation 4.2 ensures that the downstream unit uses the
inputs in the right proportions.¹⁶ Furthermore, it takes us
back to the one-input case. Equation 4.2 determines $x'$ as
a function of $x$, say. Hence, the upstream monopolist
needs only one more instrument to take care of the retail-price
externality. Resale-price maintenance is such an instrument.
To realize the vertically integrated profit $\Pi^m$,
the upstream monopolist must obviously impose $p = p^m$.
Last, to appropriate the downstream unit's profit, the upstream
monopolist chooses a $p_w$ and a $p'_w$ satisfying equation
4.1 and

$$p_w x^m + p'_w x'^m = p^m f(x^m, x'^m). \qquad (4.3)$$

Because $f$ is homogeneous of degree 1, the downstream
unit cannot improve on the monopoly allocation¹⁷ and
makes zero profit.
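
The zero-profit property of the tie-in can be checked in a simple parametric case. The sketch below assumes a Cobb-Douglas technology $f(x, x') = x^{a}(x')^{1-a}$, for which the unit cost at input prices $(p_w, p'_w)$ is the standard expression $(p_w/a)^{a}\,(p'_w/(1-a))^{1-a}$, together with the linear demand $D(p) = 1 - p$. These functional forms, the parameter values, and the function names are assumptions of this illustration, not part of the text.

```python
def unit_cost(p_x, p_xp, a):
    """Minimal cost of one unit of output for f = x**a * xp**(1 - a)."""
    return (p_x / a) ** a * (p_xp / (1 - a)) ** (1 - a)

def tie_in_prices(c, c_prime, a):
    """Input prices proportional to costs (eq. 4.1) that exhaust revenue (eq. 4.3).

    Assumes linear demand D(p) = 1 - p, so that the integrated monopoly
    price is (1 + kappa) / 2, where kappa is the true unit cost.
    """
    kappa = unit_cost(c, c_prime, a)   # vertical structure's true unit cost
    p_m = (1 + kappa) / 2              # monopoly price, imposed through RPM
    scale = p_m / kappa                # common mark-up factor on both inputs
    return p_m, scale * c, scale * c_prime

a, c, c_prime = 0.5, 0.1, 0.1
p_m, p_w, p_w_prime = tie_in_prices(c, c_prime, a)
print(p_m, p_w, p_w_prime)
print(p_m - unit_cost(p_w, p_w_prime, a))   # downstream margin per unit of output
```

Because both input prices are scaled by the same factor, the input mix is undistorted, and the downstream unit's cost per unit of output equals the imposed retail price, so its margin is zero at any scale.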

A limit to the use of tie-ins is that, in general, there are
many inputs (including labor), and the monopoly producer
of an input must tie all other inputs (or at least
all those that are good substitutes with his product).
Such broad ties are infrequent. The case of a durable-good
monopolist, in which the substitute “input” is maintenance,
is a good application of the previous analysis
(see exercise 4.5).

16. For instance, for a Cobb-Douglas production function

$$f(x, x') = k\,x^{\alpha}(x')^{1-\alpha},$$

one has

$$\frac{x}{x'} = \frac{\alpha}{1 - \alpha}\,\frac{c'}{c} = \frac{x^m}{x'^m}.$$

More generally, the partial derivatives of a function that is homogeneous of
degree 1 are homogeneous of degree 0. Hence,

$$\frac{\partial f(\lambda x^m, \lambda x'^m)}{\partial x} = \frac{\partial f(x^m, x'^m)}{\partial x},$$

and similarly for the derivative with respect to $x'$. The solutions to equation 4.2
have the form

$$\{x = \lambda x^m,\; x' = \lambda x'^m\}.$$

17. To minimize cost, the downstream unit would choose $x = \lambda x^m$ and
$x' = \lambda x'^m$ for some $\lambda > 0$. However, from equation 4.3 and the homogeneity of $f$,

$$p_w x + p'_w x' = \lambda\,(p_w x^m + p'_w x'^m) = p^m \lambda f(x^m, x'^m) = p^m f(x, x').$$

Let us briefly consider the new features associated with
a competitive downstream industry. Assume that the
downstream industry sells at its marginal cost (the
minimal cost given intermediate prices $p_w$ and $p'_w$).¹⁸ A
franchise fee with $p_w = c$ is no longer sufficient; the
downstream industry then sells at a price equal to the
marginal cost for intermediate prices $c$ and $c'$. Thus, the
consumer pays the vertical structure's marginal cost (too
low a price, from the viewpoint of the vertical structure),
and no profit is realized. Hence, no franchise fee can be
imposed on the downstream firms.

In contrast, a tie-in (with prices determined by equations
4.1 and 4.3) is a sufficient instrument. RPM is not
needed, because downstream competition eliminates the
second marginalization.

A royalty on the final output is also sufficient; the
upstream monopolist charges $p_w = c$ in order not to induce
input distortion, and then realizes the integrated
profit by taxing final output.¹⁹


Exercise 4.5** Some durable-good producers tie the purchase
of spare parts and maintenance to the purchase of
the good. (For instance, Boeing used to tie spare parts to
the sale of commercial jets through requirements provisions,
and required subcontractors to destroy any production
overruns of spare parts.) The purpose of this exercise
is to show that the input-substitution model offers a possible
explanation of this practice. (Can you think of alternative
explanations?)

A monopolist produces a durable good at unit cost $c$.
A competitive industry offers maintenance at unit cost
$p'_w = c'$. Another competitive downstream industry uses
the durable good to produce the final output. One unit of
the durable good in working condition produces one unit
of the final output per unit of time. Time is continuous,
and the rate of interest is $r$. The firms in the downstream
industry consume $x'$ units of maintenance per unit of
durable good and per unit of time. The conditional probability
of breakdown of the durable good is $\alpha(x')\,dt$ between
$t$ and $t + dt$, where $\alpha' < 0$ and $\alpha'' > 0$. (Breakdown
means that the durable good must be replaced.) Let $p_w$
denote the price of the durable good (assume that this
price is constant over time, to avoid the commitment
issues discussed in the supplementary section of chapter 1).

(i) Show that the price of the final good is

$$p = c'x' + p_w\,(r + \alpha(x')),$$

where $x'$ minimizes the right-hand side.

(ii) Show that the downstream industry “consumes”
too much maintenance from the vertical structure's point
of view. How can the durable-good monopolist solve this
problem?

(iii) Draw the analogy with the input-substitution model.

These three famous examples—double marginaliza- 
tion, downstream moral hazard, and input substitution— 
show that vertical integration or vertical restraints need 
not be detrimental to welfare, even when they are meant 
to increase monopoly profit.²⁰ In such circumstances, the
issue is the existence of a monopoly power per se, not its 
by-products (vertical integration or vertical restraints). 
We will see in the supplementary section that vertical 
restraints may be privately desirable and at the same time 
socially undesirable. One should be cautious when assess- 
ing the effects of such restraints, but unqualified hostility 
toward vertical restraints is inappropriate. 


4.3 Intrabrand Competition 


In most of section 4.2 we assumed that the retailer had 
monopoly power. In this section we examine the polar 
case in which the downstream sector is competitive. We 
focus on the provision of promotional services.²¹


18. Because $f$ exhibits constant returns to scale, the marginal cost is independent
of the scale of operations.


19. The adequate royalty per unit of output is

$$p^m - \frac{c x^m + c' x'^m}{q^m}.$$


20. The above models do not “explain” the existence of vertical restraints (such
as resale-price maintenance and tie-ins). In all these models, a franchise fee
suffices to obtain the vertically integrated profit. And it is difficult to argue that
franchise fees are more costly to administer (involve higher transaction costs)
than RPM or tie-ins.


21. The issue of double marginalization disappears because downstream com-
petition brings the second price-cost margin to zero. (The case of “differentiated
retailers” [see chapter 7 for the definition of differentiated products] is inter-
mediate between the monopoly case and the competitive case. Differentiation
allows retailers to introduce a second marginalization.) The effect of competi-
tion on the input-substitution problem was analyzed in example 3.


4.3.1 Intrabrand Competition and Retail Services²²


Here we use the model of example 2 of section 4.2. The 
demand function for the final good is q = D(p,s) for a 
package of p (final price) and s (services). To abstract from 
the discrimination issues of chapter 3, we assume that all 
consumers are identical. Their net consumer surplus is 
S(p,s), with ∂S/∂p = −D(p,s). The retailers, who are all
identical, incur cost Φ(s) per unit of output for service s.

The vertically integrated profit is obtained by choosing
p^m and s^m so as to maximize


$$[p - c - \Phi(s)]\,D(p,s).$$


The first-order condition for services is


$$[p^m - c - \Phi(s^m)]\,\frac{\partial D}{\partial s} = \Phi'(s^m)\,D. \tag{4.4}$$

Consider now the decentralized structure. 

Consumers will buy from the retailer who offers them 
the best package of price and services. We can then for- 
malize perfect competition in this model as retailers offer- 
ing consumers their most preferred price and services 
package subject to the condition that the retailers not lose 
money. In other words, the competitive price and level 
of services maximize S(p,s) subject to p = p_w + Φ(s),
where p_w is the intermediate price charged by the monop-
olist. Substituting for p, the competitive sector maximizes
S(p_w + Φ(s), s). The first-order condition for services is
(remember, the derivative of consumer surplus with re-
spect to price equals −D)


$$\frac{\partial S}{\partial s} = \Phi'(s)\,D. \tag{4.5}$$


Comparing equations 4.4 and 4.5, we see that com- 
petition introduces a bias in the choice of services. The 
right-hand side is the same for the decentralized and the 
integrated structures: The cost of a unit increase in ser- 
vices equals the marginal cost of services times demand. 
The left-hand side differs between the two arrangements. 
The integrated structure considers the marginal revenue 
accruing from the increase in demand. Competition con-
siders the marginal surplus, which embodies the increase
in demand for all inframarginal units:


$$\frac{\partial S}{\partial s} = \int_p^{\infty} \frac{\partial D}{\partial s}(\tilde p, s)\,d\tilde p.$$


The comparison is thus analogous to that in chapter 2 
between the qualities chosen by a monopolist (who con- 
siders only the effect of an increase in quality on the 
marginal consumer) and a social planner (who considers 
the effect of an increase in quality on the average con- 
sumer). This analogy is not coincidental; the vertically 
integrated solution is nothing but the monopoly solution, 
and competition maximizes social welfare subject to the 
fact that input is bought at price p_w rather than c.

From the analysis in chapter 2, we conclude that the 
competitive retailers may provide too few or too many 
services (from the vertical structure’s point of view), de- 
pending on whether services are more valued by mar- 
ginal or inframarginal consumers. The welfare analysis of 
vertical integration can be shown to be ambiguous. The 
provision of services under retail competition is socially 
optimal given the wholesale price. However, the whole- 
sale price is the monopolist’s choice and may exceed the 
fictitious wholesale price under vertical integration.²³


Exercise 4.6** In the previous model, which of the fol-
lowing instruments is sufficient?

(i) a franchise fee 

(ii) resale-price maintenance 


4.3.2 The Horizontal Externality 


In the previous model, the retailers exerted an externality 
on the manufacturer through their competition (they did 
not offer the level of services he preferred). In this subsec- 
tion, we tackle the issue of externalities among retailers. 
We shall consider the provision of pre-sale information 
(or advertising) by one retailer to consumers who ulti- 
mately buy from other retailers. Pre-sale information (e.g., 
literature, test drives, demonstrations by salespeople) 
matters for complex durable goods, such as automobiles, 
cameras, and stereo equipment. Telser (1960) argues that 


22. The exposition here follows Caillaud and Rey 1986.


23. This fictitious wholesale price is equal to p^m − Φ(s^m). From example 2 in
section 4.2, the manufacturer can realize the vertically integrated profit by
granting one retailer a monopoly position on retail (or, equivalently in this
model, granting exclusive territories on portions of the total demand to several
retailers) and using an instrument that is sufficient in the one-retailer case (a
franchise fee, for instance). Hence, we have an example in which eliminating
retail competition increases the manufacturer's profit.


retail competition may prevent the provision of such in-
formation. A retailer who incurs the cost of providing the
information must charge a higher price than a retailer who 
does not provide the information. Consumers then have 
an incentive to visit the first retailer to obtain information 
and then buy from the second. 

This phenomenon is starkly illustrated by considering 
the polar case in which services cannot be appropriated 
by the retailer who provides them. Suppose that the de-
mand is q = D(p,s), where s is the maximum of services
offered by any retailer and p is the lowest price charged
by any retailer.²⁴ Φ(s) is now the unit cost of services
to a consumer who visits the store and does not buy.
Assume that Φ(0) = 0. A retailer who offers services s
must make enough in sales to cover his additional costs.
At the same time, his retail price must not exceed the
wholesale price, because another retailer could lower his
price slightly, provide no services, and take all the de-
mand. In other words, the competitive price for a given s
is p = p_w. However, the retailer must break even:


$$p - [p_w + \Phi(s)] \ge 0.$$


Hence, s = 0. No services are provided.
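
The free-riding logic can be checked on a toy example. The sketch below is only an illustration: the linear cost function `phi` and the numerical values are assumptions of ours, not part of the model. It lists the service levels at which a retailer still breaks even once undercutting has driven the retail price down to the wholesale price.

```python
def phi(s):
    """Hypothetical unit cost of services, chosen so that phi(0) = 0."""
    return 0.5 * s


def retail_margin(p, p_w, s):
    """Per-unit margin of a retailer charging p and offering services s."""
    return p - (p_w + phi(s))


def sustainable_service_levels(p_w, candidate_levels):
    """Service levels at which a retailer breaks even when p is driven to p_w."""
    # A rival offering no services can always undercut any price above p_w.
    competitive_price = p_w
    return [s for s in candidate_levels
            if retail_margin(competitive_price, p_w, s) >= 0]


levels = [0.0, 0.5, 1.0, 2.0]
print(sustainable_service_levels(p_w=3.0, candidate_levels=levels))
```

With any cost function that is positive for s > 0, only the zero service level survives, which is the polar result stated in the text.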

More generally, the horizontal externality gives rise to 
a public-good problem. Retailers free-ride on one another. 
The public good—the information provided to the con- 
sumers—is therefore undersupplied. (It is not supplied at 
all in our extreme example.) To encourage an adequate 
provision of services by retailers, competition must be 
reduced or eliminated. The manufacturer must give the 
retailers a property right on their services by protecting 
them against unfair competition. Competition-reducing 
restraints, such as RPM and exclusive territories, are 
adequate to serve that purpose. RPM prevents the emer- 
gence of discount stores and encourages consumers to 
buy where the services are provided (because they will 
not find a better price elsewhere). Exclusive territories will
also suffice. For instance, the manufacturer can grant a 


24. A finer description of the effect of pre-sale information on demand would 
be desirable here. One can imagine that consumers go first to the retailer with 
the highest level of pre-sale information and then choose from among retailers 
on the basis of price only.


25. See Perry and Porter 1986 for an analysis of retail-service externalities in a
different product-differentiation space.


monopoly position to a retailer. The absence of competi-
tion precludes any horizontal externality. Such restraints
are generally welfare-enhancing, because they allow the
retailers to supply valuable information to the consumers
(although one must, as usual, be cautious and consider the 
change in final price associated with the change in ser- 
vices when moving from the downstream competitive 
solution to the downstream monopoly solution).²⁵

Mathewson and Winter (1984) build a model in which 
retailers are differentiated by their locations (see the 
model of spatial differentiation in section 2.1). They ad-
vertise the product locally (i.e., to their own customers),
but some spillovers (due to word-of-mouth communica-
tion among consumers, say) create a positive externality
on other retailers in other geographical areas.²⁶ As above,
this externality must be encouraged. For instance, if ex-
clusive territories are granted, the optimal two-part tariff
makes each retailer a “more than residual claimant”: p_w < c.
The intermediate good is subsidized, giving each retailer 
an extra margin that encourages him to advertise more, 
which benefits the manufacturer through the positive 
externality on other retailers. 

The horizontal-externality argument is often invoked 
to explain the existence of RPM. Often, however, RPM is 
imposed on goods that involve relatively few inappro-
priable pre-sale services.²⁷ Lately the externality argu-
ment has been extended to goods for which the distribu-
tor offers quality certification. The idea is that pre-sale 
services need not take the form of time spent with the 
customer, free brochures, and the like. Rather, the simple 
fact that a certain retailer carries the product may signal 
that the product is of high quality. Prestigious stores such 
as Bloomingdale's have built reputations for picking qual- 
ity items. Such a store is willing to engage its reputation 
by carrying a given product only if the product is suffi- 
ciently profitable—in particular, if it is not carried by 
discount stores. Resale-price maintenance is one way of 
keeping consumers from buying from a discount store 


26. A similar externality arises among a group of franchisees using a common 
name. A McDonald's franchisee who cuts quality hurts the other McDonald's 
franchisees. Indeed, McDonald's goes to great lengths to measure quality at
each location using a “quality-service-cleanliness index.” 


27. See Overstreet 1983 and Steiner 1985 for critiques of the too-frequent
application of this theory.


after getting the message about the high quality of the
good from a prestigious store.²⁸


4.3.3 Differentiated Retailers


In the previous two examples, retail competition hurts the 
manufacturer, who might as well grant a retail monopoly. 
In general, however, a manufacturer may want to keep 
several retailers. First, competition may discipline the re- 
tailers, as we shall see in the next subsection. Second, 
consumers may be heterogeneous, and having retailers 
located at different points in geographical or quality space 
(see section 2.1) enables a manufacturer to better appro-
priate their surplus.²⁹ The manufacturer can choose the
number of retailers either directly, or indirectly through 
the level of the franchise fee (the retailers then enter until 
they make zero profit). Of course, the manufacturer cares 
not only about the number of retailers but also about their 
locations. It turns out that for homogeneous geographical 
spaces (such as a uniform distribution of consumers along 
a circle—see chapter 7) there is no conflict between the 
manufacturer and a given number of retailers as to their 
locations. This is because the retailers try to differen- 
tiate themselves as much (i.e., locate as far apart) as they 
can in order to avoid intense intrabrand competition (see 
chapter 7). The vertically integrated structure would also 
differentiate retailers as much as possible in order to cap- 
ture consumer surplus. In contrast, a conflict as to location 
arises in more general product spaces (see Bolton and 
Bonanno 1985 for a model of quality differentiation with 
such a conflict; see also subsection 4.3.1 above). 


4.3.4 Retail Competition as an Incentive Device 


Since a retailer is an agent for the manufacturer (in an 
economic sense, not necessarily in a legal sense), he must 
be given incentives to choose the adequate level of pro- 
motional services, retail price, etc. The deterministic en- 
vironment of section 4.2 allowed the manufacturer to 
perfectly control each retailer's actions and to realize the 
“vertically integrated” (symmetric-information) profit— 
for instance, by using the two-part tariff. However, as was 
emphasized in the chapter on the theory of the firm,
uncertainty and asymmetric information generally create
a real control problem by introducing a basic trade-off
between insurance and incentives. The risk-averse agent
(retailer), who is provided with some insurance, tends to
have too little incentive to take actions that are costly to
him (high promotional effort, low retail price, etc.) but
profitable to the principal (manufacturer).

Competition on the retail market may alleviate the 
agency problem, because (as bizarre as this may seem) it 
provides some insurance to the retailers. Suppose, for 
instance, that a retailer is hurt by an increase in retail cost 
(a direct effect). If he competes on the retail market and 
his rivals are also hurt by the increase in retail cost (such 
a correlation is not unreasonable, given that they are in 
the same market), these rivals raise their prices or lower 
their services, which raises the demand for the first retailer 
and reduces his loss in profit. This indirect effect does not 
exist for a monopoly retailer. Thus, competition smoothes 
the retailer's profit stream over states of nature (potential 
retail costs). The same argument applies to shocks in 
demand in this retail market. Because competition has 
desirable insurance properties, and thus alleviates the 
trade-off between insurance and incentives, the manufac- 
turer can give more incentives to his retailers when the 
latter compete in the product market. Thus, competition 
can also be seen as an incentive device. 

On the other hand, competition is destructive of profit, 
as will be emphasized throughout part II. Yet a monopoly
retailer is not constrained by competitive pressure, and 
can fully extract the monopoly profit from his retail mar- 
ket. We thus conclude that the manufacturer faces a trade- 
off between high incentives (provided by retail competi- 
tion) and the optimal exploitation of monopoly power 
(obtained by allowing a single retailer, or else by impos-
ing exclusive territories). For a formal model, and for a
discussion of the welfare implications of this, see the
supplementary section.


4.3.5 Dealer Cartels 


Competing retailers may pressure a manufacturer to im- 
pose competition-reducing vertical restraints. As an ex- 
treme example, consider the case of a group of retailers 


28. See Oster’s analysis of Levi Strauss and Greening’s of Florsheim Shoes in
Lafferty et al. 1984. For a formal analysis of quality certification, see Marvel
and McCafferty 1984.

29. See Dixit 1983 and Mathewson and Winter 1984, 1986 for spatial models.


who purchase an intermediate good at its competitive
price, p_w = c. That is, assume away upstream monopoly
power. These retailers compete on the final price. Thus, 
the final price is equal to their marginal cost (that is, if 
they do not have other retail costs, p = c), and they do 
not make a profit. (See chapter 5 for a discussion of this 
conclusion.) Suppose now that they have the clever idea 
of creating a “trademark.” They create an institution that 
“certifies” the product. (Recall, however, that there is no 
quality problem here.) In turn, this upstream institution
“imposes” resale-price maintenance or exclusive territories 
on them. RPM specifies a minimum retail price (for in- 
stance, the monopoly price) that exceeds the marginal 
cost c; exclusive territories divide the market into sub- 
markets in which a retailer has monopoly power. In both 
cases, the creation of a phony upstream institution allows 
the retailers to make profits by reducing competition. 

Of course, this simple story begs some questions: What 
happens if a retailer does not comply with the trademark 
agreement? Can the retailers impose vertical restraints 
on noncompetitive upstream manufacturers? However, it 
starkly illustrates the issue. In this framework, the ver- 
tical restraints are not meant to enhance the efficiency of 
the vertical structure. They are introduced only to allow 
the retailers to raise the consumer prices above marginal 
cost. Such vertical agreements are just a veil for horizon-
tal collusion. They lower welfare in the same way that
monopoly pricing lowered welfare in chapter 1. 
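
To fix ideas, the welfare cost of such a dealer cartel can be computed in a simple case. The sketch below assumes a linear demand D(p) = a − p and illustrative values of a and c; these are our assumptions, not part of the text. It compares the competitive outcome (p = c) with a "trademark" that imposes the monopoly price through RPM.

```python
def demand(p, a):
    """Hypothetical linear demand D(p) = a - p, truncated at zero."""
    return max(a - p, 0.0)


def consumer_surplus(p, a):
    """Area under the linear demand curve above price p."""
    return 0.5 * demand(p, a) ** 2


def outcome(p, a, c):
    """Retail profit, consumer surplus, and welfare at retail price p."""
    q = demand(p, a)
    profit = (p - c) * q
    cs = consumer_surplus(p, a)
    return {"price": p, "profit": profit,
            "consumer_surplus": cs, "welfare": profit + cs}


a, c = 10.0, 2.0                      # illustrative parameter values
competitive = outcome(c, a, c)        # retailers price at marginal cost
cartel = outcome((a + c) / 2, a, c)   # RPM floor set at the monopoly price
deadweight_loss = competitive["welfare"] - cartel["welfare"]

print(competitive)
print(cartel)
print("deadweight loss:", deadweight_loss)
```

In this example the retailers' joint profit rises from zero to (a − c)²/4, while total welfare falls by (a − c)²/8, the usual monopoly deadweight loss under linear demand.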

At the current level of comprehension of the issue, it is 
generally admitted that restraints imposed by a retailers’ 
cartel are harmful. Even economists associated with the 
Chicago school, who usually see vertical integration and 
vertical restraints as increasing the efficiency of vertical 
structures and welfare,³⁰ have argued against the legality
of retailer-imposed restraints.³¹


4.4 Interbrand Competition 


Here the link between vertical control and the consumers’ 
choice between products will be discussed briefly. Two
cases will be distinguished. In the first, the vertical re-
straint is imposed as a way to enhance efficiency by
encouraging the manufacturer to provide services. In the
second, the manufacturer imposes a vertical restraint to
affect the behavior of its upstream rivals.³²


4.4.1 Exclusive Dealing and Efficiency 


Exclusive dealing—the stipulation that a retailer may 
not sell a brand that competes with the manufacturer's 
product—may result in a loss of returns to scale. For 
instance, the retailer's employment is increased and con- 
sumers’ search costs are reduced when the retailer carries 
several products. However, exclusive dealing may also 
bring a gain in efficiency. The argument here is the mir- 
ror image of that advanced in subsection 4.3.2, which 
was that a retailer may have to be granted an exclusive 
territory (or, more generally, a downstream-competition- 
reducing restraint) to give him incentives to provide pre- 
sale information. Similarly, exclusive dealing (a priori, an 
upstream-competition-reducing restraint) may induce the 
manufacturer to supply promotional services. The point is 
that the manufacturer may promote the product, give the 
retailer a certain location, and so forth, but the retailer 
may induce the consumers who visit him to buy the 
competing brand (which, presumably, carries a higher 
profit margin for the retailer if the competing manufac- 
turer does not incur the same promotional expenses). 
Exclusive dealing is then seen as a way of giving the 
manufacturer a property right on his promotional ex- 
penses (Marvel 1982). 


4.4.2 Vertical Restraints and Upstream Strategic 
Behavior 


It has been argued that some vertical restraints are used 
by manufacturers to restrict upstream competition. 

The first and best-known argument is that exclusivity 
contracts (exclusive dealing, long-term contracts with re- 
tailers) form a barrier to entry. Such contracts force new 
manufacturers to set up their own distribution networks 


30. See, e.g., Posner 1981.


31. See, e.g., Posner 1976. In 1981, the new assistant attorney general in
charge of the antitrust division of the U.S. Department of Justice, William
Baxter, testifying before a congressional committee, stated: “In my view, there
is no such thing as a vertical ‘problem’. The only possible adverse competitive
consequences of vertical arrangements inhere in their horizontal effects. Only
where vertical arrangements facilitate restricted output and raised prices --
horizontal impacts -- should they be inhibited.” (quoted in Howard 1983 on
pp. 150-151)


32. This case relates to strategic competition, which is the subject of part II of
the book. Thus, only the main arguments will be mentioned here.


(which is costly, whether or not the new distributors can
quickly offset their disadvantages in terms of goodwill 
and experience). Thus, new manufacturers are less inclined 
to enter. (A variant of this important argument is studied 
in the supplementary section, where it is shown that pri- 
vate contracting tends to excessively foreclose the access 
to markets for new entrants.) 

The second argument is one of market discipline. Telser 
(1960) and Posner (1977) have argued that RPM can help 
competing manufacturers sustain collusion by reducing 
the efficacy of secret wholesale-price cuts (see chapter 6 
on the factors that facilitate or hinder collusion). In a 
somewhat different vein, Bonanno and Vickers (1986), 
Gal-Or (1987), and Rey and Stiglitz (1986) have shown 
that restraints that reduce downstream competition (such 
as exclusive territories) may also soften upstream com-
petition. Thus, the manufacturers may also adopt vertical
restraints for strategic purposes.³³


4.5 Concluding Remarks 


In a single-manufacturer, single-retailer, deterministic en- 
vironment, the basic vertical externality associated with 
linear pricing creates an excessive demand contraction, 
even from the viewpoint of the vertical structure. Re-
straints that correct this externality tend to be wel-
fare improving. Thus, vertical restraints that reduce com- 
petition (either downstream or upstream) seem better 
candidates for the economist’s attention. Although the 
horizontal-externality argument associated with pre-sale 
services rightly calls for the suppression of downstream 
competition, the two models of competition analyzed in 
the supplementary section (of retail competition as an 
incentive device, and of long-term contracts as a barrier to 
entry) both yield the conclusion that private contracting 
yields too much market foreclosure (too little competi- 
tion) from a social point of view. Competition-reducing 
restraints should be a focus of research, as much work 
remains to be done in this crucial area of vertical control. 

Theoretically, the only defensible position on vertical 
restraints seems to be the rule of reason. Most vertical
restraints can increase or decrease welfare, depending on
the environment. Legality or illegality per se thus seems 
unwarranted. At the same time, this conclusion puts far 
too heavy a burden on the antitrust authorities. It seems 
important for economic theorists to develop a careful 
classification and operative criteria to determine in which 
environments certain vertical restraints are likely to lower 
social welfare. 


33. The treatment of these contributions, which requires a good comprehen-
sion of parts of chapters 5 and 8, is beyond the scope of this overview of
vertical restraints. See review exercise 19.


4.6 Supplementary Section:
Competition-Reducing Restraints 


In this section we shall analyze two models of market 
foreclosure. In the first, the manufacturer decides whether 
to let simultaneous (product-market) competition operate 
between his retailers (or licensees, or downstream firms). 
The second model considers sequential competition in a 
model of supplier switching, and analyzes whether equi-
librium long-term contracts between a buyer and a sup-
plier form a barrier to entry for other suppliers. In both
models, it is found that private contracting yields too 
much foreclosure—i.e., too little competition—from a 
social viewpoint. 


4.6.1 The Role of Retail Competition in Promoting 
Efficiency³⁴


We saw in section 4.3 how retail competition may hurt 
the manufacturer. Competition puts constraints on re- 
tailers and prevents them from extracting the monopoly 
profit. To realize the (higher) vertically integrated profit, 
the manufacturer prevented competition through, for in- 
stance, exclusive territories. A retailer who is given a 
monopoly situation on part of the demand and who pur- 
chases the intermediate good at marginal cost internalizes 
the vertical structure’s objectives. He thus makes the deci- 
sions that yield the vertically integrated profit, which can 
then be captured by the manufacturer through a franchise 
fee. In such a world, the imposition of exclusive territories
cannot hurt the manufacturer. Nor can the imposition of 
the other main intrabrand-competition-reducing restraint, 
resale-price maintenance. If retailers face no uncertainty, 
the manufacturer can perfectly foresee the final price they 
will charge, so he could always duplicate the situation 
without restraints by imposing the retail price that would 
then prevail. 

This supplementary section begins with a demonstra- 
tion that in the presence of uncertainty and asymmetric 
information, competition-reducing restraints have some 
drawbacks. First, they may not allow the efficient use of
the information held by retailers. Second, they may give 


34. These comments are derived from Rey and Tirole 1986b.


35. In the chapter on the theory of the firm, it was noted that product com-
petition may help to discipline the firm's managers (see, e.g., the Hart argu-


the retailers an inadequate amount of insurance. Competi-
tion among retailers may become a profitable option for 
the manufacturer³⁵ (although competition creates its own
vertical inefficiencies, particularly by constraining the re- 
tailers’ choices). 

After discussing these issues in broad terms, we con- 
sider a more formal model of retail competition. In this 
model, retail competition either is or is not optimal for 
the manufacturer; however, it is always socially supe- 
rior to no retail competition. We explain why the con- 
sumers, whose interests are not accounted for in the 
arrangement between manufacturer and retailers, prefer 
retail competition. 


4.6.1.1 Uncertainty, Delegation, and Insurance 


Uncertainty 


Two types of uncertainty faced by retailers in a given 
market can be distinguished (think of retailers within the 
same geographic area): demand uncertainty and retail-
cost uncertainty. Demand varies with the consumers’
tastes or with the degree of interbrand competition; retail 
cost is affected by technological progress, wages, input 
prices, and so forth. Assume that the uncertainty affects 
all retailers in a given market in the same manner. At the 
point in time when retailers and manufacturers make con- 
tracts, everyone has the same beliefs about the potential 
realizations of demand and retail-cost uncertainty. After 
the contracts are signed, the uncertainty is resolved and 
learned by the retailers, who then take some actions (for 
instance, the choice of the retail price or promotional 
effort) that affect the vertical structure's targets. The right 
to make decisions in response to changes in the economic 
environment is delegated to the retailers. 

To illustrate the issues, it suffices to consider the fol- 
lowing simple framework (the formal development of this 
framework is given below): There are n retailers in a given
market (n > 1). The manufacturer charges two-part tariffs
to his retailers. As before, the form of the tariffs is


$$T(q) = A + p_w q,$$


where q is the quantity of the intermediate good bought


ment). A somewhat similar phenomenon occurs here with the retailers, with
the important difference that the existence of product competition is a choice
variable for the manufacturer.


by the retailers. Under retail competition, the retailers
choose retail prices. Consumers, who consider the re- 
tailers to be identical, go to the retailer with the lowest 
price. Their demand is q = D(p,d), where p is the lowest
retail price and d is the parameter of demand uncertainty
(D decreases with p and increases with d). The retailers
have a retail cost γ per unit of sales, where γ, like d, is
uncertain at the contract date but is revealed to the re-
tailers before the final price is chosen. The manufacturer
may impose exclusive territories or RPM, depending on
whether such restraints are enforceable. Exclusive terri-
tories divide consumers into n groups. Each retailer then
has monopoly power and faces demand q = D(p,d)/n.
For RPM, the manufacturer fixes p in the contract. As-
sume that the retailers then share the demand equally:
Each sells q = D(p,d)/n.³⁶


Delegation 


The manufacturer would like the vertical structure to 
yield the vertically integrated profit (even if this profit is 
realized by retailers, because a franchise fee can be used to 
appropriate the retailers’ profits). The ex post vertically 
integrated profit is 


$$\max_p\,\{(p - c - \gamma)\,D(p,d)\}.$$


The retail price that maximizes this profit is contingent on
the realizations of the demand and retail-cost parameters;
assume that p^m(d,γ) grows with d and γ (see chapter 1 for
a proof that p^m grows with γ; reasonable conditions can
be found so that p^m grows with d as well). For instance, if


$$D(p,d) = d - p$$

(linear demand), then

$$p^m(d,\gamma) = (d + c + \gamma)/2.$$
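
As a quick sanity check on the linear-demand formula, the sketch below compares the closed-form price (d + c + γ)/2 with a brute-force maximization of the ex post integrated profit over a price grid. The parameter values and the function names are illustrative assumptions of ours.

```python
def integrated_profit(p, d, c, gamma):
    """Ex post vertically integrated profit (p - c - gamma) * D(p, d), with D = d - p."""
    return (p - c - gamma) * (d - p)


def monopoly_price(d, c, gamma):
    """Closed-form maximizer for linear demand: (d + c + gamma) / 2."""
    return (d + c + gamma) / 2


def grid_argmax(d, c, gamma, steps=10001):
    """Brute-force maximizer of the integrated profit over a price grid on [0, d]."""
    best_p, best_val = 0.0, float("-inf")
    for i in range(steps):
        p = d * i / (steps - 1)
        val = integrated_profit(p, d, c, gamma)
        if val > best_val:
            best_p, best_val = p, val
    return best_p


d, c, gamma = 10.0, 2.0, 1.0          # illustrative values
print(monopoly_price(d, c, gamma))    # closed form
print(grid_argmax(d, c, gamma))       # numerical check
```

The first-order condition d − p − (p − c − γ) = 0 gives the same price, so the two printed values should agree up to the grid spacing.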


The delegation problem consists of inducing the re-
tailers to choose actions that come as close as possible
to optimal actions (here, p^m(d,γ)) for all realizations


36. Here, imposing exclusive territories on top of RPM would be redundant.


37. See chapter 5. We take a sharp view here of competition by assuming that
the retailers cannot use tacit collusion. Also, the assumption that retailers are
not differentiated (either naturally or through cost and demand shocks) is
meant to formalize the notion of pure competition.


38. Their objective function is


of demand and retail-cost uncertainty. Let us examine
the properties of competition and those of competition- 
reducing restraints in this respect. 

Under competition, the retail price is driven down to 
the retailers’ total marginal cost, which is equal to the 
wholesale price plus the retail cost: p = p_w + γ.³⁷ The
retail price is then entirely determined by cost. The price 
is not responsive to the demand parameter, and fully 
embodies the retail cost. In contrast, the vertically inte- 
grated price is responsive to the demand parameter and 
may embody only part of the retail cost (in the case of 
linear demand, it embodies only 50 percent of the cost). 
As in section 4.3, competition constrains retailers in the 
search for the monopoly profit. 

Like competition, RPM uses decentralized information
nonoptimally. Indeed, the retail price is fixed before the
uncertainty is resolved, so it is not responsive to demand
and retail-cost conditions at all.

In contrast, exclusive territories create local monop- 
olies which can adjust to the realization of uncertainty 
without being constrained by the manufacturer (as with 
RPM) or by other retailers (as in the case of competition). 
Indeed, if the manufacturer does not distort the inter-
mediate price (p_w = c), each retailer maximizes


$$(p - c - \gamma)\,D(p,d)/n - A$$


and thus chooses the right retail price, p^m(d,γ). Thus,
exclusive territories make excellent use of decentralized 
information. 
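
The use of information under the three arrangements can be illustrated numerically. The sketch below assumes linear demand D(p,d) = d − p, four equally likely states (d, γ), and arbitrary (not optimized) values for the wholesale price under competition and for the RPM price; all of these are assumptions made for illustration. It computes the expected profit of the vertical structure under each pricing rule, ignoring risk aversion.

```python
from itertools import product

C = 1.0  # manufacturer's marginal cost (illustrative value)

# Equally likely states (d, gamma): demand and retail-cost parameters.
STATES = list(product([8.0, 12.0], [1.0, 3.0]))


def structure_profit(p, d, gamma, c=C):
    """Profit of the whole vertical structure at retail price p (linear demand)."""
    return (p - c - gamma) * max(d - p, 0.0)


def exclusive_territory_price(d, gamma, c=C):
    """Local monopolist facing p_w = c picks the integrated price (d + c + gamma) / 2."""
    return (d + c + gamma) / 2


def expected_profit(price_rule):
    """Average structure profit over the states for a given retail pricing rule."""
    return sum(structure_profit(price_rule(d, g), d, g) for d, g in STATES) / len(STATES)


P_W = 5.5        # wholesale price under competition (arbitrary, not optimized)
P_RPM = 6.5      # retail price fixed by RPM (arbitrary, not optimized)

et = expected_profit(exclusive_territory_price)
comp = expected_profit(lambda d, g: P_W + g)   # price tracks cost only
rpm = expected_profit(lambda d, g: P_RPM)      # price tracks nothing

print("exclusive territories:", et)
print("competition:          ", comp)
print("RPM:                  ", rpm)
```

Because the exclusive-territory price maximizes the structure's profit state by state, its expected profit is at least as large as under the other two rules for any choice of `P_W` and `P_RPM`. This comparison concerns information only; the insurance side of the trade-off is a separate matter.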


Insurance 


If retailers are risk-averse,³⁸ the manufacturer is concerned
with the amount of risk the retailers bear. Any increase in 
risk for the retailers reduces the franchise fee the manufac- 
turer can extract from them. Thus, the monopolist wants 
to keep the retailers from bearing risk. Let us assume that 
the manufacturer is risk-neutral.³⁹

Competing retailers are perfectly insured. Their margin 


EU p — pa — sia — A), 


where g is the quantity they sell, Lis a concave von Neumann~Morgenstem 
ubility function. and E denotes the expectation over d and 


39. This assumption 1s justified, for instance when the manufacturer serves a 
large number of independent markets 


Chapter 4 


(2p — py — 7) is equal to zero for any realization of the 
cost uncertainty. Thus, their profit is deterministic. The 
intuition is as follows: When, for instance, a retailer's retail 
cost increases, his competitors’ retail costs also increase. 
The competitive pressure is lowered, in the sense that the 
competitors’ prices increase. Thus, competition has very 
desirable insurance properties. 
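
The insurance comparison can be illustrated numerically. The following Python sketch is only an illustration under assumptions that are not in the text: linear demand q = d - p, a wholesale price equal to the marginal cost c in both arrangements, a zero franchise fee, and hypothetical parameter values. It computes a retailer's profit state by state.

```python
def competition_profit(p_w, gamma, d, n):
    """Retailer profit under competition: price is p_w + gamma, so the margin is zero."""
    price = p_w + gamma
    quantity = max(d - price, 0.0) / n
    return (price - p_w - gamma) * quantity


def exclusive_territory_profit(c, gamma, d, n):
    """Retailer profit with a local monopoly, wholesale price c, and no franchise fee."""
    price = (d + c + gamma) / 2.0  # maximizes (p - c - gamma) * (d - p)
    quantity = max(d - price, 0.0) / n
    return (price - c - gamma) * quantity


# Hypothetical states of nature: (demand parameter d, retail cost gamma).
states = [(10.0, 1.0), (10.0, 3.0), (14.0, 1.0), (14.0, 3.0)]
c, n = 2.0, 4  # hypothetical marginal cost and number of retailers

profits_competition = [competition_profit(c, g, d, n) for d, g in states]
profits_exclusive = [exclusive_territory_profit(c, g, d, n) for d, g in states]
print(profits_competition)
print(profits_exclusive)
```

In this sketch the competitive profit is the same in every state, while the exclusive-territory profit varies with both d and the retail cost, which is the sense in which competition insures the retailers.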

RPM makes the retailers bear all the fluctuations in the
retail cost that cannot be passed on to the consumers.
Whether the retailers bear fluctuations in demand depends
upon whether the contract leaves them a positive
profit margin ($p - p_w - \gamma$).

Last, exclusive territories let the retailers bear the fluctuations
of the vertical structure; at least, they do so if
the retailers are made the residual claimants ($p_w = c$), in
which case the manufacturer bears no risk. Hence, exclusive
territories give mediocre insurance to the retailers.

Thus, competition is superior to competition-reducing 
restraints in that it gives more insurance to retailers. (This 
is from the vertical structure's point of view; as we will 
see below, competition offers other advantages to the 
consumers.) 

Let us now develop the vertical structure's trade-off 
between competition and a competition-reducing restraint
by completing and justifying the previous model, and let 
us introduce the welfare analysis into the model. 


4.6.1.2 A Model of Retail Competition 


Consider the model exposited in subsection 4.6.1.1: Con- 
sumer demand is equal to q = D(p,d), where p is the 
retail price and d is the uncertain demand parameter. The 
common and uncertain retail cost per unit of sales is equal
to $\gamma$. Retailers are ex ante and ex post identical. First, they
sign a contract with the manufacturer; second, they learn
the realizations of the demand and retail-cost parameters;
third, they choose a retail price.

To define the set of feasible contracts, let us now make 
assumptions about what the manufacturer observes (i.e., 
on what the contract can be contingent). As we will see, 
these assumptions imply that the only feasible contracts 
are competition and exclusive territories, both combined
with a two-part tariff (in particular, RPM is not enforce-
able under these assumptions).


Assumption 1 The manufacturer observes how much of 
the intermediate good a retailer buys directly from him, 
and whether the retailer carries the product. 


Assumption 2 The manufacturer does not observe the
realization of the uncertainty ($d$ and $\gamma$), the quantity sold
by a retailer, or the retailers' prices and profits.


Assumption 3 The manufacturer serves many indepen-
dent markets,40 and retailers engage in arbitrage (they
“bootleg”).


Assumptions 1 and 3, and the nonobservability of 
the quantity sold by the retailer, mean that the manufac- 
turer cannot charge different marginal prices because of 
arbitrage (as discussed above), but he can still impose a 
two-part tariff as he observes whether the retailer carries 
his product. 

The assumption that a retailer's consumer price is not 
observed by the manufacturer can be justified in two 
ways: First, the retailer may give secret price discounts to 
his customers. Second, the retailer can include in the sales 
package services that are not observable by the manufac- 
turer (or, at least, not verifiable by a court). For instance, 
suppose that the retailer offers services that have a mone- 
tary equivalent s for the consumers. The “real retail price” 
for the consumer is then p — s, where p is the nominal 
price charged by the retailer. The demand function is then 
$q = D(p - s, d)$. Suppose further that the cost of providing
services $s$ per unit of sales is $\Phi(s) = s$. Then the
retailers' margin is $p - s - p_w - \gamma$. It is clear that even if
$p$ were observable, the unobservable choice of services
means that the retailer chooses a “generalized retail price”
$\tilde{p} = p - s$, which is not observable by the manufacturer.
The model can also be interpreted as a model of retailers'
incentives to supply services. Anticipating what follows, we
will obtain the conclusion that product-market competition
acts as an incentive device. We saw in the chapter
on the theory of the firm that there is a basic conflict
between insurance and incentives in agency. As competition
is a good insurance device, the manufacturer can thus
induce the retailers to supply more services (promotional
effort) by forcing them to sell in the same market.


40. Here “independent” means that the realizations of the uncertainty in different
markets are statistically independent, as (for instance) in the case of geographic
markets (cities, regions, etc.) with local demand and cost conditions.

Hence, only two-part tariffs are enforceable. To intro- 
duce a choice between competition and a competition- 
reducing restraint, let us make another assumption: 


Assumption 4 The manufacturer can divide the market 
into n submarkets over which a retailer has monopoly 
power (exclusive territories) if he so chooses. 


The optimal contract between the manufacturer and retailers
is thus a two-part tariff, accompanied either by price
competition among retailers or by exclusive territories.41

The manufacturer is risk-neutral. His profit for a total
quantity $Q = nq$ sold is


$n(A + p_w q)$.


A retailer makes a profit of

$(p - p_w - \gamma) q - A$

when he sells an amount $q$. He is risk-averse, and he has a
von Neumann-Morgenstern utility function $U$ ($U' > 0$,
$U'' < 0$). His reservation profit is zero. The expected utility
derived from the retailing contract must be at least the
retailer's utility when he does not sign this contract:


$E\,U((p - p_w - \gamma) q - A) \geq U(0)$, (4.6)


where the expectation is taken over $d$ and $\gamma$. It is clear
that, in equilibrium, equation 4.6 is satisfied with equality
because the manufacturer can always raise the franchise
fee if the retailer's expected utility strictly exceeds his
reservation utility. We will consider a family of utility
functions indexed by their Arrow-Pratt index of absolute
risk aversion,42

$-U''(x)/U'(x)$.

This family includes the polar cases of risk neutrality (i.e.,
$E\,U(x) = E x$: the retailer cares only about his expected
profit) and infinite risk aversion (i.e., $E\,U(x) = \min x$: the
retailer cares only about his worst possible income). An
increasing index means that the retailer is more risk-
averse. Last, all retailers have identical preferences.


41. Could the manufacturer not do better by using the perfect correlation
between the retailers' realizations of uncertainty, as is done in the “yardstick
competition” literature (see the chapter on the theory of the firm)? Here, final
sales are not observed. Hence, any information that the manufacturer obtains
about the realization of uncertainty must come from “costless” announcements
by the retailers. It is easily seen that the set of equilibria and payoffs of
such an announcement game is independent of the realization of uncertainty,
at least when preferences are exponential (including risk-neutral and infinitely
risk-averse preferences). Thus, under the reasonable assumption that the retailers
coordinate on the same announcement equilibrium (whatever the realization),
the manufacturer cannot gain by designing such an announcement game.

We also ruled out the possibility of auctioning off the market to one retailer
after the realization of uncertainty (this assumption can be justified by the need
for several retailers, either because of prior capacity investments or increasing
marginal cost of distribution or, in a slight departure from the model, by the
existence of retailers' goodwill or differentiation).

We can now prove the following proposition: 


Proposition 1 The difference between the manufacturer's 
profits under competition and under exclusive territories 
grows with the retailers’ risk aversion. The manufacturer 
imposes exclusive territories for low risk aversion. Suffi- 
cient conditions for the manufacturer to let competition 
play for high risk aversion are that, under monopoly,
demand and the monopoly price are nondecreasing in the
demand parameter, and that the monopoly price does not
respond more than fully to changes in marginal cost
($\partial D(p, d)/\partial d \geq 0$, $\partial p^m(c, d)/\partial d \geq 0$, and
$\partial p^m(c, d)/\partial c \leq 1$ for all $p$, $c$, and $d$). (These sufficient
conditions are in particular satisfied for linear demand
$D(p, d) = d - p$, for which the first derivative is equal to 1
and the last two derivatives are equal to 1/2.)


The proof of this proposition is simple. First, note
that under competition the manufacturer's profit does not
depend on the retailers' degree of risk aversion. This
is because retailers are perfectly insured (see subsection
4.6.1.1), and thus risk aversion plays no role. (Note in
passing that, because the retailers make no marginal profit,
the franchise fee is zero.) In contrast, the profit under
exclusive territories decreases with risk aversion, since
retailers bear risk under exclusive territories. From the
Arrow-Pratt theorem (see footnote 42), the less risk-averse
the retailers are, the greater is the certainty equivalent of
their random profit


$\max_p \, [(p - p_w - \gamma) D(p, d)/n - A]$,

where the certainty equivalent is the value $z$ such that


$U(z) = E\,U\big(\max_p \, [(p - p_w - \gamma) D(p, d)/n - A]\big)$.


So, for a given wholesale price (which entirely determines
the distribution of the random profits), the less risk-averse
the retailers, the greater the franchise fee can be while still
keeping equation 4.6 satisfied. Hence, the manufacturer
benefits from a decrease in risk aversion.


42. See Arrow 1970 and Pratt 1964. A utility function $U_1$ is said to exhibit
more risk aversion than a utility function $U_2$ if, for all realizations of $x$,

$-U_1''(x)/U_1'(x) > -U_2''(x)/U_2'(x)$.

The Arrow-Pratt theorem asserts that, for a given distribution of the random
variable $\tilde{x}$, the certainty equivalent $x_1$ for utility $U_1$, defined by

$U_1(x_1) = E\,U_1(\tilde{x})$,

is lower than the certainty equivalent $x_2$ for utility $U_2$, defined by

$U_2(x_2) = E\,U_2(\tilde{x})$.

An example of such a family is given by $U(x) = -e^{-\rho x}$, where $\rho$ is the index of
absolute risk aversion.
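
The role of the certainty equivalent in this argument can be checked on a small example. The Python sketch below is an illustration only: it assumes the exponential utility family mentioned in footnote 42, written here as U(x) = -exp(-rho x), and a hypothetical two-state distribution of the retailer's gross profit. For this family, and with a zero reservation profit, the largest franchise fee that satisfies equation 4.6 with equality coincides with the certainty equivalent of the gross profit.

```python
import math


def certainty_equivalent(profits, probs, rho):
    """Certainty equivalent of a random profit under U(x) = -exp(-rho * x)."""
    expected_utility = sum(q * -math.exp(-rho * x) for x, q in zip(profits, probs))
    return -math.log(-expected_utility) / rho


# Hypothetical gross profits in two equally likely states of nature.
profits = [1.0, 4.0]
probs = [0.5, 0.5]

for rho in (0.1, 0.5, 1.0, 2.0):
    print(rho, round(certainty_equivalent(profits, probs, rho), 4))
```

The printed values fall as rho rises, moving from near the expected profit toward the worst-state profit, so the fee the manufacturer can charge shrinks as the retailers become more risk-averse.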

Under risk neutrality, we know from subsection 4.6.1.1 
that the monopolist makes more profits by imposing ex- 
clusive territories than under competition. This is because 
there is no insurance problem, and exclusive territories 
make perfect use of decentralized information (exclusive 
territories yield the vertically integrated profit) whereas 
competition does not. 

Last, under infinite risk aversion (the other polar case), 
its insurance properties make the competitive arrange- 
ment more profitable for the manufacturer than exclusive 
territories. See the following exercise. 


Exercise 4.7** Assume that the conditions in the last part 
of proposition 1 are satisfied. Show that under retailers’ 
infinite risk aversion, an exclusive territories arrangement 
yields less profit than a competitive arrangement in which 
no franchise is imposed and in which the wholesale price 
is equal to the exclusive-territories retail price in the worst
state of nature minus the highest possible retail cost.
(The worst state of nature occurs for the lowest possible
demand parameter $d$ and the highest possible retail
cost $\gamma$.)




The conclusion of this study is that competition,
although it limits the retailers' reactions to their environment,
may be desirable from the manufacturer's point
of view. When the retailers are risk-averse, if the manufacturer
grants exclusive territories, he must distort the
intermediate price ($p_w > c$) in order to bear some of the
retailers' risk. The retailers exert a vertical externality on
the manufacturer through his profit margin when choos- 
ing the retail price (or services, in the other interpreta- 
tion). Thus, the manufacturer would like to control the 
retailers’ behavior. However, he cannot do so, because he 
lacks information about the retailers’ environment. And, 
further, the only kind of franchise fee he can impose is one 
that is not contingent on their environment. In contrast, 
competition, by preventing the retailers from making 
profits, does not expose them to risk. Consequently, the 
manufacturer does not distort the intermediate price to ac- 
count for risk, nor does he worry about being unable to tai- 
lor the franchise fee to each realization of the uncertainty. 




4.6.1.3 Welfare Analysis 


The main issue with vertical contracts is that, although 
they are usually efficient from the viewpoint of the parties 
who sign them,43 they do not take into account the
interests of consumers (or, more generally, third parties). 
Hence, there may be externalities, which may have to be 
corrected through public intervention. We saw in the text 
that vertical integration and vertical restraints often exert 
positive externalities on consumers. It was pointed out in 
subsection 4.2.2 that contracts help avoid an excessive 
contraction of output, thus benefitting consumers. The 
picture is quite different in the model of retailer competi- 
tion developed here, however. To see this, let us simplify 
the model by making demand a linear function of price: 


$q = D(p, d) = d - p$.

The consumer net surplus is then


$S = S(p, d) = \int_p^d (d - u)\,du = (d - p)^2/2$.


Because of the uncertainty on $d$ and $\gamma$, we must take the
expectation of consumer surplus over $d$ and $\gamma$:

$ES = E[(d - p)^2/2] = [(d^e - p^e)^2 + \mathrm{var}(d - p)]/2$,


where the superscript e denotes an expected (or average)
value of a variable and var denotes its variance. The
expectation of $d$, $d^e$, is given. The expected net consumer
surplus decreases with the average retail price and increases
with the variance of consumption.


43. At least this is the case if, on the date when the contracts are signed, all the
involved parties are symmetrically informed. They clearly should not settle for
an inefficient contract when they can all be given a higher utility.
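
The decomposition of expected surplus into a mean term and a variance term can be verified directly. The following Python sketch assumes linear demand q = d - p, as in the text, and uses a hypothetical two-state distribution of (d, p); the function names are illustrative only.

```python
def expected_surplus_direct(states):
    """E[(d - p)^2 / 2] over states given as (probability, d, p)."""
    return sum(prob * (d - p) ** 2 / 2.0 for prob, d, p in states)


def expected_surplus_decomposed(states):
    """[(d^e - p^e)^2 + var(d - p)] / 2 for the same states."""
    mean_q = sum(prob * (d - p) for prob, d, p in states)
    var_q = sum(prob * (d - p - mean_q) ** 2 for prob, d, p in states)
    return (mean_q ** 2 + var_q) / 2.0


# Hypothetical states: (probability, demand parameter d, retail price p).
states = [(0.5, 10.0, 4.0), (0.5, 14.0, 6.0)]
print(expected_surplus_direct(states), expected_surplus_decomposed(states))
```

Both functions return the same number, which is why a lower average price and a higher variance of consumption each raise expected consumer surplus.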

The aggregate welfare is defined as the sum of the 
expected net consumer surplus and the manufacturer's 
profit. The retailers are always put at their reservation 
utility, so that their utility does not depend on the arrange- 
ment that is chosen. 

Proposition 1 compared the two arrangements from 
the manufacturer's point of view. Proposition 2 deals with 
consumers and aggregate welfare. 


Proposition 2 Suppose that demand is linear: $D(p, d) = d - p$.
For any concave utility function $U$ for the retailers,
both the expected net consumer surplus and the
aggregate welfare are higher under competition than un- 
der exclusive territories. 


This result shows that consumers do prefer competition.
Furthermore, it shows that when the retailers' risk aversion
is low, the manufacturer imposes a socially undesirable
restraint on intrabrand competition (aggregate welfare
is higher under competition). The intuition for this
result44 is that, under competition, the expected retail
price is lower and the variance of consumption is higher.
Why is this so?

First, uncertainty creates a difference in expected retail
prices between the two arrangements. In a deterministic
environment, the two-part tariff accompanied by either
competition or exclusive territories is a sufficient restraint,
as was shown in sections 4.2 and 4.3. Thus, in the absence
of uncertainty, the retail price is equal to the vertically
integrated retail price in both cases.45 Introducing uncertainty
around the means of $d$ and $\gamma$ affects neither the
wholesale price nor the expected retail price under retailer
competition, because retailers are perfectly insured.46 Under
exclusive territories, however, the retailers bear some
risk, and the manufacturer decides to share some of this
risk by setting $p_w > c$ (whereas in the deterministic case
the retailer was made the residual claimant, as $p_w$ was set
equal to $c$; see section 4.2). This rise in the wholesale price
is partially passed into the retail price (see chapter 1).
Hence, under uncertainty the average retail price is higher
with exclusive territories.

Second, consumption has a higher variance under competition.
To see this, recall that the competitive retail
price reacts fully to cost disturbances and not at all to
demand disturbances ($p = p_w + \gamma$). Hence, consumption
($q = d - p = d - p_w - \gamma$) reacts fully to both types of
uncertainty. In contrast, under exclusive territories, monopoly
pricing implies a partial adjustment of the retail
price and, thus, only a partial adjustment of consumption
to both types of uncertainty.47


44. For a proof, see Rey and Tirole 1986b.


45. Hence, the argument that consumers prefer competition because it reduces
the retailers' margin is fallacious. It does not take account of the adjustment in
wholesale price between the two arrangements.


46. The retail price is $p_w + \gamma$, and the franchise fee must equal zero. Hence, the
manufacturer maximizes

$E\,(p_w - c)(d - (p_w + \gamma)) = (p_w - c)(d^e - p_w - \gamma^e)$.

Therefore $p_w$ and the average retail price ($p_w + \gamma^e$) are not affected by the
introduction of uncertainty around the means $\gamma^e$ and $d^e$.


47. The retailer ex post maximizes

$(p - p_w - \gamma)(d - p)/n - A$,

which yields

$p = (d + p_w + \gamma)/2$.
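
The variance comparison can be made concrete with a short Python sketch. It is an illustration under stated assumptions: linear demand q = d - p, a hypothetical four-state distribution of (d, retail cost), and, to isolate the variance effect, the same wholesale price in both arrangements (in the text the exclusive-territory wholesale price is generally higher, which affects the mean rather than this comparison of adjustments).

```python
def consumption_variance(states, p_w, arrangement):
    """Variance of consumption q for states given as (probability, d, gamma)."""
    def quantity(d, gamma):
        if arrangement == "competition":
            return d - p_w - gamma          # retail price is p_w + gamma
        return (d - p_w - gamma) / 2.0      # monopoly price is (d + p_w + gamma) / 2

    mean_q = sum(prob * quantity(d, g) for prob, d, g in states)
    return sum(prob * (quantity(d, g) - mean_q) ** 2 for prob, d, g in states)


# Hypothetical states: (probability, demand parameter d, retail cost gamma).
states = [(0.25, 10.0, 1.0), (0.25, 10.0, 3.0), (0.25, 14.0, 1.0), (0.25, 14.0, 3.0)]
var_comp = consumption_variance(states, 2.0, "competition")
var_excl = consumption_variance(states, 2.0, "exclusive")
print(var_comp, var_excl)
```

With linear demand the monopoly retailer passes only half of each shock into consumption, so the variance under exclusive territories is one quarter of the variance under competition in this example.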

We thus conclude that consumers prefer retailer competition
on this count as well.48 Indeed, their preference
for competition is so strong that even when the manufacturer
prefers exclusive territories, a social planner will
want to prohibit this restraint on intrabrand competition.


48. When the retail price is observable, RPM becomes enforceable (although
not necessarily optimal for the manufacturer). It can similarly be shown that, for
linear demand, the expected net consumer surplus and aggregate welfare are
higher under competition than under RPM.


A very interesting analogy (suggested by Michael
Whinston) can be drawn with third-degree price discrimination
in the case of retailer risk neutrality and demand
shocks. First, because of risk neutrality, the manufacturer 
captures the retailers’ expected profit, even under exclu- 
sive territories (in which case the monopoly retailers face 
a wholesale price equal to the marginal cost c). Second, 
the difference between exclusive territories and competi- 
tion is exactly that between third-degree price discrimina- 
tion and uniform pricing. Exclusive territories yield the 
monopoly price for each specification of the demand 
parameter d. (Think of demand d as a group of consumers 
with proportion equal to the probability of parameter d.) 
The outcome is thus the third-degree-price-discrimination 
outcome. In contrast, under competition the manufacturer 
“fixes” a retail price (equal to the wholesale price plus the 
retail cost) that is independent of d. He thus obtains the 
monopoly profit under uniform pricing. Thus, for retailer 
risk neutrality and demand shocks, proposition 2 is noth- 
ing but Robinson's theorem that uniform pricing socially 
dominates third-degree price discrimination for linear de- 
mands (see subsection 3.2.2.1). This analogy also high- 
lights the role of the linearity of demand in proposition 2. 
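
The analogy with uniform pricing versus third-degree price discrimination can be illustrated numerically. The Python sketch below is hypothetical in its numbers: two equally likely demand states, linear demand q = d - p, and a single marginal cost c that is taken to include the retail cost. It also assumes that both states are served under the uniform price.

```python
def welfare(price_by_state, states, c):
    """Expected consumer surplus plus profit for linear demand q = d - p."""
    total = 0.0
    for (prob, d), p in zip(states, price_by_state):
        q = d - p
        total += prob * (q ** 2 / 2.0 + (p - c) * q)
    return total


states = [(0.5, 8.0), (0.5, 12.0)]  # hypothetical (probability, demand parameter d)
c = 2.0                             # hypothetical marginal cost, retail cost included

# Exclusive territories: the monopoly price in each state (discrimination).
discriminating = [(d + c) / 2.0 for _, d in states]
# Competition: one price, chosen to maximize expected profit (uniform pricing).
mean_d = sum(prob * d for prob, d in states)
uniform = [(mean_d + c) / 2.0] * len(states)

w_disc = welfare(discriminating, states, c)
w_unif = welfare(uniform, states, c)
print(w_disc, w_unif)
```

In this example expected output is the same under both pricing rules, and expected welfare is higher under the uniform price, in line with the result for linear demands cited in the text.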

This model exhibits one destructive aspect of competi-
tion. That competition reduces industry profit is more
general, as will be emphasized throughout part II. Another 
destructive effect of competition was actually developed 
in subsection 4.3.2, where competition reduced the sup- 
ply of pre-sale services; however, there rent dissipation 
was socially wasteful (hurt the consumers)—it is not here, 
as the consumers enjoy increased retailer incentives. 


4.6.2 Market Foreclosure


Few topics in industrial organization are as controversial 
as market foreclosure. Very loosely, market foreclosure
refers to commercial practices (including mergers) that reduce
the buyers’ access to a supplier (which we will call up- 
stream foreclosure) and/or limit the suppliers’ access to a 
buyer (downstream foreclosure). There are many tools used 
to achieve market foreclosure. A buyer may purchase a 
supplier or set up his own production unit so as to manu- 
facture the intermediate good internally; or he can fill at 
least some of his requirements internally (tapered integra- 


tion). The upstream division may then refuse to deal with 
external buyers or, equivalently, may engage in a “price
squeeze” (i.e., charge them an exorbitant price). A supplier
may sign exclusive-dealing or exclusive-territory
contracts with his buyers. A manufacturer of two comple- 
mentary products may impose a tie-in or make his basic 
good incompatible with the complementary goods sold 
by other manufacturers. 

Though market foreclosure is a “hot” issue among those 
concerned with antitrust proceedings and with regulation, 
economists still have a very incomplete understanding of 
its motivations and effects. Nor can they always success- 
fully explain why a particular tool is employed to achieve 
foreclosure. A general survey of the theory of market 
foreclosure would be very premature. The following sub- 
sections are meant only to introduce some of the issues at 
hand. 


4.6.2.1 Generalities on Market Foreclosure 


For convenience, let us distinguish two kinds of market 
foreclosure. In the first, one of the sectors (upstream or 
downstream) is already monopolized. The goal is then for 
the monopoly supplier or buyer to exploit its monopoly 
power efficiently. The exclusion of trading partners, or 
the reduction of competition among these partners, may 
serve this purpose. In the second kind of market fore- 
closure, neither sector is monopolized, and foreclosure 
increases the monopolization of one of the two sectors. 


Foreclosure as the Efficient Exploitation of Existing Monopoly 
Power 


We already discussed reasons why a monopoly supplier
may want to deal with a single buyer.49 In the model
of Rey and Tirole (see subsection 4.6.1) the manufacturer
may impose exclusive territories on his retailers (or
wholesalers). The explanation was that retail competition
destroys profit and therefore does not allow the manufacturer
to enjoy full monopoly power. (The counterpart of
this is that retail monopoly power destroys downstream
incentives.) Such behavior is an instance of upstream foreclosure.
The manufacturer de facto denies the competing
retailers fair access to his good.


49. The issue of foreclosure may also arise in the case of a monopoly buyer.
We saw in the chapter on the theory of the firm that a buyer may want to keep
several suppliers or sources for incentive purposes. On the other hand, efficiency
may require that fixed costs not be duplicated, so that the buyer creates
a monopoly supplier (i.e., engenders downstream foreclosure).

Another motive for upstream foreclosure is price dis- 
crimination. We saw in chapter 3 that a monopoly manu- 
facturer whose intermediate good is used in two down- 
stream markets has an incentive to integrate into the 
high-elasticity market and to charge a high intermediate 
price to the firms supplying the low-elasticity market. 
Joskow (1985) notes that discrimination often motivates 
a “price squeeze.” Considering a situation in which a 
monopoly supplier is integrated downstream, he defines 
the occurrence of a price squeeze as the situation in 
which “the monopoly input supplier charges a price for 
the input to its downstream competitors that is so high 
they cannot profitably sell the downstream product in 
competition with the integrated firm” (p. 186). That is, the 
manufacturer charges a lower (internal) price to his down- 
stream division than to the downstream competitors. In 
the above price-discrimination example, the manufacturer 
need only drive his downstream competitors out of the 
high-elasticity market. The notion of a price squeeze was 
introduced by Judge Learned Hand in a price-discrimination
context (in the Alcoa case; see chapter 3).50

The same motives may apply in the case of comple- 
mentary products. As was shown earlier in this chapter, 
producing complementary products is very similar to 
operating a vertically integrated firm. A typical example 
is the problem of competitive access in the rail industry, 
where not infrequently the route from location A to loca- 
tion B (called the bottleneck) is owned by a railroad mono- 
polist while the route from location B to location C is 
served by the same monopolist as well as by a rival firm. 
Assume that there are customers who desire to ship from 
A to C, and that using the monopolist from A to B and
his competitor from B to C is ruled out by costly handling
of the merchandise. To serve such customers, the competitor's
trains must use the monopolist's tracks from A
to B. One way of regarding this situation is to think of
the bottleneck transportation as the upstream good— 
produced monopolistically—and the competitive trans- 


portation as the downstream good. We can then use the 
previous framework. The issue is whether the monopolist 
and its competitor should freely negotiate the transfer 
price to be paid by the competitor for the trackage rights 
from A to B. Should the Interstate Commerce Commission
prevent foreclosure, if any, of the access to the bottleneck
route, or should it (more generally) regulate a “fair
transfer price”? Does the monopolist value competition
sufficiently (see Baumol 1983, Grimm and Harris 1983,
and Tye 1986a, b)? A similar situation arose prior to 1984 
in the telephone industry. AT&T, an integrated firm, had 
a monopoly on the local calls but was in competition with 
other firms (e.g., MCI and Sprint) on the long-distance 
market. Its long-distance competitors had to be connected 
to AT&T's local network, however. One important regulatory
issue was to determine the transfer price charged to
AT&T's competitors for access to this network.51

Beyond foreclosing the market through refusal to deal 
or engaging in a price squeeze, the manufacturer of two 
complementary products who has monopoly power on 
one of them can use two other tools: tie-in and incom- 
patibility. For instance, International Salt, which had sub- 
stantial monopoly power on machines that inject salt into 
canned products, was challenged by antitrust authorities 
for tying the complementary product (salt), which was 
produced more competitively, and IBM was accused of 
making some of its products incompatible to gain a com- 
petitive edge. (See Ordover and Willig 1981 for an analy- 
sis of predatory product incompatibility. See also example 
8 of chapter 8.) 


Monopolization 


Suppose now that both the upstream and the downstream
industries are oligopolistic. It is often alleged that in such
a situation market foreclosure tends to create monopoly
power, either upstream or downstream. For instance, in
the Interstate Circuit case (mentioned in Krattenmaker and
Salop 1986), a Texas movie-theater company obtained
promises from film distributors that the distributors would
raise their prices to competing exhibitors. An extreme
case of such behavior is Alcoa's reported purchase of exclusionary
covenants from power companies not to sell
to other aluminum producers (while they did not even
sell to Alcoa). Such moves are part of a general strategy
to monopolize the downstream market by raising rivals'
costs (Krattenmaker and Salop 1986; Salop and Scheffman
1983).52


50. For other examples of such price discrimination, see Joskow (1985) on
discrimination by privately owned electric utilities and Scherer (1980, p. 325)
on discrimination by IBM. Howard (1983, pp. 151-54) also discusses price
squeezes. Ordover et al. (1985) shed further theoretical light on price discrimination
and squeezes.


51. This issue did not disappear with the 1984 divestiture. Although the
operating companies in charge of the local networks are now independent
of AT&T, regulatory constraints (such as equal access for long-distance companies)
must ensure that the operating companies do not duplicate through
contracting the market foreclosure that might arise under integration. For more
on the telephone industry, see Brock 1981, Evans 1983, and Temin 1987.

The threat of monopolization through mergers or ex- 
clusionary vertical restraints has always been of concern 
to antitrust authorities. For instance, the merger between 
Brown Shoe (which accounted for 5 percent of U.S. shoe 
production) and Kinney Shoe Stores (which represented 
1.6 percent of U.S. shoe distribution) was considered
illegal. Similarly, exclusive dealing arrangements have 
been frowned upon. The Standard Fashion case of 1922 
involved the exclusion of 40 percent of all dress-pattern 
outlets from dealing with a dress manufacturer's rivals. 
(The critical market share covered by an exclusive dealing
agreement with a manufacturer has been narrowed over
time.53) More recently, there has been concern over computer
reservation systems, which are rented by airlines
to travel agents (in particular, the systems promoted by
American Airlines and United Airlines cover 70 percent
of the travel-agent system).54

Progress has recently been made toward formalizing 
the effects of a particular type of market foreclosure— 
vertical integration—on the competitive structure of the 
downstream and upstream industries and on welfare (see 
Greenhut and Ohta 1979; Groff and Perry 1985; McAfee 
and McMillan 1986; Salinger 1984, 1986; Krattenmaker 
and Salop 1986; Ordover et al. 1987; Vassiliakis 1985; 
Ordover and Saloner 1987). For instance, in Vassiliakis 


52. The Krattenmaker-Salop paper contains a fascinating discussion of various 
ways of foreclosing access to suppliers or buyers, as well as examples of such 
behavior (some of which are quoted below) 


53. See, eg. the Standard Stations case, 337 US. 293 (1949). For more on 
exclusive dealing, see Marvel 1982 and Mathewson and Winter 1985 


54. The display bias becomes particularly important when airlines compete on 
the same routes. It is also asserted that independent vendors of computer 
reservation systems are at a disadvantage (they cannot bias displays in their 
favor, because they are not integrated). and, similarly. that small carriers are at 
a disadvantage (American may not bias too much against United because of the 
threat of retaliation). Little theoretical work has been done on these issues. First, 
entry into the computer-reservation-system industry is made excessively diffh- 
cult by exclusionary contracts, Not only are natural switching costs (learning, 
new printers, etc.) for the travel agent non-negligible; in addition, the airlines 
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1985 the upstream firms face a fixed cost of production: 
entry is otherwise free (see chapter 7). Integrated produc- 
tion by a downstream firm is costly because the fixed cost 
of production is amortized on too small a number of units 
(it is assumed that an integrated firm does not sell or 
buy on the intermediate-goods market). But integration 
prevents the double marginalization considered in the 
text; the independent suppliers play a Cournot game (see 
chapter 5) and therefore price above marginal cost, which 
introduces a distortion in the downstream choice. From 
this trade-off, Vassiliakis obtains an equilibrium fraction of 
integrated firms and goes on to derive interesting comparative-statics
results.⁵⁵

Because these issues refer to strategic considerations 
studied in part II of this book, no attempt will be made 
here to summarize this interesting and growing literature. 
Also, much work remains to be done. First, in the set-up 
where there are integrated and nonintegrated firms, it 
remains to be explained why integrated firms do not sell 
or buy on the intermediate-goods market.⁵⁶ If they indeed
participate in this market, the foreclosure argument
may have to be reconsidered. Similarly, the set of feasible 
contracts for nonintegrated firms (linear pricing, two-part 
pricing, etc.) ought to be spelled out, if possible, from 
primitive economic assumptions such as arbitrage, in- 
formation problems, or transaction costs. Second, as is 
usually the case with vertical restraints, careful consideration
must be given to alternative instruments. For instance,
if vertical mergers for foreclosure are to be prohibited,
can't the concerned firms duplicate the merger
allocation through a contract?⁵⁷ Third, it may be useful
to study the dynamics of market foreclosure. For example, 
in the cement industry, acquisition of a producer of 


impose important liquidated damages or buy-out penalties for switching, and
the contracts are exclusive and run for four to five years (on this, see subsection
4.6.2.2). Second, each computer reservation system has a display bias that
favors the airline selling it, and offers instantaneous confirmation of reservations
and boarding passes for that airline.


55. For instance, as the demand side of the economy is replicated so that the
number of firms goes to infinity, the fraction of integrated firms goes to zero,
but welfare does not converge to the social optimum.


56. Exceptions include Salinger 1984 and Ordover et al. 1987.


57. If not, the distinction is likely to stem from incomplete contracting and the
allocation of residual rights of control, as is explained in the chapter on the
theory of the firm.
ready-mixed concrete by a cement company may trigger
similar behavior by other cement companies (the bandwagon
effect).⁵⁸ These puzzles are challenging. Their solution
will require a solid understanding of both contract
theory and strategic competition; their importance makes 
one hope that much progress will be made. 

Certain cases of horizontal foreclosure have a similar 
flavor. For instance, in the Aspen Ski case, the firm that 
controlled three of the four major downhill skiing facili- 
ties in Aspen, Colorado, refused to market access to all 
facilities jointly with the owner of the fourth facility. This 
is a case of refusal to deal when network externalities create 
gains from trade. (Recall IBM's attempts to make its pro- 
ducts incompatible with those of other manufacturers.) 
We will return to the formalization of this issue in chapter 
10.⁵⁹ Similarly, refusing to engage in joint ventures may,
to some extent, foreclose the access of some firms to a 
new technology. 


4.6.2.2 Contracts as Barriers to Entry 


It is often alleged that long-term leasing contracts or 
contracts that impose substantial penalties for breach 
foreclose access to the downstream market for enter- 
ing suppliers. Such reasoning underlies the decision of 
Judge Wyzanski that the United Shoe Machinery Corpora- 
tion (which controlled 85 percent of the shoe-machinery 
market) had attempted to prevent entry into the market 
by offering binding leasing contracts to shoe manufac- 
turers. This decision was criticized by the prominent 
Chicago School proponents Posner (1976) and Bork 
(1978), who made the point that customers of United 
Shoe Machinery Corporation might not have signed con- 
tracts that would have strengthened this supplier's mo- 
nopoly position. (In the Chicago tradition, Posner and 
Bork attribute such contracts to efficiency reasons.) 
Aghion and Bolton (1987) reconsider the entry-deter- 
rence doctrine of market foreclosure and show that, de- 
spite the buyers’ concern for monopolization, long-term 
contracts may be signed that inefficiently deter entry. 


The Aghion-Bolton model is simple. Initially, a seller
(the “incumbent”) and a buyer contract for the delivery of
one unit of an intermediate good. The buyer has a unit
demand for the intermediate good and has valuation 1
(alternatively, he could buy a backstop substitute at price
1). The seller's cost of producing this unit is 1/2. After the
contract is signed, a competing supplier (the “entrant,”
unidentified at the date of the contract) will come up with
a cost c of producing the good. Assume that c is a priori
uniformly distributed between 0 and 1. (There is no loss
of generality in assuming that an entrant appears; a high
c corresponds, de facto, to no entry.) Assume, further, that
c is known only to the entrant.

Consider first the optimal allocation for the integrated
structure composed of the incumbent and the buyer. Because
the incumbent's cost (1/2) is lower than the buyer's
valuation (1), the buyer always consumes one unit of the
intermediate good. The issue is only to minimize the
vertical structure's expected production cost. The trade-off
is between producing internally at cost 1/2 and procuring
from the entrant at price p. This is a simple monopoly
pricing problem. The vertical structure minimizes


$$p\,\mathrm{Prob}(c \le p) + \tfrac{1}{2}\,\mathrm{Prob}(c > p) = p^2 + \tfrac{1}{2}(1 - p),$$


because the price offer from the incumbent, p, is accepted
if and only if it exceeds c, and the distribution of c is uniform.
This yields p = 1/4. In the same way that monopoly
pricing leads to a welfare loss (consumers with valuations
above the marginal cost do not buy; see chapter 1), the
vertical structure's price offer to the entrant yields an
inefficient production structure: When the entrant's cost
lies between 1/4 and 1/2, the incumbent supplies the intermediate
good.
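
The minimization above can be checked numerically. The following is a minimal sketch, assuming the model exactly as stated (incumbent cost 1/2, entrant cost uniform on [0, 1]); the function name `expected_cost` and the grid search are illustrative choices, not part of the original analysis.

```python
from fractions import Fraction

def expected_cost(p, incumbent_cost=Fraction(1, 2)):
    """Expected procurement cost when the entrant is offered price p.

    With c uniform on [0, 1], the entrant accepts with probability p,
    so the expected cost is p * p + incumbent_cost * (1 - p).
    """
    return p * p + incumbent_cost * (1 - p)

# Search a fine grid of candidate offers on [0, 1] (exact rational arithmetic).
grid = [Fraction(k, 1000) for k in range(1001)]
best_p = min(grid, key=expected_cost)

print(best_p)                 # offer that minimizes the expected cost
print(expected_cost(best_p))  # minimized expected cost
```

The grid minimum coincides with the first-order condition 2p = 1/2, and any entrant with a cost between that offer and the incumbent's cost of 1/2 is inefficiently left out.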

The next question is whether the nonintegrated struc- 
ture can realize the same outcome through a contract. 
To see that this is the case, suppose that the incumbent 
makes a take-it-or-leave-it offer of the following contract 
to the buyer: The buyer can buy from the incumbent at
price 3/4. The penalty for breach (liquidated damages) is 1/2.
Suppose that the entrant makes the buyer a take-it- 


58. According to Scherer (1980, p. 90), “When vertical integration induced by
fewness of supply sources thins the market further, other buyers may be
stampeded into integrating as well despite appreciable scale economy sacrifices.
Dynamics of this sort can be observed in the histories of the U.S. and European
refrigerator-freezer industries, the U.S. automobile industry, and the movement
by American steel makers into iron ore mining.”


59. Other cases include the expulsion from a wholesale purchasing cooperative 
in Northwest Stationers and the exclusion of nonmembers in Associated Press (see 
Krattenmaker and Salop 1986). 
or-leave-it offer (for simplicity, we put the bargaining
power on the entrant's side in its negotiation with the
buyer). The buyer will accept such an offer only if it does
not exceed 3/4 − 1/2 = 1/4. Thus, a profit-maximizing entrant
makes an offer if and only if his cost is lower than 1/4, and
the offer made is exactly equal to 1/4. Hence, the allocation
and the vertical structure's expected profit are the same 
as in the integrated case. But we still have to answer 
Posner's and Bork’s concern that the buyer may not want 
to accept a contract that helps the incumbent monopolize 
the intermediate-good market. For this, we must specify 
what happens if the buyer rejects the incumbent's con- 
tract. There are several ways of specifying this outcome. 
Fortunately, the precise specification chosen affects only 
the price to be paid to the incumbent and the penalty for 
breach, but not the difference between the two. That is, 
it affects the way the profits are divided between the 
incumbent and the buyer, but not the real allocation. Let 
us assume that if c < 1/2, the entrant enters and Bertrand
competition between the two suppliers drives the price to
a Bertrand price of 1/2 (see chapter 5). If c > 1/2, the entrant
is the high-cost supplier and does not enter; the incumbent
then charges the monopoly price, 1, to the buyer.
Thus, the expected price paid by the buyer, if he does not
sign the contract, is


$$\mathrm{Prob}(c < \tfrac{1}{2}) \times \tfrac{1}{2} + \mathrm{Prob}(c > \tfrac{1}{2}) \times 1 = (\tfrac{1}{2} \times \tfrac{1}{2}) + (\tfrac{1}{2} \times 1) = \tfrac{3}{4}.$$

But accepting the contract has the buyer pay 3/4 whatever
c (once the penalty for breach is included), so the buyer
accepts the exclusionary contract.⁶⁰,⁶¹
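
The buyer's acceptance decision and the division of surplus can be tabulated in a few lines. This is a sketch under the assumptions of the text (contract price 3/4, penalty 1/2, Bertrand competition after a rejection); all variable names are illustrative.

```python
from fractions import Fraction as F

VALUE = F(1)              # buyer's valuation
INCUMBENT_COST = F(1, 2)  # incumbent's production cost
PRICE = F(3, 4)           # contract price paid to the incumbent
PENALTY = F(1, 2)         # liquidated damages if the buyer switches

# The buyer switches only if the entrant's price plus the penalty
# does not exceed the contract price.
entrant_price = PRICE - PENALTY
prob_entry = entrant_price  # Prob(c <= entrant_price) with c uniform on [0, 1]

# Under the contract the buyer pays 3/4 in every state.
buyer_surplus_accept = VALUE - PRICE

# After a rejection: Bertrand price 1/2 if c < 1/2, monopoly price 1 otherwise.
expected_price_reject = F(1, 2) * F(1, 2) + F(1, 2) * F(1)
buyer_surplus_reject = VALUE - expected_price_reject

# The incumbent earns his margin when he supplies, the penalty when he does not.
incumbent_profit = (1 - prob_entry) * (PRICE - INCUMBENT_COST) + prob_entry * PENALTY
joint_surplus = incumbent_profit + buyer_surplus_accept

print(buyer_surplus_accept, buyer_surplus_reject, incumbent_profit, joint_surplus)
```

The buyer is exactly indifferent between signing and rejecting, which is why the take-it-or-leave-it contract is accepted; the joint surplus equals the valuation minus the minimized expected cost of the integrated structure.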

We thus conclude that the buyer and the incumbent 
can realize the vertically integrated outcome through a 
long-term contract that specifies a penalty for breach in 
case the buyer switches to another supplier. As we have 
seen, this contract creates an inefficiently low probability 


60. Even if the incumbent has all the bargaining power in his negotiation with
the buyer (i.e., makes a take-it-or-leave-it offer), he cannot realize the whole
integrated structure's profit, which is 9/16. His profit is only 5/16, because the
buyer can threaten to get 1/4 by rejecting the contract.


61. As was noted above, the exact division of surplus and the final allocation
of production depend on the relative bargaining powers and on the assumption
about price competition in case of rejection. But the incentive for the incumbent
and the buyer to exploit their monopoly power with respect to the entrant
remains.
of entry from a social point of view. There is too much
market foreclosure (too little competition).⁶²

Aghion and Bolton then analyze the externalities among 
several buyers. (To abstract from downstream competi- 
tion, let us assume that the buyers are in different indus- 
tries.) If the entrant produces under constant returns to 
scale (in particular, if he does not face a fixed cost of entry 
or production, i.e., a cost that is independent of the scale
of operation), the buyers are “unrelated” and the contract 
signed between the incumbent and one buyer exerts no 
externality on the other buyers. The contract is then the 
same as in the one-buyer case. When the entrant faces a 
fixed cost, however, a buyer who signs a contract with 
the incumbent that reduces the entrant's profitability of 
dealing with the buyer (because of penalties for breach) 
reduces the size of the entrant’s market and thus reduces 
the probability of entry. Because the fixed cost can be 
recovered only through mass production, a buyer's exclu- 
sivity or long-term contract exerts negative externalities 
on the other buyers. 

An obvious observation is that the incumbent may be 
able to extract large surpluses from the individual buyers, 
which he could not do if the buyers colluded. To see why, 
suppose that there are many small buyers, and suppose 
that the entrant's fixed cost is sufficiently large that entry 
to supply a single buyer is never profitable. Then if all
the other buyers accept a contract tying them to the 
incumbent, an individual buyer has no threat to turn to 
the entrant. The incumbent can then obtain the natural- 
monopoly price (equal to 1). In general, however, the 
incumbent may offer a contract that allows the entrant to 
enter with some probability (as in the one-buyer case), 
and can do even better than a natural monopoly by ex- 
tracting some of the efficiency gains from letting the 
entrant produce (as long as the buyers do not organize to 
synchronize their decisions).

An issue not analyzed in Aghion and Bolton 1987 but 


62. There are reasons why it may be socially efficient to bias the recontracting
process in favor of the incumbent. In particular, it can be shown that, both
socially and privately, the initial contract should favor the incumbent at
the contract-renewal stage (relative to a bidding procedure) if the incumbent
invests in transferable assets (e.g., machines or know-how that can be used by
the entrant in case of entry); see Laffont and Tirole 1987. This result does not
contradict the conclusion of Aghion and Bolton (1987) that there is too little
switching in equilibrium, but it may make its application more difficult in cases
involving non-negligible transferable investments (how distant is the private
bias from the socially optimal one?).
which could in principle be analyzed in an extended version
of their model is that of the optimal timing of contracts
for the incumbent. This might shed some light on
the validity of the common allegation that established 
suppliers optimally deter entry through staggered con- 
tracts with their downstream customers (as is done with 
computer reservation systems). 

Last, Aghion and Bolton show that an incumbent who 
has private information about the probability of entry 
(presumably because of superior knowledge of the tech- 
nology) tends to lower the penalties for breach (i.e., to 
offer “shorter-run” contracts) relative to the case in which 
the buyer is as well informed as the incumbent. The idea 
is that the incumbent tries to push the buyer to accept his 
terms by signaling that the probability of entry is small 
(i.e., that the distribution of c is biased toward high costs).
The way to signal a low probability of entry is to impose 
a low penalty for breach in order to “prove” that one is 
not concerned about the possibility of entry. 
Answers and Hints
Exercise 4.1 
Start with the retailer:


$$\max_p \,(p - p_w) D(p)$$


implies that

$$D(p) + (p - p_w) D'(p) = 0, \tag{1}$$


which defines a function $p^*(p_w)$ (assuming that the objective
function is concave). We have


$$[2D'(p) + (p - p_w) D''(p)]\,dp^* - D'(p)\,dp_w = 0.$$

Hence, using equation 1, we obtain


$$\frac{dp^*}{dp_w} = \frac{1}{2 - D(p) D''(p)/[D'(p)]^2}. \tag{2}$$

Hence, $dp^*/dp_w$ is greater or less than 1/2 if the demand
function is convex or concave.

The manufacturer maximizes $(p_w - c) D(p^*(p_w))$:


$$D(p) + (p_w - c) D'(p) \frac{dp^*}{dp_w} = 0. \tag{3}$$


Equations 1-3 yield the desired result.
Exercise 4.2 


(i) To obtain the equivalence, let firm 1 be the manufacturer
and let firm 2 be the retailer. $p_1$ stands for the
wholesale price $p_w$; $p_2 = p - p_w$; $p$ is the price of the final
good; $c_1$ is the unit manufacturing cost; $c_2$ is the unit retail
cost (zero in the text).

(ii) The vertically integrated price is


$$p = \frac{c}{1 - 1/\varepsilon}$$


(see chapter 1), where $c = c_1 + c_2$.

(iii) Firm 2, knowing $p_1$, chooses $p_2$ (or, equivalently,
$p = p_1 + p_2$) so as to maximize

$$(p_2 - c_2) D(p) = [p - (p_1 + c_2)] D(p).$$


Thus, everything is as if firm 2 faced marginal cost $p_1 + c_2$
(recall the analogy with the vertical structure here). From
chapter 1, we know that firm 2 chooses

$$p = \frac{p_1 + c_2}{1 - 1/\varepsilon}.$$

Let $p(p_1)$ denote this final price. ($c_2$ and $\varepsilon$ are constant, so
we do not include them in this function.) We have

$$\frac{dp}{dp_1} = \frac{1}{1 - 1/\varepsilon};$$


any increase in the first price is magnified by the second
distortion. Firm 1 maximizes


$$(p_1 - c_1) D(p(p_1)).$$

Hence, by the chain rule of differentiation,


$$(p_1 - c_1) D' p' + D = 0.$$


Straightforward computations lead to


$$p = \frac{c}{(1 - 1/\varepsilon)^2}.$$


(iv) In the simultaneous-choice case, firm i chooses $p_i$ so
as to maximize


$$(p_i - c_i) D(p_i + p_j),$$

where $p_j$ is considered as given ($j \ne i$). Thus,

$$(p_i - c_i) D' + D = 0.$$

Adding these equations for the two firms gives

$$(p - c) D' + 2D = 0,$$

or


$$p = \frac{c}{1 - 2/\varepsilon}.$$


Note that $p > c/(1 - 1/\varepsilon)^2$.
Explanation 


Consider the price charged by firm 1 in the simultaneous 
mode. If in the sequential mode firm 1 charges the same 
price, then firm 2 will also choose the same price as in the 
simultaneous mode (since its price is optimal given firm 
1's). Now consider a slight decrease in firm 1's price. If
firm 2 did not change its price, firm 1's profit would be 
affected only to the second order (because firm 1's price is 
optimal given firm 2's). But this also has an effect on firm 
2's price, which is reduced (see iii). This effect increases 
demand, and thus increases firm 1's profit, as p, — c, > 0. 
Hence, a small price decrease is profitable for firm 1. This
type of reasoning will be developed more systematically 
in chapters 5 and 8. 
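
The ranking of the three final prices in exercise 4.2 can be illustrated numerically. The sketch below assumes a constant-elasticity demand D(p) = p^(-ε) with ε = 4, and costs c1 = 1 and c2 = 0; these values and all function names are illustrative assumptions, not taken from the exercise.

```python
EPS = 4.0          # demand elasticity (assumed; must exceed 2 in this sketch)
C1, C2 = 1.0, 0.0  # unit costs of firm 1 and firm 2 (assumed)
C = C1 + C2

def demand(p):
    """Constant-elasticity demand, D(p) = p ** (-EPS)."""
    return p ** (-EPS)

def final_price(p1):
    """Firm 2 treats p1 + C2 as its marginal cost and applies the monopoly markup."""
    return (p1 + C2) / (1 - 1 / EPS)

# Sequential mode: firm 1 searches a grid of p1 values, anticipating firm 2's reaction.
grid = [C1 + k / 10000 for k in range(1, 20001)]
p1_star = max(grid, key=lambda p1: (p1 - C1) * demand(final_price(p1)))

p_integrated = C / (1 - 1 / EPS)
p_sequential = final_price(p1_star)
p_simultaneous = C / (1 - 2 / EPS)

print(p_integrated, p_sequential, p_simultaneous)
```

Under these assumed parameters the integrated price is the lowest and the simultaneous-mode price the highest, and the grid-search value for the sequential mode agrees with c/(1 − 1/ε)² up to the grid step.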


Exercise 4.3 


Clearly, the manufacturer must impose a minimum quantity:
$q \ge q^m$. The retailer maximizes


$$[p - p_w - \Phi(s)] D(p, s)$$


subject to the constraint $D(p, s) \ge q^m$. In order for the
manufacturer to realize the vertically integrated profit, the
solution to this program must be $p = p^m$ and $s = s^m$ (so
that the vertical structure's profit is maximized), and $p_w$
must be equal to $p - \Phi(s) = p^m - \Phi(s^m)$ (so that the
retailer's profit is appropriated by the manufacturer). Conversely,
suppose that the retailer maximizes


$$[p - p^m + \Phi(s^m) - \Phi(s)] D(p, s)$$


subject to the constraint $D(p, s) \ge q^m$. If we forget about
the constraint, the retailer will choose p and s such that
$D(p, s) < q^m$, because he faces a cost $p^m - \Phi(s^m)$ for the
intermediate good that exceeds the vertical structure's
marginal cost c. Hence, in the absence of the constraint,
the retailer would induce a lower demand than $q^m$. (The
proof of this is almost identical to that for the result in
chapter 1 that the monopoly price increases with the
marginal cost.) Thus, the constraint is binding: $D(p, s) = q^m$.
The maximization program is now equivalent to
$\max\,[p - \Phi(s)]$ subject to $D(p, s) = q^m$. However, from
the maximization program for the vertically integrated
structure, the solution to this program is $p = p^m$ and
$s = s^m$.
Exercise 4.4** 


On multilateral moral hazard and the role of third par- 
ties, see Holmstrom 1982. 

(i) The manufacturer's marginal revenue is $p^m - \Phi(s^m)$;
he chooses the right amount of services, $S^m$, if
the retailer chooses $p^m$ and $s^m$. The retailer buys the intermediate
good at marginal cost. Hence, he ought to choose
$p^m$ and $s^m$. The presence of the third party allows both the
retailer and the manufacturer to be residual claimants.
(The role of a third party in breaking the budget balance
between two parties was particularly emphasized by
Groves [1973] and Holmstrom [1982].) To appropriate the
retailer's profit, the manufacturer should set

$$A = [p^m - c - \Phi(s^m)] D(p^m, s^m, S^m).$$

The third party makes a zero profit:

$$-[p^m - \Phi(s^m)] q^m + c q^m + [p^m - c - \Phi(s^m)] q^m = 0.$$


(ii) The third party is a marginal source, in that a unit
increase in output yields him


$$c + \Phi(s^m) - p^m < 0.$$


(The third party's margin is equal to minus the vertically
integrated margin, which is not surprising.) The manufacturer
and the retailer, if they collude, have a joint profit
margin of


$$[p - c - \Phi(s)] + [p^m - c - \Phi(s^m)],$$


which exceeds the joint profit margin in the absence of a
source:


$$p - c - \Phi(s).$$


Hence, collusion between these two parties leads to an
output above $q^m$ and, because marginal units are costly
to the source, to a negative profit for the source.


Exercise 4.5 


Another explanation was given in chapter 3: The tie-in of 
spare parts or maintenance can be used as a counting 
device to discriminate between high-use and low-use con- 
sumers. For a more general discussion of tie-ins and their 
link with the input substitution model, see Shughart 1985. 

(i) Informally: The rental cost of the capital (the jet)
equals the interest on the durable good, $r p_w$, plus the
product of the probability that the good breaks down,
$\alpha(x')$, and the replacement cost, $p_w$. To this must be added
the maintenance cost, $c'x'$. Thus, the marginal cost of one
unit of output is


$$\min_{x'} \{c'x' + p_w [r + \alpha(x')]\}.$$

More formally: If the downstream industry buys the durable
good, the expected total cost (gross of maintenance
costs) of owning one unit of the durable good permanently
is

$$C = p_w + \int_0^\infty \alpha e^{-\alpha t} e^{-rt} C \, dt.$$


The owner must pay $p_w$ now. The probability at date 0 of
the durable good's breaking down between t and t + dt
is $\alpha e^{-\alpha t}\,dt$. The owner then faces an expected total cost of C
anew. Integrating, we obtain


$$rC = p_w (r + \alpha).$$


Thus, again, the cost per unit of time is $p_w(r + \alpha) + c'x'$.

(ii) The monopolist sells at $p_w > c$. The first-order condition
for the downstream industry is


$$c' + p_w \alpha'(x') = 0,$$

and that for the vertically integrated structure is

$$c' + c\,\alpha'(x') = 0.$$

Because $\alpha'' > 0$, $x'$ is smaller under vertical integration.
(The result that cost minimization is destroyed by excessive
maintenance downstream was obtained by Schmalensee
[1974]. In contrast with a sale policy, the rental
policy yields cost minimization and is preferred by the
monopolist.) Sufficient restraints include a tie-in and a
royalty.

(iii) Introduce a fictitious variable,


$$x \equiv r + \alpha(x').$$


(Note that $dx/dx' < 0$ and $d^2x/dx'^2 > 0$.) Thus, the downstream
industry minimizes


$$c'x' + p_w x(x').$$


In the input-substitution problem in the text, the downstream
industry minimizes $c'x' + p_w x$ subject to $f(x, x') =$ constant
(a cost-minimization program). The constraint
can be written as $x = x(x')$ with $dx/dx' < 0$ (inputs are
substitutes) and $d^2x/dx'^2 > 0$ (the production function is
concave).


Exercise 4.6 


(i) A franchise fee is not sufficient. Whatever the marginal
wholesale price $p_w$, competition generally does not
lead to the vertically integrated level of services. (Actually,
A = 0 because competition drives retailers' profits
down to zero.)

(ii) RPM is sufficient. The choice of p and $p_w$ fixes the
competitive level of services: $\Phi(s) = p - p_w$. Hence, it
suffices to impose $p = p^m$ and charge the wholesale
price $p_w = p^m - \Phi(s^m)$. Quantity fixing is also a sufficient
instrument.
Exercise 4.7


Let $p^m(p_w + \gamma, d)$ denote the monopoly price in state of
demand d with total input price $p_w + \gamma$, where $p_w$ denotes
the wholesale price under exclusive territories. The manufacturer's
profit under exclusive territories is


$$E[(p_w - c) D(p^m(p_w + \gamma, d), d)] + [p^m(p_w + \bar{\gamma}, \underline{d}) - p_w - \bar{\gamma}]\, D(p^m(p_w + \bar{\gamma}, \underline{d}), \underline{d}).$$


The second term in this profit represents the sum of the 
franchise fees over all retailers. (Because the retailers are 
infinitely risk-averse, their utility is equal to their profit in 
the worst state of nature minus the franchise fee; and this 
utility must be non-negative.) 

Because we assumed that the monopoly price responds 
less than fully to variations in cost, 


$$p^m(p_w + \bar{\gamma}, d) - p^m(p_w + \gamma, d) \le \bar{\gamma} - \gamma,$$


and that the monopoly price is nondecreasing in the de- 
mand parameter, 


$$p^m(p_w + \gamma, \underline{d}) \le p^m(p_w + \gamma, d),$$
we have 


Dip" (pe + id). d) < Dip p, + Fd) — 


+7.) 


The manufacturer's profit under exclusive territories is
clearly bounded above by that under the alternative com- 
petitive arrangement, which is equal to 


EID p™ (pre + Fd) — 7 + dp py, + 7d) — 
Part II


Strategic Interaction 


Introduction 
Price and Nonprice Competition 


In an oligopolistic market structure, a firm no longer en- 
counters a passive environment. Therefore, we need to 
incorporate the strategic interactions of various decision 
makers in our models. To do this, we will make extensive 
use of the theory of noncooperative games. 

Firms can use many instruments to compete in a mar- 
ket. Grossly simplifying, we can classify these instruments 
according to the speed at which they can be altered. In 
the short run, price is often the main instrument that a 
firm can change easily (other instruments include adver- 
tising and sales-force effort). Therefore, we begin our 
analysis with price competition in the context of rigid 
cost structures and product characteristics. In the some- 
what longer run, cost structures and product character- 
istics can be altered, either together or separately. Produc- 
tion techniques can be rearranged and improved upon; 
capacity can be increased. Product characteristics (quality, 
product design, delivery delay, location of outlets, and so 
forth) can be changed. The consumers’ perception of the 
product, which influences the demand function, can be 
modified by advertising. Ultimately, there is the decision 
of whether or not to enter or stay in the market (a “0-1” 
choice). Finally, in the long run, the product characteristics 
and the cost structures may be changed, not only by 
simple adjustments within the existing set of products 
and feasible costs, but also by a modification of this set. 
Research and development allows firms to expand their 
choice sets. “Process innovation” alters the technological 
production possibilities, and “product innovation” affords 
the creation of new products. 

We can very crudely schematize the different stages of 
competition as in figure 1. 

[Figure 1: schematic of the stages of competition; recoverable box labels: “Research and development” and “Determination of cost and capacity.”]

Chapter 5 deals with short-run price competition, examines
the Bertrand paradox (in which two or more
identical firms producing a homogeneous good with a
constant-returns-to-scale technology in equilibrium sell
at marginal cost and make no profit), discusses why
the Bertrand conclusion is disturbing, and suggests three
factors that, in practice, smooth price competition. The
effect of capacity constraints is studied later in the chapter.
Chapter 6 looks at repeated price competition, and
chapter 7 introduces product differentiation. The last
three chapters deal with barriers to entry, accommodation,
predation, and exit (chapters 8 and 9) and with
competition in research and development and the adoption
of new technologies (chapter 10).


Noncooperative Games and Strategic Behavior 


We shall model oligopolistic behavior as noncoopera- 
tive games in which each firm behaves in its own self- 
interest. We are especially interested in the equilibria of 
these games. Nash equilibrium is the basic solution concept
in game theory. A set of actions¹ is in Nash equilibrium
if, given the actions of its rivals, a firm cannot
increase its own profit by choosing an action other than
its equilibrium action. For example, take two firms (the
analysis generalizes trivially to n firms). Firm i (i = 1, 2)
earns profit $\Pi^i(a_i, a_j)$, where $a_i$ is the action of firm i and
$a_j$ is the action of its rival. We say that a pair of feasible
actions $(a_1^*, a_2^*)$ is in Nash equilibrium if, for all i and any feasible
action $a_i$,


$$\Pi^i(a_i^*, a_j^*) \ge \Pi^i(a_i, a_j^*). \tag{1}$$
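
Condition 1 can be checked mechanically when each firm has finitely many actions. The following is a minimal sketch for a hypothetical two-firm pricing game; the payoff numbers and the helper `is_nash` are illustrative assumptions only.

```python
def is_nash(payoffs, a1, a2):
    """Check condition (1) for the action pair (a1, a2).

    payoffs[(x, y)] = (profit of firm 1, profit of firm 2).
    """
    actions1 = {x for x, _ in payoffs}
    actions2 = {y for _, y in payoffs}
    # No unilateral deviation by firm 1 (holding a2 fixed) may raise its profit.
    best1 = all(payoffs[(a1, a2)][0] >= payoffs[(x, a2)][0] for x in actions1)
    # No unilateral deviation by firm 2 (holding a1 fixed) may raise its profit.
    best2 = all(payoffs[(a1, a2)][1] >= payoffs[(a1, y)][1] for y in actions2)
    return best1 and best2

# Hypothetical pricing game: each firm charges a high or a low price.
payoffs = {
    ("high", "high"): (3, 3),
    ("high", "low"): (0, 4),
    ("low", "high"): (4, 0),
    ("low", "low"): (1, 1),
}
equilibria = [pair for pair in payoffs if is_nash(payoffs, *pair)]
print(equilibria)
```

In this assumed game the only pair that survives every unilateral deviation is the one in which both firms charge the low price, even though both would earn more at the high price.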


The strategies we study here are pure strategies; each 
firm chooses a simple action. We could also consider 
mixed strategies where each firm chooses randomly from 
a set of actions. Of course, in order for firm i to be willing 


to choose randomly from this set of different actions, all 
actions in it must yield the same profit (or expected profit, 
if firm j also plays a mixed strategy), and this profit must 
be optimal relative to the set of feasible actions $a_i$.

Nash equilibrium generalizes naturally to dynamic situa- 
tions and to problems of incomplete information. We 
shall consider first the dynamic concept of Nash equilib- 
rium (designated “perfect” in the jargon of game theory). 
This idea becomes particularly important as soon as there 
are many time periods and any intertemporal dependency 
of profits or feasible action sets; that is, when the actors 
make choices in period t that affect their objective functions
or their set of feasible choices in a future time period
t + t′, where t′ > 0. To determine the consequences of
actions taken in t, the players must forecast what will
happen in t + t′ given the state of the game at the beginning
of that period (which is influenced by their actions in
t). To calculate these expectations, each player assumes
that all other players will play an optimal strategy in
t + t′. Therefore, the solution of a dynamic game is “backward
looking.” For example, in a two-period game, we
start by solving the second-period Nash equilibrium as a 
function of the state of the game at the beginning of the 
second period (that is, on the basis of what happened 
during the first period). This means that the players can 
determine the future consequences of their first-period 
actions, because their first-period actions determine which 
second-period equilibrium will ensue; in a sense, given 
their first-period actions, the remainder of the game is a 
foregone conclusion. Therefore, the players choose their 
first-period actions with an eye toward their consequences 
in both periods. Thus, it suffices to determine the Nash 
equilibrium of the corresponding game in which players 
take only first-period actions but with the same set of 
consequences as in the original two-period game. All this 
may appear rather abstract, but it will become much clearer 
with some examples. 
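
As one such illustration, the two-step procedure can be sketched in code. The example assumes a linear Cournot duopoly in the second period (inverse demand P = a − q1 − q2) and, as a simplification, lets only firm 1 act in the first period by choosing whether to make a cost-reducing investment; all numbers and function names are hypothetical.

```python
def cournot_profits(c1, c2, a=10.0):
    """Second-period Nash equilibrium profits with inverse demand P = a - q1 - q2.

    Each firm's first-order condition is a - 2*q_i - q_j - c_i = 0, which gives
    q_i = (a - 2*c_i + c_j) / 3 and an equilibrium profit of q_i ** 2.
    """
    q1 = (a - 2 * c1 + c2) / 3
    q2 = (a - 2 * c2 + c1) / 3
    return q1 ** 2, q2 ** 2

def firm1_total_profit(invest, cost_of_investing=3.0):
    """Firm 1's payoff in the reduced first-period game.

    The first-period choice fixes the state (firm 1's marginal cost) at the
    start of the second period; the second-period equilibrium then follows.
    """
    c1 = 2.0 if invest else 4.0
    profit1, _ = cournot_profits(c1, c2=4.0)
    return profit1 - (cost_of_investing if invest else 0.0)

# Step 2 of the procedure: choose the first-period action, taking the
# second-period equilibrium as a known function of that action.
best_choice = max([False, True], key=firm1_total_profit)
print(best_choice, firm1_total_profit(False), firm1_total_profit(True))
```

The point of the sketch is the order of operations: the second-period equilibrium is solved first as a function of the state, and the first-period decision is then evaluated against that function.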

The Nash concept also generalizes to situations of 
asymmetric information. For instance, a firm may ex ante 
have one of two cost structures and be the only party to 
know which of these two is realized. The other parties 
must then figure out how this firm plays optimally for 


1. For expositional simplicity, we blur the distinction between action and
strategy here. See the Game Theory User's Manual for more details.

each possible realization of the cost structure. The notion
of Bayesian equilibrium shows precisely how Nash
equilibrium may be extended to this type of situation. 
Finally, in dynamic games with asymmetric information, 
the notions of perfect equilibrium and Bayesian equilib- 
rium can be combined to further extend the relevance of 
Nash equilibrium. 

Because most problems of industrial organization can 
be solved with a handful of basic game-theoretic con- 
cepts, it is recommended that the reader develop at least 
a casual familiarity with game theory. Although most of 
the arguments in part II can be understood at an intuitive 
level, the reader will benefit from a formal acquaintance 
with the concept of Nash equilibrium and its extensions, 
in the same way that optimization techniques clarify the 
study of the exercise of monopoly power. The reader 
may find the Game Theory User's Manual (chapter 11) 
useful in this respect. 

Does noncooperative game theory remain relevant in 
situations in which firms appear to collude? In industrial 
organization, as in other fields, collusion and noncoopera- 
tive behavior are not inconsistent. First, an altruistic party's 
objective function may embody the objectives of another 
party. In such a case, the first party's own interest is to 
make decisions that help the other party. (Here, altruism 
means cooperative actions taken purely for reason of self- 
interest.) Second, in the absence of altruism, parties facing 
conflicts may wish to change the rules of the game they 
are playing if this game has disastrous consequences for 
them. Signing a contract is a way of doing so. For in- 
stance, duopolists may agree to share the market in order 
to avoid cutthroat competition. However, signing a con- 
tract is formally only a part of a bigger noncooperative 
game. These two reasons why collusion can emerge from 
self-interested behavior may have limited relevance in IO. 
First, firms are rarely thought of as altruistic. Second, 
signing collusive contracts to prevent competition is 
often illegal. A third and more important reason is that, 
in a dynamic context, a firm may want to “pull its 
punches” because an aggressive action would trigger a 
rational reaction or retaliation from its opponents. (This 
will be emphasized in chapter 6 and, to a lesser extent, in 
chapter 8.) Again, the collusion is only apparent; it results
from optimal noncooperative behavior. (This type of
collusion is sometimes called tacit collusion.) 


Reaction Functions: Strategic Complements and 
Substitutes 


Consider a simultaneous-move game between (for simplicity)
two firms. Assume that each action belongs to the
real line and that the profit functions $\Pi^i(a_i, a_j)$ are twice
continuously differentiable in the actions. The (necessary)
first-order condition for a Nash equilibrium is that for
each firm i


$$\Pi^i_i(a_i^*, a_j^*) = 0, \tag{2}$$


where a subscript denotes a partial derivative (e.g., $\Pi^i_i
= \partial \Pi^i / \partial a_i$). The second-order condition is that $a_i = a_i^*$
yields a local maximum:


$$\Pi^i_{ii}(a_i^*, a_j^*) < 0. \tag{3}$$


Assume that each firm's profit function is strictly concave
in its own action everywhere: $\Pi^i_{ii}(a_i, a_j) < 0$ for all $(a_i, a_j)$.
Then the second-order condition is satisfied and, furthermore,
the first-order condition given in equation 2 is
sufficient for a Nash equilibrium. A Nash equilibrium is
then given by a system of two equations with two unknowns
(equation 2).

Let us define $R_i(a_j)$ as the best action for firm i given
that firm j chooses $a_j$:


$$\Pi^i_i(R_i(a_j), a_j) = 0. \tag{4}$$


$a_i = R_i(a_j)$ is unique from our strict-concavity assumption,²
and is called firm i's reaction to $a_j$. A Nash equilibrium
is a pair of actions $(a_1^*, a_2^*)$ such that $a_1^* = R_1(a_2^*)$
and $a_2^* = R_2(a_1^*)$. In such an equilibrium, each firm reacts
optimally to the other firm's anticipated action.

A crucial element of part II is the sign of the slope
of reaction functions for the various strategic variables
we consider. This slope is obtained by differentiating
equation 4:


$$R_i'(a_j) = -\frac{\Pi^i_{ij}(R_i(a_j), a_j)}{\Pi^i_{ii}(R_i(a_j), a_j)}. \tag{5}$$
2. We will assume that it exists and is an interior solution. In other words,
going to the boundary of the feasible set of actions (e.g., $-\infty$ or $+\infty$) is not
optimal for firm i.

Figure 2


We thus have $\operatorname{sign}(R_i') = \operatorname{sign}(\Pi^i_{ij})$. $\Pi^i_{ij}$ is the cross-partial
derivative of firm i's profit function, i.e., the derivative
of its marginal profit with respect to its opponent's action.
The reaction curve is upward sloping if $\Pi^i_{ij} > 0$
and downward sloping if $\Pi^i_{ij} < 0$. Following Bulow,
Geanakoplos, and Klemperer,³ we will also consider the
actions of the two firms to be strategic complements if
$\Pi^i_{ij} > 0$ and strategic substitutes if $\Pi^i_{ij} < 0$.⁴ As we shall
see further on, prices are often strategic complements, and
capacities are often strategic substitutes.
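
As a rough numerical illustration of this sign condition, the following Python sketch estimates the slope of a reaction function, $-\Pi^i_{ij}/\Pi^i_{ii}$ (obtained by differentiating equation 4), by finite differences. The two profit functions (a linear quantity game and a linear differentiated-price game), the unit cost of 0.1, and all function names are assumptions made for this example; they do not come from the text.

```python
# Hypothetical profit functions, chosen only for illustration.

def cournot_profit(q_i: float, q_j: float, c: float = 0.1) -> float:
    """Quantity game with assumed inverse demand P = 1 - q_i - q_j."""
    return (1.0 - q_i - q_j - c) * q_i


def price_game_profit(p_i: float, p_j: float, c: float = 0.1) -> float:
    """Price game with assumed differentiated demand D_i = 1 - p_i + 0.5 * p_j."""
    return (p_i - c) * (1.0 - p_i + 0.5 * p_j)


def cross_partial(profit, a_i: float, a_j: float, h: float = 1e-4) -> float:
    """Central finite-difference estimate of the cross-partial of firm i's profit."""
    return (profit(a_i + h, a_j + h) - profit(a_i + h, a_j - h)
            - profit(a_i - h, a_j + h) + profit(a_i - h, a_j - h)) / (4.0 * h * h)


def own_second_partial(profit, a_i: float, a_j: float, h: float = 1e-4) -> float:
    """Central finite-difference estimate of the second derivative in firm i's own action."""
    return (profit(a_i + h, a_j) - 2.0 * profit(a_i, a_j)
            + profit(a_i - h, a_j)) / (h * h)


def reaction_slope(profit, a_i: float, a_j: float) -> float:
    """Slope of the reaction function: minus the cross-partial over the own second partial."""
    return -cross_partial(profit, a_i, a_j) / own_second_partial(profit, a_i, a_j)


def classify(profit, a_i: float, a_j: float) -> str:
    """Label the actions by the sign of the reaction-function slope."""
    slope = reaction_slope(profit, a_i, a_j)
    return "strategic complements" if slope > 0 else "strategic substitutes"
```

Under these assumed functional forms, `classify(cournot_profit, 0.3, 0.3)` should report strategic substitutes (slope close to -0.5) and `classify(price_game_profit, 0.5, 0.5)` strategic complements (slope close to 0.25).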

The construction of the reaction functions in a simul- 
taneous-move game, performed in figure 2, is no more 
than a technical and illustrative device. By definition of 
simultaneous choices, a firm chooses its action before
observing that of its opponent. Hence, it has no possibility
of reacting. Reaction functions depict what a firm
would do if it were to learn of a change in its opponent's
action (which it does not). Points other than the Nash 
point on the reaction curves are never observed. 

In contrast, reaction functions have real economic content
in dynamic (sequential) games. For instance, if firm
i chooses $a_i$ first and firm j observes this choice before
choosing $a_j$, firm i can use the function $R_j$ to compute
how a change in its behavior affects its opponent's
behavior.


3. J. Bulow, J. Geanakoplos, and P. Klemperer, “Multimarket Oligopoly: Stra- 
tegic Substitutes and Complements,” Journal of Political Economy 93 (1985): 
488-511. 
4. This terminology is inspired by demand theory. Two goods are complements
for a consumer if a decrease in the price of one good makes the other
good more attractive to the consumer. Here, a decrease in $a_j$ induces a decrease
in $a_i$ if $\Pi^i_{ij} > 0$, and conversely for substitutes.


Short-Run Price
Competition


The study of price competition—a fundamental part of 
oligopoly theory—is one of its weakest links. It so hap- 
pens that the most obvious natural formalization yields a 
result that is sometimes unconvincing. Deeper reflection 
shows that this formalization is economically naive, and 
alternative approaches come to mind. 

In this chapter we assume that firms “meet only once” 
in the market. They simultaneously and noncooperatively 
charge a price. The Bertrand paradox, discussed in section 
5.1, states that under these circumstances even oligopolists 
behave like competitive firms—that is, the number of 
firms in the industry is irrelevant to the study of price 
behavior. Section 5.2 offers an overview of the three 
alternative approaches that will be developed below and 
in the next two chapters. Section 5.3 introduces one such 
approach which is associated with decreasing returns to 
scale or capacity constraints; it studies foundations for 
the rival model to the Bertrand paradigm, the Cournot 
model of competition in quantities. The Cournot model 
assumes that firms pick quantities rather than prices, and 
that an auctioneer chooses the price to equate supply 
and demand. This model has been justly criticized on 
the grounds that no such auctioneer exists and that 
firms ultimately choose prices. The point of section 5.3 
is that Cournot competition may be thought of as a 
two-stage game in which firms first choose capacities 
(or, more generally, scale variables) and then compete 
through prices. Section 5.4 reviews the main properties 
of the Cournot paradigm. Section 5.5 discusses concentra- 
tion indices. The supplementary section completes sec- 
tions 5.3 and 5.4 with a discussion of capacity-constrained 
price competition and other aspects of the Cournot
model. 


5.1 The Bertrand Paradox 


To simplify, let us take the case of a duopoly. The analy- 
sis generalizes straightforwardly to the case of n firms. 


Assume that two firms produce identical goods which are 
“nondifferentiated” in that they are perfect substitutes in 
the consumers’ utility functions. Consequently, consumers 
buy from the producer who charges the lowest price. If the 
firms charge the same price, we must make an assumption 
about the distribution of consumers between them. We 
assume that each firm faces a demand schedule equal to 
half of the market demand at the common price (the half 
is not a crucial assumption). Further, we assume that the 
firm always supplies the demand it faces (this assump- 
tion is not crucial here). The market demand function is 
q = D(p). Each firm incurs a cost c per unit of production. 
Therefore, the profit of firm i is 


$$\Pi^i(p_i, p_j) = (p_i - c)\, D_i(p_i, p_j), \tag{5.1}$$


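where the demand for the output of firm i, denoted $D_i$, is
given by


$$D_i(p_i, p_j) = \begin{cases} D(p_i) & \text{if } p_i < p_j, \\ \tfrac{1}{2} D(p_i) & \text{if } p_i = p_j, \\ 0 & \text{if } p_i > p_j. \end{cases}$$

The aggregate profit,


$$\min_i\, (p_i - c) D(p_i),$$


cannot exceed the monopoly profit,


$$\Pi^m = \max_p\, (p - c) D(p).$$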


Each firm can guarantee itself a non-negative profit by 
charging a price above the marginal cost. Hence, any 
reasonable prediction must yield 


$$0 \le \Pi^1 + \Pi^2 \le \Pi^m.$$


The firms choose their prices both simultaneously and 
noncooperatively. Simultaneity means that each firm has 
not yet observed the other firm’s price when choosing its 
own. Rather, a firm anticipates it. We assume that it does 
so correctly. A Nash equilibrium in prices (sometimes
referred to as a Bertrand equilibrium) is a pair of prices
$(p_1^*, p_2^*)$ such that each firm's price maximizes that firm's
profit given the other firm's price. Formally, for all i = 1,
2 and for all $p_i$,


$$\Pi^i(p_i^*, p_j^*) \ge \Pi^i(p_i, p_j^*).$$
The Bertrand (1883) paradox states that the unique
equilibrium has the two firms charge the competitive
price: $p_1^* = p_2^* = c$. The proof is as follows: Consider, for
example,

$$p_1^* > p_2^* > c.$$

Then firm 1 has no demand, and its profit is zero. On the
other hand, if firm 1 charges

$$p_1 = p_2^* - \varepsilon$$

(where $\varepsilon$ is positive and "small"), it obtains the entire
market demand, $D(p_2^* - \varepsilon)$, and has a positive profit margin
of

$$p_2^* - \varepsilon - c.$$

Therefore, firm 1 cannot be acting in its own best interest
if it charges $p_1^*$. Now suppose that

$$p_1^* = p_2^* > c.$$

The profit of firm 1 is

$$D(p_1^*)(p_1^* - c)/2.$$

If firm 1 reduces its price slightly to $p_1^* - \varepsilon$, its profit
becomes

$$D(p_1^* - \varepsilon)(p_1^* - \varepsilon - c),$$

which is greater for small $\varepsilon$. In this situation, the market
share of the firm increases in a discontinuous manner.
Because no firm will charge less than the unit cost c (the
lowest-price firm would make a negative profit), we are
left with one or two firms charging exactly c. To show
that both firms do charge c, suppose that

$$p_1^* > p_2^* = c.$$

Then firm 2, which makes no profit, could raise its price
slightly, still supply all the demand, and make a positive
profit, a contradiction.
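
The undercutting argument can be checked numerically. The Python sketch below assumes a hypothetical linear demand $D(p) = 1 - p$ and unit cost $c = 0.2$ (neither is specified in the text) and searches a price grid for profitable unilateral deviations. The function names are illustrative only.

```python
def demand(p: float) -> float:
    """Hypothetical market demand D(p) = max(0, 1 - p)."""
    return max(0.0, 1.0 - p)


def firm_demand(p_i: float, p_j: float) -> float:
    """Demand faced by firm i: all, half, or none of the market."""
    if p_i < p_j:
        return demand(p_i)
    if p_i == p_j:
        return 0.5 * demand(p_i)
    return 0.0


def profit(p_i: float, p_j: float, c: float = 0.2) -> float:
    """Profit of firm i with constant unit cost c, as in equation 5.1."""
    return (p_i - c) * firm_demand(p_i, p_j)


def best_deviation_gain(p_i: float, p_j: float, c: float = 0.2, n: int = 2001) -> float:
    """Largest profit gain firm i can obtain by moving to a price on an n-point grid over [0, 1]."""
    base = profit(p_i, p_j, c)
    grid = [k / (n - 1) for k in range(n)]
    return max(profit(p, p_j, c) for p in grid) - base
```

With these assumptions, `best_deviation_gain(0.5, 0.5)` is strictly positive (undercutting pays at any common price above cost), whereas `best_deviation_gain(0.2, 0.2)` is zero: no grid price improves on pricing at cost when the rival does the same.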

The conclusions of this simple model are the following: 
(i) that firms price at marginal cost, and 
(ii) that firms do not make profits. 
These conclusions suggest that the monopoly results of
chapter 1 are very special. Even a duopoly would suffice
to restore competition. We call this the Bertrand paradox
because it is hard to believe that firms in industries with
few firms never succeed in manipulating the market price
to make profits.¹

In the asymmetric case (say, where firm i has constant
unit cost $c_i$, where $c_1 < c_2$), conclusions i and ii do not
hold. Indeed, the following can be shown (up to some
technical considerations; see exercise 5.1):


(iii) that both firms charge price $p = c_2$ (actually, firm 1
charges an $\varepsilon$ below $c_2$ to make sure it has the whole
market), and

(iv) that firm 1 makes a profit of $(c_2 - c_1)D(c_2)$, and firm
2 makes no profit (as long as $c_2 < p^m(c_1)$, where $p^m(c_1)$
maximizes $(p - c_1)D(p)$; otherwise, firm 1 charges $p^m(c_1)$).


Thus, firm 1 charges above marginal cost and makes a
positive profit, and the Bertrand equilibrium is no longer
welfare-optimal. But, again, the conclusion is a bit strained.
Firm 1 makes very little profit if $c_2$ is close to $c_1$, and firm
2 makes no profit at all.
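
Conclusions iii and iv can be illustrated with a small Python sketch. It assumes the hypothetical demand $D(p) = 1 - p$, for which the monopoly price is $p^m(c) = (1 + c)/2$, and it ignores the small $\varepsilon$ by which firm 1 undercuts $c_2$. The function names are illustrative assumptions, and the sketch does not replace the proof asked for in exercise 5.1.

```python
def demand(p: float) -> float:
    """Hypothetical market demand D(p) = max(0, 1 - p)."""
    return max(0.0, 1.0 - p)


def monopoly_price(c: float) -> float:
    """Maximizer of (p - c)(1 - p) for the assumed demand: (1 + c) / 2."""
    return (1.0 + c) / 2.0


def asymmetric_bertrand(c1: float, c2: float) -> tuple[float, float, float]:
    """Return (market price, profit of firm 1, profit of firm 2) when c1 < c2."""
    if not c1 < c2:
        raise ValueError("this sketch assumes c1 < c2")
    # Firm 1 prices at the rival's cost unless its own monopoly price is lower.
    price = min(c2, monopoly_price(c1))
    return price, (price - c1) * demand(price), 0.0
```

For example, with $c_1 = 0.2$ and $c_2 = 0.4$ the sketch returns a price of 0.4 and a profit of about 0.12 for firm 1, which equals $(c_2 - c_1)D(c_2)$. With $c_2 = 0.9$ the cost gap is so large that firm 1 simply charges its monopoly price of 0.6.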


Exercise 5.1* Prove conclusions iii and iv. 


5.2 Solutions to the Bertrand Paradox: 
An Introduction 


We can resolve the Bertrand paradox by relaxing any one 
of the three crucial assumptions of the model. Each of 
these generalizations brings more realism to the problem 
of price determination. The first is studied in section 5.3; 
the other two are taken up in chapters 6 and 7. In the 
present section we will sketch these possible solutions. 


5.2.1 The Edgeworth Solution 


Edgeworth (1897) solved the Bertrand paradox by intro- 
ducing capacity constraints, by which firms cannot sell 
more than they are capable of producing. To understand 
this idea, suppose that firm 1 has a production capacity 
smaller than $D(c)$. Is $(p_1^*, p_2^*) = (c, c)$ still an equilibrium
price system? At this price, both firms make zero profit.
Suppose that firm 2 increases its price slightly. Firm 1
then faces demand $D(c)$, which it cannot satisfy. Then,
rationing dictates that some consumers must resort to
firm 2. Firm 2 has a (residual) nonzero demand at a price
greater than its marginal cost and, therefore, makes positive
profit. Consequently, the Bertrand solution is no
longer an equilibrium. 
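
A minimal Python sketch of this deviation follows. It assumes the hypothetical demand $D(p) = 1 - p$, the efficient-rationing rule introduced in section 5.3.1.1, and no binding capacity limit on firm 2. None of these choices is imposed by the argument itself, and the function names are illustrative.

```python
def demand(p: float) -> float:
    """Hypothetical market demand D(p) = max(0, 1 - p)."""
    return max(0.0, 1.0 - p)


def residual_demand(p2: float, capacity_1: float) -> float:
    """Demand left to firm 2 after lower-priced firm 1 sells its capacity
    (efficient rationing is assumed here)."""
    return max(0.0, demand(p2) - capacity_1)


def firm2_deviation_profit(p2: float, c: float, capacity_1: float) -> float:
    """Profit of firm 2 when it charges p2 > c while firm 1 stays at c."""
    return (p2 - c) * residual_demand(p2, capacity_1)
```

With $c = 0.2$ and a capacity of 0.3 for firm 1 (below $D(c) = 0.8$), raising $p_2$ to 0.3 yields a positive profit of roughly 0.04, so $(c, c)$ is not an equilibrium. If firm 1's capacity were 0.8 instead, the same deviation would earn nothing.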

To solve explicitly for the equilibrium, we must intro- 
duce a more specific assumption concerning the manner 
in which consumers are rationed. As a general rule, in
models with capacity constraints, firms make positive 
profit and the market price is greater than the marginal 
cost. The crucial question now is whether or not this 
property is relevant: Won't firms accumulate capital ex 
ante until they are capable of satisfying the entire market 
demand at marginal cost? The answer is No. To accumulate 
capital is expensive, and it is not in the self-interest of the 
firms to do so if such behavior yields zero gross profit 
(without netting out capital costs).²

Using capacity constraints to justify noncompetitive 
prices is quite reasonable in some applications. For exam- 
ple, imagine the case of two hotels in a small town. In the 
short run, these hotels cannot adjust the number of beds 
(capacity). It is useless for them to get involved in cut- 
throat price competition if they are incapable of satisfying 
market demand individually. In the longer run they do 
not increase their capacity very much, because they ex- 
pect keen competition in a situation of collective over- 
capacity. One can also consider the case where the pro- 
duction of a good requires a certain delay. Then, the 
available quantities for sale in the very short run cannot 
be adjusted at all, and therefore they act as capacity con- 
straints at the time of price competition. 

The existence of a rigid capacity constraint is a special 
case of a decreasing-returns-to-scale technology. In our
previous example, a firm has marginal cost that is equal 
to c up to the capacity constraint and is then equal to 
infinity. More generally, the marginal cost may increase 
with the output. Except in special cases (such as the hotel 
example), a firm usually has some leeway to increase its 
production beyond its “efficient level”—extra machines 
can be rented, existing ones can be utilized beyond their 
efficient intensity of use, inputs can be provided on short
notice, and workers can work overtime. The cost of producing
these extra units exceeds that for the inframarginal
ones, but is, in general, not infinite.


1. Another paradox of the model is that one wonders why firms bother to
enter at all if they do not make any profit. Along the same lines, suppose that
the firms face a fixed cost of entering the market. Then, if one firm enters, the
other firm will not follow suit, however small the fixed cost. Thus, if one
believes in the existence of at least a small fixed cost of production or of entry,
the market is likely to yield a monopoly.


2. See sections 5.3 and 5.7.


5.2.2 The Temporal Dimension


The second crucial assumption underlying the Bertrand 
paradox is the “timing” of the game, which does not 
always seem to reflect economic reality as it stands. To 
see this, consider a crucial condition for the Bertrand
solution. In particular, why is $p_1 = p_2 > c$ not an equilibrium?
The answer is that firm 1, for example, would benefit
from a slight decrease in its price (i.e., to $p_2 - \varepsilon$) and from
its resulting takeover of the entire market. What would
happen then? Nothing, given Bertrand's crucial condition
that players are assumed to play only once. Firm 2 would
lose all its customers, and would make zero profit because
it would not react. In reality, firm 2 would probably
decrease its price in order to regain its share of the market.
If we introduce this temporal dimension and the possibility
of reaction, it is no longer clear that firm 1 would
benefit from decreasing its price below $p_2$. Firm 1 would
have to compare the short-run gain (the increase of its 
market share) to the longer-run loss in a price war. Chap- 
ter 6 shows that more collusive behavior than in the 
Bertrand equilibrium can be sustained by the threat of 
future losses in a price war. 


5.2.3 Product Differentiation 


An important assumption in the Bertrand analysis is the
perfect substitutability of the firms’ products. Consumers 
are indifferent between the goods at equal price and, thus, 
buy from the lowest-priced producer. This creates a pres- 
sure on price, which is somewhat relaxed when the firms’ 
products are not quite identical (see chapter 2 for a de- 
scription of a few differentiation spaces). Then, in general, 
the firms do not charge their marginal cost. For instance,
think of two firms selling the same good but located at
different places. Suppose that firm 1 charges $p_1 = c$. Firm
2, by charging $p_2 = c + \varepsilon$ (for $\varepsilon$ small), keeps at least
some consumers who are located near it. For these consumers,
the price differential is more than offset by the
difference in transportation cost. Hence, the zero-profit
price system $(p_1 = c, p_2 = c)$ is no longer an equilibrium.
(An extreme case of product differentiation occurs when
the demands for the firms' products are unrelated. Each
firm then charges its monopoly price.) Price competition
with differentiated products is analyzed in chapter 7.


5.2.4 What to Make of the Bertrand Analysis 


Bertrand competition is interesting because it depicts a 
polar case. It represents what we have in mind when we 
think of sharp small-number competition. In general, of 
course, oligopoly pricing will lead to an outcome inter- 
mediate between the Bertrand one and the outcome of 
the other polar case (the monopoly situation). Most of 
our analysis of price rivalry will concern the determina- 
tion of the factors that induce tough or soft competition. 


5.3 Decreasing Returns to Scale and Capacity 
Constraints 


5.3.1 Rationing Rules 


Assume now that both firms' cost functions exhibit decreasing
returns to scale. $C_i(q_i)$ is increasing and convex:
$C_i'(q_i) > 0$ and $C_i''(q_i) \ge 0$ for $q_i > 0$.

As figure 5.1 shows, an extreme case of decreasing 
returns to scale is that of a capacity constraint: The marginal
cost of production becomes infinite at some output
$\bar q_i$ (called the capacity level).

At a given price p, a firm is not willing to supply more
than its competitive supply $S_i(p)$, which is defined by the
equality of price and marginal cost:


$$p = C_i'(S_i(p)).$$


Now, suppose that firm i charges the lowest price, p, and
that $S_i(p) < D(p)$. Not all consumers who want to buy
from firm i are able to do so. Because of rationing, the
other firm faces some residual demand. The exact form
of the residual-demand function depends on which consumers
are served by the low-price firm, i.e., on the
rationing rule.

Two rationing rules have often been considered in the 
literature. The following subsections give their inter- 
pretation for the case of a demand function stemming 
from unit demands. The interpretation for individual 
downward-sloping demands is left to the reader. 

Let $p_1 < p_2$ denote the two prices charged by the two
firms. By abusing notation, we can allow $\bar q_1 \equiv S_1(p_1)$ to
denote firm 1's supply. ($S_1(p_1)$ is a self-imposed capacity
constraint.)


5.3.1.1 The Efficient-Rationing Rule 


Suppose that $\bar q_1 < D(p_1)$. Firm 1 cannot satisfy all of
its demand. The efficient-rationing rule presupposes a
residual-demand function for firm 2:


$$\tilde D_2(p_2) = \begin{cases} D(p_2) - \bar q_1 & \text{if } D(p_2) > \bar q_1, \\ 0 & \text{otherwise.} \end{cases}$$


Thus, everything is as if the most eager consumers bought
from firm 1. Firm 2 then faces the translated demand
curve shown in figure 5.2.

This rationing is called efficient because it maximizes
consumer surplus. In particular, when $D(p_2) > \bar q_1$, the
marginal consumer who consumes the good has valuation
$p_2$ for the good, which is the marginal cost of obtaining
the good for consumers. The residual-demand function
defined by the efficient-rationing rule is the one that
would be obtained if the consumers were able to costlessly
resell the good to each other (i.e., to engage in
arbitrage).³

Efficient rationing is also called parallel rationing, for
obvious geometrical reasons. It also arises when the demand
curve stems from identical individual downward-sloping
demand curves and consumers are all rationed
fairly (i.e., each of n consumers gets $\bar q_1/n$ at price $p_1$).


5.3.1.2 The Proportional-Rationing Rule 


By this rationing rule (also called the randomized-rationing
rule), all consumers have the same probability of being
rationed. The probability of not being able to buy from
firm 1 is

$$\frac{D(p_1) - \bar q_1}{D(p_1)}.$$

Hence, the residual demand facing firm 2 is

$$\tilde D_2(p_2) = D(p_2) \left( \frac{D(p_1) - \bar q_1}{D(p_1)} \right).$$

This rule is not efficient for consumers: some consumers
with valuation below $p_2$ (the marginal cost of the
good for the population of buyers) buy the good because
they obtain the bargain price $p_1$. However, firm 2 prefers
this rule to the efficient-rationing rule, because its residual
demand is higher at each price.


Figure 5.2
The efficient-rationing rule.

Figure 5.3
The proportional-rationing rule.


3. This is because we know that the competitive exchange market between
consumers is efficient. Note that the existence of a frictionless arbitrage between
consumers is a strong assumption. (Recall that on the other side of the market,
firms are assumed not to be able to change prices; thus, we may be putting too
much friction on the supply side and too little on the demand side.)


Of course, there are many other possible rationing
rules. The most realistic one cannot be found in the abstract.
For example, if rationing is achieved by queueing
and waiting, we need to know the degree of correlation
between the value of the good and the value of the time
for each consumer. If the consumers who are the most
eager to buy the good are also those with the highest
value of time, and if rationing is accomplished through
waiting time, one may want to consider rationing rules in
which the least eager consumers tend to buy the good at
the lowest price. One must also wonder whether consumers
can engage in arbitrage after buying the good and
whether one firm can buy from the other.⁴ In the following,
we will use rationing rules as a substitute for a complete
analysis of consumer behavior.
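
The two rationing rules of section 5.3.1 can be compared numerically. The Python sketch below assumes the hypothetical demand $D(p) = 1 - p$ and illustrative function names; it computes firm 2's residual demand under each rule when firm 1 charges the lower price $p_1$ and can supply at most $\bar q_1$.

```python
def demand(p: float) -> float:
    """Hypothetical market demand D(p) = max(0, 1 - p)."""
    return max(0.0, 1.0 - p)


def residual_efficient(p2: float, q1_bar: float) -> float:
    """Efficient (parallel) rationing: the demand curve shifts left by q1_bar."""
    return max(0.0, demand(p2) - q1_bar)


def residual_proportional(p2: float, p1: float, q1_bar: float) -> float:
    """Proportional rationing: demand at p2 scaled by the probability of
    not being served by firm 1."""
    d1 = demand(p1)
    if d1 <= q1_bar:
        # Firm 1 serves everyone at p1, so nobody turns to firm 2.
        return 0.0
    return demand(p2) * (d1 - q1_bar) / d1
```

For instance, with $p_1 = 0.2$, $\bar q_1 = 0.3$, and $p_2 = 0.4$, the residual demand is about 0.3 under efficient rationing and about 0.375 under proportional rationing, in line with the remark that firm 2 prefers the proportional rule.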


5.3.2 Price Competition 


Price competition under constant returns to scale yields a
price equal to the constant marginal cost. The natural
generalization for decreasing returns to scale would be
the "competitive outcome":


$$p^* = C_1'(q_1) = C_2'(q_2),$$

or

$$S_1(p^*) + S_2(p^*) = D(p^*).$$

Unfortunately, both firms charging
the competitive price, $p^*$, is generally not an equilibrium.
Consider, for example, the symmetric marginal-cost
curves depicted in figure 5.4. The competitive-equilibrium
price is $p^*$, and the competitive supply per firm is $q^*$. The
rationing rule here is the efficient-rationing one, and each
firm's residual-demand curve when the other firm charges
$p^*$ is represented by the dashed line.

Figure 5.4

Suppose that $p_1 = p_2 = p^*$ is a Nash equilibrium. Then
each firm's profit is equal to the area of the shaded region
in figure 5.4 (possibly minus a fixed cost of production).
Now, it is clear from the figure that there exist prices
$p > p^*$ that yield a higher profit for a firm given that the
other firm charges $p^*$.⁵ Thus, the competitive equilibrium
is not a Nash equilibrium. The economic intuition behind
this result is simple. At the competitive equilibrium, each
firm is on its supply curve, so one will not supply more if
the other raises its price. The firm that raises its price
slightly above the competitive price loses some demand;
however, this is only a second-order effect, as the last
units were sold at marginal cost. At the same time, that
firm raises the price on inframarginal units and realizes a
first-order increase in profit.


4. For instance, if consumers are rationed according to the efficient-rationing
rule, firm 2 would want to buy firm 1's whole capacity if it could do so. The
residual demand would then be equal to the demand, and firm 2's profit would
be

$$p_2 D(p_2) - p_1 \bar q_1 - c[D(p_2) - \bar q_1] > (p_2 - c)[D(p_2) - \bar q_1]$$

for linear costs, say.


5. Mathematically, a firm's profit for price $p > p^*$ when the other firm charges
$p^*$ is

$$p[D(p) - q^*] - C(D(p) - q^*).$$

The derivative of this profit with respect to p at $p = p^*$ is

$$D(p^*) - q^* + [p^* - C'(D(p^*) - q^*)] D'(p^*) = q^* > 0,$$

using $D(p^*) = 2q^*$ and $p^* = C'(q^*)$. So the profit is locally increasing in one's own price at the
competitive price. The reader should check this property for a more general
rationing rule yielding a residual demand curve $D(p, p^*)$ that is right-differentiable
at $p = p^*$ (for instance, the proportional-rationing rule).
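
The claim that the competitive price is not a Nash equilibrium can be checked on a simple case. The Python sketch below assumes the hypothetical demand $D(p) = 1 - p$ and cost $C(q) = q^2/2$ for both firms, so that $S_i(p) = p$ and the competitive outcome is $p^* = q^* = 1/3$. It also assumes efficient rationing. These functional forms and the function names are illustrative and are not taken from the text.

```python
P_STAR = 1.0 / 3.0  # competitive price: 2 * S(p) = D(p) with S(p) = p
Q_STAR = 1.0 / 3.0  # competitive supply per firm


def demand(p: float) -> float:
    """Hypothetical market demand D(p) = max(0, 1 - p)."""
    return max(0.0, 1.0 - p)


def cost(q: float) -> float:
    """Hypothetical convex cost C(q) = q^2 / 2, so marginal cost equals q."""
    return 0.5 * q * q


def deviation_profit(p: float) -> float:
    """Profit from charging p >= p* while the rival charges p* and sells q*
    (residual demand under efficient rationing)."""
    sold = max(0.0, demand(p) - Q_STAR)
    return p * sold - cost(sold)


def slope_at_competitive_price(h: float = 1e-6) -> float:
    """Right-hand finite-difference derivative of the deviation profit at p*."""
    return (deviation_profit(P_STAR + h) - deviation_profit(P_STAR)) / h
```

Under these assumptions the slope at $p^*$ is approximately $q^* = 1/3$, and a deviation to $p = 4/9$ raises profit from $1/18$ to $6/81$.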

Deriving the (or an) equilibrium under increasing mar- 
ginal costs is often a complex matter. In particular, it 
generally involves mixed strategies.⁶ But one property of
equilibrium is that both firms' prices exceed the competitive
price.⁷ This property formalizes the notion that decreasing
returns to scale soften price competition. In the following 
example (which differs from the previous analysis in that 
the marginal-cost functions are not differentiable), the 
equilibrium is nicely characterized (the firms charge a 
market-clearing price). 


5.3.2.1 A Capacity-Constrained Example 


Let the demand curve be

$$D(p) = 1 - p,$$

or

$$p = P(q_1 + q_2) = 1 - q_1 - q_2.$$


The two firms have capacity constraints, so firm i's output
must satisfy $q_i \le \bar q_i$. The capacity $\bar q_i$ has been acquired
previous to the price game at a unit cost $c_0$ belonging to
the interval $[3/4, 1]$. The marginal cost of production c (once
the capacity is installed) is, without loss of generality,
0 up to $\bar q_i$ and $\infty$ after $\bar q_i$ (for instance, the capacity
could correspond to an ex ante production). The efficient-rationing
rule is in effect.

We can restrict ourselves to capacities of less than 1/3,
because a firm's profit (gross of investment costs) in the
price game cannot exceed the monopoly profit


$$\max_p\, p(1 - p) = 1/4.$$


Thus, firm i's total profit (net of investment costs) is at
most $1/4 - c_0 \bar q_i$, and is negative for $\bar q_i > 1/3$ (since $c_0 \ge 3/4$). Whatever
their expectations about the market outcome, firms would
not rationally invest more than 1/3. (From now on, profits
will be computed gross of the investment cost except
where another method is mentioned.)

Assume that $\bar q_1$ and $\bar q_2$ belong to $[0, 1/3]$. Let us show
that both firms charging

$$p^* = 1 - (\bar q_1 + \bar q_2)$$

is an equilibrium. (This equilibrium is unique.) At this
price, both firms "dump" their capacities on the market,
and the consumers are not rationed. There is no point to
charging a lower price; firm i cannot supply more than $\bar q_i$
and thus would simply supply an output equal to capacity
at a lower price.

Is it worth raising one's price above $p^*$? Firm i's profit
from price $p > p^*$ is

$$p(1 - p - \bar q_j) = (1 - q - \bar q_j) q,$$

where q is the quantity sold by firm i at price p. (Note that
$q < \bar q_i$ because $p > p^*$.) But this latter profit is the same as
the one realized by a firm that gives output q to an
auctioneer who then equates supply and demand, given
that the other firm supplies $\bar q_j$. We will later call this profit
a Cournot profit. The profit function

$$(1 - q - \bar q_j) q$$

is concave in q. Its derivative at $q = \bar q_i$ is

$$1 - 2\bar q_i - \bar q_j \ge 0,$$

because $\bar q_i$ and $\bar q_j$ do not exceed 1/3. Hence, lowering the
output below $\bar q_i$ (or, equivalently, raising the price above
$p^*$) is not optimal.
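
The equilibrium claim of this example can be checked by brute force. The Python sketch below implements the price game with demand $D(p) = 1 - p$, zero marginal cost up to capacity, and efficient rationing, then searches a price grid for profitable deviations from $p^*$. The tie-breaking rule at equal prices (each firm gets half the market plus whatever the rival cannot serve) and the function names are assumptions of the sketch.

```python
def demand(p: float) -> float:
    """Market demand D(p) = max(0, 1 - p) from the example."""
    return max(0.0, 1.0 - p)


def sales(p_i: float, p_j: float, q_i_bar: float, q_j_bar: float) -> float:
    """Quantity sold by firm i under efficient rationing."""
    if p_i < p_j:
        wanted = demand(p_i)
    elif p_i > p_j:
        wanted = max(0.0, demand(p_i) - q_j_bar)
    else:
        # Assumed tie-breaking: half the market, or more if the rival is constrained.
        wanted = max(0.5 * demand(p_i), demand(p_i) - q_j_bar)
    return min(q_i_bar, wanted)


def profit(p_i: float, p_j: float, q_i_bar: float, q_j_bar: float) -> float:
    """Profit gross of investment cost (marginal cost is zero up to capacity)."""
    return p_i * sales(p_i, p_j, q_i_bar, q_j_bar)


def is_best_response(p_i: float, p_j: float, q_i_bar: float, q_j_bar: float,
                     n: int = 2001, tol: float = 1e-12) -> bool:
    """True if no price on an n-point grid over [0, 1] beats p_i against p_j."""
    base = profit(p_i, p_j, q_i_bar, q_j_bar)
    return all(profit(k / (n - 1), p_j, q_i_bar, q_j_bar) <= base + tol
               for k in range(n))
```

With capacities 0.2 and 0.3, so that $p^* = 0.5$, neither firm finds a profitable deviation on the grid. With capacities of 0.45 each, which lie outside $[0, 1/3]$, charging $p^* = 0.1$ is no longer a best response, since raising the price becomes profitable.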

The conclusion of this study is that everything is as if
the two firms put outputs equal to their capacities on the
market and an auctioneer equated supply and demand.
The difference is that the firms choose the market-clearing
price themselves. For capacities $\bar q_i$ and $\bar q_j$ in $[0, 1/3]$, the
firms' reduced-form profit functions after solving for price
competition are


$$\Pi^{ig}(\bar q_i, \bar q_j) = [1 - (\bar q_i + \bar q_j)]\, \bar q_i \tag{5.2}$$


(gross of investment cost)
and

$$\Pi^{in}(\bar q_i, \bar q_j) = \{[1 - (\bar q_i + \bar q_j)] - c_0\}\, \bar q_i \tag{5.3}$$

(net of investment cost).


We later will say that such profit functions have the exact
Cournot reduced form. As was explained above, these are
the profit functions that would obtain if the firms produced
quantities $\bar q_i$ and an auctioneer picked the price so
as to clear the market.


6. This feature makes the static model of price choices particularly suspect.
Suppose that each firm randomizes between different prices. A firm's price may
be optimal ex ante (before the firm knows the realization of the other firm's
price). But ex post (after learning this realization) the firm may want to change
its price. This suggests that we look at the price dynamics.

7. The advanced reader may check this on mixed strategies. The intuition
can be derived from pure strategies: If $p_2 > p_1$ and $p_1 < p^*$, then firm 1 is a
local monopolist (i.e., its profit is the monopoly profit for prices around $p_1$).
We know that a monopolist does not sell at a price below marginal cost. If
$p_1 = p_2 < p^*$, the same reasoning as in footnote 5 shows that each firm wants
to raise its price.

Exercise 5.2* Let the demand function be

$$q = D(p) = 1 - p.$$

Suppose that both firms' marginal cost (once the capacities
are installed) is zero. Suppose further that $\bar q_1$ and $\bar q_2$ are
lower than 1/4. Show that under proportional rationing both
firms charge

$$p^* = 1 - (\bar q_1 + \bar q_2)$$

and that

$$\Pi^{ig}(\bar q_i, \bar q_j) = \bar q_i (1 - \bar q_i - \bar q_j).$$

Our assumption of large investment costs was meant 
to guarantee small capacities. The result that for small 
capacities the reduced-form profit functions have the 
exact Cournot form was obtained by Beckman (1967) in 
the case of proportional rationing and by Levitan and 
Shubik (1972) in the case of efficient rationing. For larger 
capacities, no pure-strategy equilibrium exists (unless 
capacities allow each firm to supply the whole demand at 
the competitive price). The equilibrium is thus in mixed 
strategies, and it was computed in closed form by Beck- 
man (1967) for proportional rationing and by Levitan and 
Shubik (1972) for efficient rationing in the special case of 
symmetric capacities. Kreps and Scheinkman (1983) characterized
the mixed-strategy equilibrium for efficient
rationing for asymmetric capacities as well; see section
5.7.⁸ (The characterization of the asymmetric case is
important for the study of the two-stage game, in which 
firms are allowed to choose different capacities.) 


5.3.3 Ex Ante Investment and Ex Post Price 
Competition 


In the above example, price competition was subsumed in 
a very simple manner. Each firm knew that both firms 
chose exactly the price that allowed them to dump their 
capacities on the market. As Kreps and Scheinkman (1983)
noted,⁹ this suggests that we look at a two-stage game in
which both firms simultaneously choose capacities $\bar q_i$ and
then, knowing each other's capacity, they simultaneously
choose prices $p_i$.

From our characterization of the second-stage price 
game, it follows that this two-stage game is equivalent 
to the one-stage game in which firms choose quantities 
$\bar q_i$ and an auctioneer determines the market price that
clears the market: $p = P(\bar q_1 + \bar q_2)$. This one-stage game
is indeed the quantity competition game envisioned by
Cournot (1838).¹⁰ Cournot has often (rightly) been criticized
on the ground that prices are ultimately chosen by
firms, not by an auctioneer. This type of two-stage-game 
construction shows that it may be possible to vindicate 
Cournot by introducing capacity constraints and con- 
sidering the Cournot profit function (equation 5.3) as a 
reduced-form profit function in which later price competi- 
tion has been subsumed. 
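
To see what the reduced form implies for capacity choices, the Python sketch below computes the Cournot equilibrium of the one-stage game with the net profit of equation 5.3, $[1 - (\bar q_i + \bar q_j) - c_0]\,\bar q_i$, by iterating best responses. The iteration scheme, the starting point, and the function names are choices made for this sketch, not a method described in the text.

```python
def reaction(q_j_bar: float, c0: float) -> float:
    """Best capacity for firm i given q_j_bar: the maximizer of
    [1 - (q_i + q_j_bar) - c0] * q_i over q_i >= 0."""
    return max(0.0, (1.0 - c0 - q_j_bar) / 2.0)


def cournot_capacities(c0: float, iterations: int = 200) -> tuple[float, float]:
    """Iterate simultaneous best responses from (0, 0) until they settle."""
    q1 = q2 = 0.0
    for _ in range(iterations):
        q1, q2 = reaction(q2, c0), reaction(q1, c0)
    return q1, q2


def market_price(q1: float, q2: float) -> float:
    """Market-clearing price P(q1 + q2) = 1 - q1 - q2."""
    return 1.0 - q1 - q2
```

For $c_0 = 3/4$ the iteration settles at $\bar q_1 = \bar q_2 = (1 - c_0)/3 = 1/12$, with a market price of $5/6$. Both capacities lie well inside $[0, 1/3]$, the range over which the reduced form was derived.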

Kreps and Scheinkman (1983) have shown that if the 
demand function is concave and if the rationing rule is the 
efficient one (but the investment cost $c_0$ is arbitrary),
the outcome (capacity choices, market price) of the two-stage
game is the same as that of the one-stage Cournot
game. As the title of their paper proclaims, "quantity
precommitment and Bertrand competition yield Cournot
outcomes."¹¹ (See section 5.7.)

The idea of looking at a two-stage game in which firms 
choose an investment decision and then a price is not 
restricted to capacity choices. Indeed, in chapters 7 and 8 
we will study two-stage games in which the investment 
decision refers to a choice in a product space (e.g., location).
These games will have similar features. For instance,
we will see that when choosing a location, firms try to 
differentiate themselves from other firms so as to avoid 


8. See also further closed-form computations or characterizations of equilib- 
rium in various contexts by Benoit and Krishna (1987), Davidson and Deneckere 
(1986), Ghemawat (1986), and Osborne and Pitchik (1986). For early work on
the topic, see Shubik 1959. 


9. Davidson and Deneckere (1986) trace the idea back to Sherman (1972).

10. The idea of Nash equilibrium can be found in Cournot. A Nash equilibrium
for the quantity game is often called a Cournot-Nash or Cournot equilibrium.


11. However, the reduced-form profit functions do not have the exact Cournot 
form for “large capacities” (see section 5.7).

the intense Bertrand competition associated with perfectly
substitutable products (in the same way that firms
here avoid accumulating “too much capacity” in order
to soften price competition). Such two-stage games are
attractive because they formalize the idea that investment
decisions are generally made before price decisions (or
that they are long-run or medium-run choices while prices
are fairly flexible).


5.3.4 Discussion 


As was noted above, the two-stage game is meant to 
convey the idea that price competition is the final stage of 
competition and the idea that scale decisions must be 
made before firms arrive on the market. Of course, in 
situations where firms can produce while or after demand 
is formulated, the second condition is not met. However, 
the existence of ex ante choice of scale may be reasonable 
in a number of situations. For instance, our hotel example 
is based on the fact that a hotel cannot adjust its capacity 
very quickly as a function of demand. Similarly, the street 
vendor of a perishable food first goes and buys a certain 
amount of the food (capacity), and in a second stage sells 
part or all of this amount.

There are two potential results of interest for the foun- 
dations of Cournot competition: 


Exact Cournot reduced form The capacity-constrained price 
game yields reduced-form profit functions that are iden- 
tical to Cournot profit functions, in which quantities are 
to be interpreted as capacities. 

Cournot outcome in the two-stage game The equilibrium of 
the two-stage (capacity and then price) game coincides 
with the Cournot equilibrium, in which quantities are to 
be interpreted as capacities. 


The first result implies the second. And, unlike the sec- 
ond, it allows the analysis of variants of Cournot competi- 
tion, such as sequential timing for the choice of capacities. 
It is important to realize that these results rest on very 
strong assumptions. For instance, Davidson and Deneckere 
(1986) show that rationing rules even slightly different
from the efficient-rationing rule do not yield the Cournot
outcome if the cost of investment $c_0$ is small. The results
also rest on the absence of intertemporal price competition
(see chapter 6) and product differentiation (see
chapter 7).¹²

Furthermore, even when such results hold in this sim- 
ple setting, one must exercise caution when justifying 
Cournot competition using a reduced-form argument in 
more general settings. This point can be seen in a simple 
model where a firm’s actions signal information to its 
rivals. As we shall see in chapter 9, firms may attempt to 
infer from their opponents’ behavior information about 
the cost structures or the level of demand. Now, a one- 
stage quantity game and a two-stage quantity (capacity) 
and price game may be different, because the ability to 
make inferences may vary across different types of ac- 
tions (in a one-stage game, a firm’s actions do not convey 
the same information as the actions in the two-stage 
game it is meant to summarize). At the present point in 
the book, this may be obscure, and the reader may wish 
to return to this issue after having read the next chapters.
But the moral is that the assumptions underlying the 
two-stage “vindication” of the Cournot model may not 
be consistent with the application of this theory to any 
given situation. In particular, caution must be exercised so 
that folding a two-stage game back into a reduced-form 
one-stage game does not eliminate important features of 
the two-stage game (e.g., the type of inferences firms may 
draw from their opponents’ behavior).

Another caveat: In most cases, firms do not face rigid 
capacity constraints, as we noted earlier. The cost func- 
tion induced by the investment choice does not have the 
(inverted) L shape. That is, there is generally no “capacity 
level” that gives a meaning to the quantity variable in the 
Cournot profit functions. 

So, what is left of the analysis in this section? Three 
points: 

First, the predictions and welfare results of the tradi- 
tional Cournot model (see sections 5.4 and 5.7) can be 
provided with foundations in some extreme cases. Overall,
the exact-Cournot-reduced-form and the Cournot-outcome
results are more likely to hold when the investment
cost $c_0$ is high. A high $c_0$ creates a large discrepancy
between the first-period (ex ante) costs and the second-period
(ex post) costs, and therefore creates a higher


12. They also require capacities to be observable before prices are chosen. If a
firm's capacity is not observable by its rival(s), scale and price choices must be
represented as simultaneous rather than sequential. The results are then modified
(see section 5.7).

willingness to dump existing capacity (i.e., adopt market-clearing
behavior) ex post.

Second, the two-stage game illustrates the idea that 
firms may want to choose non-price actions that soften 
price competition (here, each firm restricts its capacity as 
a commitment not to choose a low price later).¹³ This
idea is much more general than this particular situation 
and will be developed in much detail in chapter 8. 

Third, in many (descriptive) applications of Cournot 
competition, the ability to write the profit functions in 
the exact Cournot reduced form is not crucial. The main
property of Cournot competition is often that the cross- 
partial derivative of firm i’s profit with respect to the 
actions of that firm and its rival is negative (strategic 
substitutes). This property holds, for instance, under the 
Kreps-Scheinkman assumptions (in the pure strategy 
region, 


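$$\frac{\partial^2 \Pi^i}{\partial \bar q_i\,\partial \bar q_j} = \frac{\partial^2 \bigl(\bar q_i P(\bar q_i + \bar q_j)\bigr)}{\partial \bar q_i\,\partial \bar q_j} = P' + P''\bar q_i < 0$$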


for a concave demand function), but it may also hold for 
models in which the exact Cournot form is invalid— 
although the exact assumptions under which capacities 
are indeed strategic substitutes remain to be determined. 

More generally, what we mean by quantity competi- 
tion is really a choice of scale that determines the firm's cost 
functions and thus determines the conditions of price competi- 
tion. This choice of scale can be a capacity choice, but 
more general investment decisions are also allowable. 

To give an example of this reasoning, let us jump 
ahead. In chapter 7, we will consider price competition 
between two firms located at the two extremes of a 
spatial-differentiation model on a line (see chapter 2 for 
the set-up). Assuming that the segment has length 1, that 
the parameter of differentiation is $t$, and that the firms face
constant marginal costs of production $c_1$ and $c_2$, we will
show that, solving for price competition, the reduced-form
profit functions are

$$\Pi^i(c_i, c_j) = \left(t + \frac{c_j - c_i}{3}\right)^2 \Big/ 2t.$$


Now consider a first stage, in which the firms “choose
their unit costs”; that is, they choose a level of monetary
investment $I_i$ that determines the ex post unit cost
$c_i(I_i)$ (with $c_i'(\cdot) < 0$). These investments (or the resulting
unit costs) can be considered scale variables by abuse
of terminology, and they satisfy the strategic-substitutes
condition:

$$\frac{\partial^2 \Pi^i}{\partial I_i\,\partial I_j} < 0 \quad \left(\text{and } \frac{\partial^2 \Pi^i}{\partial c_i\,\partial c_j} < 0\right).$$
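The strategic-substitutes condition for this example can be checked numerically. The following is a minimal sketch, not part of the original analysis: it differentiates the reduced-form profit by central finite differences. The function names, the step size `h`, the parameter values, and the exponential cost-reduction technology `unit_cost` are all assumptions chosen for illustration.

```python
import math

def reduced_profit(c_i, c_j, t):
    """Reduced-form profit (t + (c_j - c_i)/3)^2 / (2t) from the text."""
    return (t + (c_j - c_i) / 3.0) ** 2 / (2.0 * t)

def cross_partial(f, x, y, h=1e-3):
    """Central finite-difference estimate of d^2 f / (dx dy)."""
    return (f(x + h, y + h) - f(x + h, y - h)
            - f(x - h, y + h) + f(x - h, y - h)) / (4.0 * h * h)

def unit_cost(investment, c_max=0.5):
    """Hypothetical technology: unit cost falls with investment (c' < 0)."""
    return c_max * math.exp(-investment)

T = 1.0  # differentiation parameter t (illustrative value)

# Cross-partial with respect to the two unit costs.
d2_costs = cross_partial(lambda ci, cj: reduced_profit(ci, cj, T), 0.3, 0.4)

# Cross-partial with respect to the two investment levels.
d2_invest = cross_partial(
    lambda Ii, Ij: reduced_profit(unit_cost(Ii), unit_cost(Ij), T), 0.2, 0.6)

print(d2_costs, d2_invest)  # both negative: strategic substitutes
```

For this profit function the cross-partial in unit costs is the constant $-1/(9t)$, so the first number should be close to $-0.111$ when $t = 1$.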


5.4 Traditional Cournot Analysis 


We now analyze the one-stage game in which firms
choose their quantities (understand their capacities) simultaneously.
We will use either the general reduced form for
the profit function, $\Pi^i(q_i, q_j)$, or the more specific exact
Cournot form:

$$\Pi^i(q_i, q_j) = q_i P(q_i + q_j) - C_i(q_i)$$


(see the caveats on this exact form in section 5.3). 

Each firm maximizes its profit given the quantity chosen
by the other firm. Assuming that the profit function
$\Pi^i$ is strictly concave in $q_i$ and twice differentiable, we get

$$q_i = R_i(q_j), \qquad (5.4)$$

where $R_i$ is firm i’s reaction curve:

$$\Pi^i_i(R_i(q_j), q_j) = 0.$$


Recall from the introduction to part II that if we assume 
that firm i’s marginal profit is decreasing with the other 
firm's quantity, then the reaction functions are downward 
sloping. The equilibrium quantities are depicted in figure 
5.5 by the intersection of the two reaction curves. Of
course, such an intersection need not be unique; in that 
case we would have multiple equilibria. 

To be a bit more specific, let us consider the first-order 
condition for profit maximization for the exact Cournot 
form: 


$$\Pi^i_i = P(q_i + q_j) - C_i'(q_i) + q_i P'(q_i + q_j) = 0. \qquad (5.5)$$


This has a simple interpretation. The first two terms yield 


13. Along these lines, Ghemawat (1986) finds that in an example of an 
asymmetric-capacity price game, the firm with a low capacity prices more 
aggressively than the one with a high capacity, because it is more at risk of being
undersold. This confirms the idea that a high capacity threatens one's rivals and
forces them to price aggressively.

Figure 5.5


the profitability of an extra unit of output, which is equal 
to the difference between price and marginal cost. The 
third term represents the effect of this extra unit on the 
profitability of inframarginal ones. The extra unit creates
a decrease in price $P'$, which affects the $q_i$ units already
produced. Equation 5.5 is similar to the formulas obtained
for a competitive firm and a monopoly. For a competitive
firm there is no third term, because the firm is too small to
affect the market price; for a monopoly, $q_i$ is equal to the
output of the industry.

The preceding comparison actually illustrates the nega- 
tive externality between the firms: When choosing its 
output, firm i takes into account the adverse effect of the 
market-price change on its own output, rather than the 
effect on aggregate output. Hence, each firm will tend to 
choose an output that exceeds the optimal output from 
the industry's point of view¹⁴ (but, of course, not from a
welfare point of view). Thus, the market price will be lower 
than the monopoly price, and the aggregate profit will be lower 
than the monopoly profit. Another interesting consequence 
of equation 5.5 is that Cournot equilibrium does not 
equalize marginal costs except in the symmetric case. Not 
only is too little produced, but the industry's cost of production
is not minimized.


Equation 5.5 can be rewritten as 


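$$L_i = \frac{\alpha_i}{\varepsilon}, \qquad (5.6)$$

where

$$L_i \equiv \frac{P - C_i'}{P}$$

is the Lerner index (price-cost margin) for firm i,

$$\alpha_i \equiv \frac{q_i}{Q}$$

is firm i’s market share ($Q = q_1 + q_2$), and

$$\varepsilon \equiv -\frac{P}{P'Q}$$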

is the elasticity of demand. Thus, the Lerner index is 
proportional to the firm’s market share and inversely pro- 
portional to the elasticity of demand. This index is posi- 
tive—that is, firms sell at a price exceeding marginal cost. 
Thus, the Cournot equilibrium is not socially efficient.
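The Lerner-index formula can be illustrated on a case that is not linear. The sketch below is only an illustration under assumed primitives: the concave inverse demand $P(Q) = 1 - Q^2$, the constant marginal costs, and all function names are hypothetical. It solves the first-order condition 5.5 by bisection, iterates best responses, and compares each firm's Lerner index with its market share divided by the elasticity of demand.

```python
def inverse_demand(Q):
    """Illustrative concave inverse demand P(Q) = 1 - Q^2 (an assumption)."""
    return 1.0 - Q * Q

def inverse_demand_slope(Q):
    """P'(Q) for the demand above."""
    return -2.0 * Q

def marginal_profit(q_i, q_j, c_i):
    """Left-hand side of equation 5.5 with constant marginal cost c_i."""
    Q = q_i + q_j
    return inverse_demand(Q) - c_i + q_i * inverse_demand_slope(Q)

def best_response(q_j, c_i):
    """Solve marginal_profit = 0 for q_i by bisection on [0, 1]."""
    if marginal_profit(0.0, q_j, c_i) <= 0.0:
        return 0.0
    lo, hi = 0.0, 1.0
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if marginal_profit(mid, q_j, c_i) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def cournot_equilibrium(c1, c2, rounds=200):
    """Alternate best responses until they settle down."""
    q1 = q2 = 0.0
    for _ in range(rounds):
        q1 = best_response(q2, c1)
        q2 = best_response(q1, c2)
    return q1, q2

c1, c2 = 0.1, 0.3  # hypothetical marginal costs
q1, q2 = cournot_equilibrium(c1, c2)
Q = q1 + q2
P = inverse_demand(Q)
elasticity = -P / (inverse_demand_slope(Q) * Q)  # epsilon = -P / (P'Q)

for q_i, c_i in ((q1, c1), (q2, c2)):
    lerner = (P - c_i) / P
    share = q_i / Q
    print(round(lerner, 6), round(share / elasticity, 6))  # the two columns agree
```

The low-cost firm ends up with both the larger market share and the larger price-cost margin, as the formula requires.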

A technical note about the concavity of the firm’s ob- 
jective function and the sign of the cross-partial deriva- 
tive: From equation 5.5, we obtain 


$$\Pi^i_{ii} = 2P' + q_i P'' - C_i'' \qquad (5.7)$$

and

$$\Pi^i_{ij} = P' + q_i P''. \qquad (5.8)$$

Recall that $P' < 0$. For the objective function to be concave
($\Pi^i_{ii} < 0$), it suffices that the firm's cost be convex
($C_i'' \ge 0$) and that the inverse-demand function be concave
($P'' \le 0$). The latter assumption suffices for quantities to
be strategic substitutes ($\Pi^i_{ij} < 0$). These two assumptions
are met, for instance, for linear demand ($P'' = 0$) and
constant returns to scale ($C_i'' = 0$). For more on the concavity
of the objective function and the existence of a
Cournot equilibrium see section 5.7.

The Cournot equilibrium is easily derived in the case of 
linear demand and cost. Suppose that $D(p) = 1 - p$ (or
$P(Q) = 1 - Q$) and $C_i(q_i) = c_i q_i$. Then the reaction functions
are

$$q_i = R_i(q_j) = \frac{1 - q_j - c_i}{2}.$$

Hence, the Cournot equilibrium is given by 


$$q_i = \frac{1 - 2c_i + c_j}{3},$$


14. This is obtained by replacing $q_i P'$ in equation 5.5 with $(q_i + q_j)P'$.

and the profit is

$$\Pi^i = \frac{(1 - 2c_i + c_j)^2}{9}.$$


A firm's output decreases with its marginal cost. More 
interestingly, it increases with its competitor's marginal 
cost; this is because a higher $c_j$ leads firm j to produce
less, which raises the residual demand faced by firm i, 
encouraging firm i to produce more. 
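The linear example lends itself to a short numerical check. The sketch below (function names and parameter values are illustrative assumptions) iterates the two reaction functions from zero output, compares the limit with the closed-form equilibrium, and then raises firm 2's marginal cost to show the effect on both outputs.

```python
def best_response(q_rival, c_own):
    """Reaction function R_i(q_j) = (1 - q_j - c_i)/2, truncated at zero."""
    return max(0.0, (1.0 - q_rival - c_own) / 2.0)

def cournot_by_iteration(c1, c2, rounds=200):
    """Iterate simultaneous best responses starting from (0, 0)."""
    q1 = q2 = 0.0
    for _ in range(rounds):
        q1, q2 = best_response(q2, c1), best_response(q1, c2)
    return q1, q2

def cournot_closed_form(c_i, c_j):
    """Equilibrium output (1 - 2c_i + c_j)/3 and profit (1 - 2c_i + c_j)^2/9."""
    q_i = (1.0 - 2.0 * c_i + c_j) / 3.0
    return q_i, q_i ** 2

c1, c2 = 0.1, 0.2  # hypothetical marginal costs
q1, q2 = cournot_by_iteration(c1, c2)
price = 1.0 - q1 - q2
profit1 = (price - c1) * q1

# Raising the rival's marginal cost raises firm 1's output and lowers firm 2's.
q1_after, q2_after = cournot_by_iteration(c1, 0.3)
print(q1, q2, profit1, q1_after, q2_after)
```

The iteration converges here because each reaction function has slope $-1/2$, so a deviation from equilibrium is halved at every round.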

That a firm's output decreases with its marginal cost 
and increases with its competitor's marginal cost can be 
obtained for more general demand and cost functions as 
long as the following two conditions are satisfied: (a) 
the reaction curves are downward sloping (quantities are 
strategic substitutes) and (b) the reaction curves cross 
only once (there exists a unique Cournot equilibrium) and 
the slope of $R_2$ in the $(q_1, q_2)$ space is smaller in absolute
value than the slope of $R_1$.¹⁵

It is easily shown that an increase in a firm's marginal 
cost shifts the firm's reaction curve down. To prove this, 
recall from chapter 1 that a monopolist’s price (respec- 
tively, quantity) increases (respectively, decreases) with 
the firm’s marginal cost. But in duopoly, for a given 
output $q_j$, firm i is a monopolist on the residual-demand
curve $P(\cdot + q_j)$. Hence, the proof of chapter 1 applies, and
firm i's optimal output given $q_j$ is a decreasing (more
precisely, a nonincreasing) function of firm i's marginal 
cost. This result is completely general; conditions such as 
(a) or (b) are not required. (The reader is advised to go 
through the argument again as an exercise.) 

Figure 5.6 depicts reaction curves satisfying (a) and (b), 
and shows the effect of an increase in firm 1's marginal 
cost. Firm 1's equilibrium output is reduced, while firm 2's 
increases. 

The results generalize straightforwardly to the case of 
n firms. Let 


$$Q = \sum_{i=1}^{n} q_i.$$

Equation 5.5 then becomes

$$P(Q) - C_i'(q_i) + q_i P'(Q) = 0. \qquad (5.9)$$


Firm i's Lerner index is still equal to the ratio of its market 


Figure 5.6: Effect of an increase in firm 1's marginal cost.


share to the elasticity of demand. For instance, for the 
symmetric case with linear cost and demand, 


$$P(Q) = 1 - Q \quad \text{and} \quad C_i(q_i) = c\,q_i$$

for all i (with $c < 1$), equation 5.9 becomes

$$1 - Q - c - q_i = 0. \qquad (5.10)$$


The equilibrium is symmetric for this symmetric model: 
Q = nq, where q is the output per firm. Hence, we obtain 


$$q = \frac{1 - c}{n + 1}. \qquad (5.11)$$

The market price is

$$p = 1 - nq = c + \frac{1 - c}{n + 1}, \qquad (5.12)$$

and each firm's profit is

$$\Pi = \frac{(1 - c)^2}{(n + 1)^2}. \qquad (5.13)$$


The market price and each firm’s profit decrease with the 
number of firms. Furthermore, because the market price
decreases with n, so does the aggregate profit $n\Pi$. Indeed,
when the number of firms becomes very large ($n \to \infty$), the
market price tends to the competitive price c. Thus, a Cournot 


15. This “stability condition” is treated in chapter 8. A sufficient condition is
that, for i = 1, 2, $|R_i'| < 1$; that is, a decrease in one firm's production yields
a decrease in aggregate output even if its rival reacts (optimally) to the decrease
in production. For instance, for a linear demand, $|R_i'| = \tfrac{1}{2}$.

equilibrium with a large number of firms is approximately
competitive. This is natural, because each firm has only a 
small influence on the price and thus acts almost like a 
price taker. 

See section 5.7 for more on the convergence to the 
competitive equilibrium, and for a discussion of the exis- 
tence and uniqueness of a Cournot equilibrium. 
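The convergence to the competitive price in the symmetric linear example can be tabulated directly from equations 5.11 through 5.13. This is a small illustrative sketch; the function name, the cost value c = 0.2, and the list of firm counts are assumptions.

```python
def symmetric_cournot(n, c):
    """Per-firm output, price, and per-firm profit (equations 5.11-5.13)."""
    q = (1.0 - c) / (n + 1)
    p = 1.0 - n * q
    profit = (p - c) * q
    return q, p, profit

c = 0.2  # illustrative common marginal cost, c < 1
for n in (1, 2, 5, 10, 100, 1000):
    q, p, profit = symmetric_cournot(n, c)
    # First-order condition 5.10: 1 - Q - c - q = 0 with Q = n q.
    foc = 1.0 - n * q - c - q
    print(n, round(p, 4), round(profit, 6), round(n * profit, 6), abs(foc) < 1e-12)
```

Each row lists the number of firms, the market price, the profit per firm, and the aggregate profit; the last column confirms that the first-order condition holds. The price falls toward c and the aggregate profit toward zero as n grows.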


Exercise 5.3** There are three identical firms in the industry.
The demand is $1 - Q$, where $Q = q_1 + q_2 + q_3$.
The marginal cost is zero.

(i) Compute the Cournot equilibrium. 

(ii) Show that if two of the three firms merge (trans- 
forming the industry into a duopoly), the profit of these 
firms decreases. Explain. 

(iii) What happens if all three firms merge? 

(iv) If the firms were competing in prices and sold 
differentiated products, would a merger between two of 
them be profitable? (Work at an intuitive level, and as- 
sume that prices are strategic complements.) 


Exercise 5.4** Consider a duopoly producing a homogeneous
product. Firm 1 produces one unit of output with
one unit of labor and one unit of raw material. Firm 2
produces one unit of output with two units of labor and
one unit of raw material. The unit costs of labor and raw
material are w and r. The demand is $p = 1 - q_1 - q_2$,
and the firms compete in quantities.

(i) Compute the Cournot equilibrium. 

(ii) Show that firm 1's profit is not affected by the price
of labor (over some range). To prove this elegantly, use 
the envelope theorem. Explain. 


Exercise 5.5* This exercise illustrates the strategic con- 
siderations faced by a multimarket firm. It is inspired by 
the more general theory of Bulow et al. (1985). 

There are two firms in a market. They produce perfect
substitutes at cost $C(q) = q^2/2$. The demand is
$p = 1 - (q_1 + q_2)$.

(i) Compute the Cournot equilibrium. 

(ii) Suppose now that firm 1 has the opportunity to sell
the same output on another market as well. The quantity
sold on this market is $x_1$, so firm 1's cost is $(q_1 + x_1)^2/2$.
The demand on the second market is $p = a - x_1$. Consider
the Cournot game in which firm 1 chooses $q_1$ and
$x_1$ and firm 2 chooses $q_2$ simultaneously. Show that
$q_1 = (2 - a)/7$ and $q_2 = (5 + a)/21$ over the relevant
range of a. Show that for $a = \tfrac{1}{4}$ a small increase in a hurts
firm 1. (Use the envelope theorem.) Interpret.


5.5 Concentration Indices and Industry Profitability 


The Bertrand model and the Cournot model are the basic 
models for nonrepeated interaction among oligopolists 
producing the same good. Like the models that will be 
developed in the next chapters, they give prices, quan- 
tities, profits, and consumer surplus as functions of the 
cost structures, the demand function, and the number of 
firms (unless the latter variable is endogenized through 
choice of entry, as in chapter 7). In practice, the observa- 
tion of a market price (if such a thing exists) tells us little 
about the competitiveness of the corresponding industry 
unless we observe prices in industries with similar cost 
structures (for instance, different geographical markets), 
or we observe the temporal pattern of the industry price 
(see chapter 6), or we are able to measure accurately the 
firms’ marginal cost. More readily informative variables 
are the rates of profit and the firms’ market shares. 

Industrial-organization economists have long tried to 
summarize the distribution of market shares among firms 
in a single index to be used in econometric and antitrust 
analysis. Such an aggregate index is called a concentration 
index. With $\alpha_i \equiv q_i/Q$ denoting firm i’s market share
(where $i = 1, \ldots, n$ and $\sum_{i=1}^{n} \alpha_i = 1$), the following are
examples of concentration indices.

• the m-firm concentration ratio (for $m < n$), which adds up
the m highest shares in the industry:

$$R_m \equiv \sum_{i=1}^{m} \alpha_i$$

(ordering the firms so that $\alpha_1 \ge \cdots \ge \alpha_m \ge \cdots \ge \alpha_n$),

• the Herfindahl index, which is equal to the sum of the
squares of the market shares¹⁶:

$$R_H \equiv \sum_{i=1}^{n} \alpha_i^2,$$

• the entropy index, which is equal to the sum of the shares
times their logarithm:

$$R_e \equiv \sum_{i=1}^{n} \alpha_i \ln \alpha_i.$$

16. In the literature, $R_H$ is usually denoted by H.
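The three indices listed above are straightforward to compute from firm outputs. The following sketch is illustrative only: the function names and the sample outputs are assumptions, and firms with a zero share are skipped in the entropy index (using the convention that a ln a tends to 0 as a tends to 0).

```python
import math

def market_shares(quantities):
    """Convert firm outputs into market shares a_i = q_i / Q."""
    total = sum(quantities)
    return [q / total for q in quantities]

def m_firm_ratio(shares, m):
    """R_m: sum of the m largest market shares."""
    return sum(sorted(shares, reverse=True)[:m])

def herfindahl(shares):
    """R_H: sum of squared market shares."""
    return sum(a * a for a in shares)

def entropy_index(shares):
    """R_e: sum of a * ln(a) over firms with a positive share."""
    return sum(a * math.log(a) for a in shares if a > 0.0)

shares = market_shares([50.0, 30.0, 20.0])  # hypothetical outputs
print(m_firm_ratio(shares, 2), herfindahl(shares), entropy_index(shares))

# With n symmetric firms: R_m = m/n, R_H = 1/n, R_e = ln(1/n).
n = 4
symmetric = [1.0 / n] * n
print(m_firm_ratio(symmetric, 2), herfindahl(symmetric), entropy_index(symmetric))
```

Note that the entropy index as defined here is never positive; it rises toward zero as the market becomes more concentrated.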


Of course, such indices should be related to our idea 
of concentration. Encaoua and Jacquemin (1980) give an 
axiomatic derivation of “allowable” concentration indices. 
They require a concentration index $R(\alpha_1, \ldots, \alpha_n)$ to satisfy
the following properties: It must be symmetric between
firms (invariant to permutations of market shares between
firms); it must satisfy the Lorenz condition that a mean-preserving
spread¹⁷ (i.e., a further spread of the distribution
of market shares toward its tails) increases R; and the
concentration for symmetric firms must decrease when
the number of firms grows from n to n + 1:

$$R\left(\frac{1}{n}, \ldots, \frac{1}{n}\right) \ge R\left(\frac{1}{n+1}, \ldots, \frac{1}{n+1}\right).$$

They show that a family of concentration indices satisfying
these properties takes the form

$$R(\alpha_1, \ldots, \alpha_n) = \sum_{i=1}^{n} \alpha_i h(\alpha_i),$$

where h is an arbitrary nondecreasing function such that
$\alpha h(\alpha)$ is convex. The Herfindahl index and the entropy
index are two such concentration indices (with $h(\alpha) = \alpha$
and $h(\alpha) = \ln \alpha$, respectively). The m-firm concentration
ratio satisfies these properties, although it does not belong
to the family.
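The three properties can be spot-checked for the two members of the family named above. This is only a numerical illustration on hypothetical share vectors, not a proof; the function names and the sample shares are assumptions.

```python
import math

def family_index(shares, h):
    """R = sum_i a_i * h(a_i) for a nondecreasing h with a*h(a) convex."""
    return sum(a * h(a) for a in shares if a > 0.0)

def herfindahl_h(a):
    return a            # h(a) = a gives the Herfindahl index

def entropy_h(a):
    return math.log(a)  # h(a) = ln a gives the entropy index

before = [0.5, 0.3, 0.2]
after = [0.6, 0.3, 0.1]  # 0.1 of market share moved from the smallest to the largest firm

for h in (herfindahl_h, entropy_h):
    permuted = family_index(list(reversed(before)), h)
    symmetric_n = family_index([1.0 / 4] * 4, h)
    symmetric_n_plus_1 = family_index([1.0 / 5] * 5, h)
    print(
        abs(family_index(before, h) - permuted) < 1e-12,   # symmetry
        family_index(after, h) > family_index(before, h),  # spreading shares raises R
        symmetric_n > symmetric_n_plus_1,                  # more symmetric firms lowers R
    )
```

Both lines of output should read `True True True` for these share vectors.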

Although the previous requirements seem reasonable, 
they do not tell us what use to make of concentration 
indices. Do they reflect a useful economic variable for 
measurement or policy evaluation? One possibility is that 
they are related to the profitability of the industry. Indeed,
Bain (1951, 1956) hypothesized that concentration
facilitates collusion between firms and increases industry- 
wide profits. At this point we cannot assess the (mainly 
dynamic) collusion part of the statement, but we can 
already look at the link between concentration and in- 
dustry profits in light of the static Bertrand and Cournot 
models. Most of the cross-section analysis has actually 
focused on the relationship between concentration indices 
and profitability.¹⁸

Let us first consider symmetric firms with equal market 
shares. The only reasonable measures of concentration 
are then equivalent to the number of firms in the industry
(i.e., the concentration indices are decreasing with
the number of firms in the industry; e.g., $R_m = m/n$,
$R_H = 1/n$, $R_e = \ln(1/n)$). The Bertrand model tells us that
market price and industry profits are independent of the 
number of firms in the industry. Thus, profitability and 
concentration are unrelated. The Cournot model, how- 
ever, exhibits a negative correlation between number of 
firms and profitability (see section 5.4).¹⁹

When firms have asymmetric market shares (because of 
cost differences, say), there is no longer an unambiguous 
measure of concentration. In some simple cases it can be 
shown that industry profitability is related to a simple 
index of concentration. For instance, following Cowling 
and Waterson (1976), suppose that firms have constant 
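marginal costs ($C_i(q_i) = c_i q_i$), and that they compete in
quantities. Industry-wide profits are

$$\Pi = \sum_{i=1}^{n} (p - c_i) q_i = \sum_{i=1}^{n} \frac{p\,\alpha_i q_i}{\varepsilon} = \frac{pQ}{\varepsilon} \left(\sum_{i=1}^{n} \alpha_i^2\right), \qquad (5.14)$$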


where use is made of expression 5.6 for the Lerner index. 
Assume further that consumers spend a fixed amount of 
income on the good; i.e., the elasticity $\varepsilon$ of their demand


17. See Rothschild and Stiglitz 1970, Atkinson 1970, Kolm 1966, and Kolm 
1969. Consider two industries of equal size with market shares $\{\alpha_i\}_{i=1}^{n}$ and
$\{\alpha_i'\}_{i=1}^{n}$. Let $R_m$ and $R_m'$ denote the m-firm ratios. Suppose that for all m
between 1 and n, $R_m \ge R_m'$: The aggregate share of the m biggest firms in the
first industry is greater than or equal to the share of the m biggest firms in the 
second industry, for all m. Then, the concentration index R must be higher for 
the first industry (the Lorenz criterion). This condition can be shown to be 
equivalent to the transfer principle, according to which the transfer of a part of 
a firm's share to a bigger firm must not decrease the concentration index. For 
a given number of firms in the industry, this condition implies that the con- 
centration index R takes on its minimum value when the firms have equal 
shares, and its maximum value when a single firm captures the whole market.

18. See chapters 3 and 9 of Scherer 1980 and, especially, Schmalensee 1986 for
detailed discussions of the empirical literature and references. Most cross- 
sectional analyses find a weak but statistically significant link between concen- 
tration and profitability. Beyond the issue of measurement of profitability, the 
interpretation of this link is complicated by the fact that the relationship is a 
reduced form between two endogenous variables, which, furthermore, can be 
obtained from fairly different models of competition. Other good discussions of 
concentration measurement are Hannah and Kay 1977 (especially chapter 4) 
and Curry and George 1983. 


19. One must be careful here. The number of firms is assumed to be exogenous.
If the number of firms depends on the cost of entry, then a higher cost of
entry yields a higher concentration but may offset the resulting increase in
gross profits (see chapter 7).

is equal to 1: $Q = k/p$, where k is a positive constant. We
then obtain

$$\Pi = k \left(\sum_{i=1}^{n} \alpha_i^2\right) = k R_H. \qquad (5.15)$$

The Herfindahl index then yields an exact measure (up to
the proportional constant) of industry profitability.
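Equation 5.15 can be verified on a numerical example. The closed form used below is not in the text: it is derived here from the first-order condition 5.9 with $P(Q) = k/Q$, which gives shares $\alpha_i = 1 - c_i Q/k$ and, summing over firms, $Q = (n-1)k/\sum_j c_j$. The function name and the parameter values are illustrative assumptions, and the formula applies only when every firm produces a positive quantity.

```python
def unit_elastic_cournot(costs, k):
    """Cournot equilibrium for demand Q = k/p and constant marginal costs.

    Derived (not taken from the text) from P - c_i + q_i P' = 0 with P = k/Q:
    shares are a_i = 1 - c_i Q / k and total output is Q = (n - 1) k / sum(c).
    """
    n = len(costs)
    if n < 2:
        raise ValueError("need at least two firms")
    Q = (n - 1) * k / sum(costs)
    shares = [1.0 - c * Q / k for c in costs]
    if min(shares) <= 0.0:
        raise ValueError("some firm would not produce; formula does not apply")
    return k / Q, Q, shares

costs = [1.0, 1.2, 1.4]  # hypothetical marginal costs
k = 10.0                 # hypothetical total expenditure on the good
price, Q, shares = unit_elastic_cournot(costs, k)

industry_profit = sum((price - c) * a * Q for c, a in zip(costs, shares))
herfindahl = sum(a * a for a in shares)

# First-order conditions: P - c_i + q_i P'(Q) = 0 with P'(Q) = -k / Q^2.
residuals = [price - c + a * Q * (-k / Q ** 2) for c, a in zip(costs, shares)]

print(industry_profit, k * herfindahl, max(abs(r) for r in residuals))
```

The first two printed numbers coincide, and the third (the largest first-order-condition residual) is zero up to rounding.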


Exercise 5.6**

(i) Show that under constant returns to scale and Cour- 
not competition the ratio of total industry profit to total 
industry revenue is equal to the Herfindahl index divided 
by the elasticity of demand. 

(ii) Show that under Cournot competition the “average 
Lerner index” ($\sum_i \alpha_i L_i$) is equal to the Herfindahl index
divided by the elasticity of demand. 


This chapter suggests that intrinsic asymmetries among 
firms are likely to yield both high concentration indices 
and high industry profitability. Indeed, Demsetz (1973) 
offered this argument as an alternative, noncollusive rea- 
son for Bain’s hypothesis about the positive correlation 
between the two variables. For instance, in Bertrand com- 
petition with constant marginal cost, the lowest-cost firm 
charges a price equal to the second-lowest cost, takes all 
of the market (yielding the highest possible concentration 
index, whatever index satisfying the Encaoua-Jacquemin 
axioms is chosen), and makes a positive profit. With sym- 
metric firms, concentration is generally not so high and 
the firms make no profit. The following exercise also 
gives a (Cournot) example in which exogenous increases 
in cost asymmetries create a positive relationship be- 
tween concentration indices and industry profit. 


Exercise 5.7** Suppose that demand is linear ($Q = 1 - p$)
and that there are two firms with constant marginal costs
$c_1$ and $c_2$ such that $c_1 + c_2 = 2c$ (where c is a constant).
Show that when the firms become more asymmetric ($c_1$
moves away from c) Cournot competition yields a higher
concentration index and a higher industry profit.


We will not investigate the generality of Demsetz’s 
conclusion. (For more on the Demsetz argument and for 
some tests, see Schmalensee 1987.) The intuition behind 
the positive correlation in the above examples is clear: 
Cost asymmetries yield output asymmetries, increasing 
the concentration index. At the same time, they allow
low-cost firms to enjoy a rent, thus increasing industry
profit.

One would also want to relate concentration and wel- 
fare. In the symmetric case, the number of firms is un- 
related to welfare for Bertrand competition and positively 
related to welfare for Cournot competition. With asymmetric
firms, a given concentration index need not be
related in a systematic manner to welfare for either type
of competition (in the same way that it need not be
related in a systematic manner to profitability).


Exercise 5.8*** Dansby and Willig (1979) have proposed
a look at the effect of small changes in a firm’s output on
aggregate surplus (consumer surplus plus industry profit).
Assume that, for some unspecified reason, firm i's output
moves from $q_i$ to $q_i + \delta q_i$ (for all i).

(i) Argue that the change in total surplus, $\delta W$, is equal
to

$$\sum_{i=1}^{n} (p - C_i')\,\delta q_i.$$

(ii) Suppose that a change $\delta q = (\delta q_1, \ldots, \delta q_n)$ must be
constrained to be smaller than a given number in Euclidean
norm:

$$\sum_{i=1}^{n} (\delta q_i)^2 \le k.$$

Show that for Cournot competition the upper bound on
$\delta W$ associated with this change is proportional to the
square root of the Herfindahl index. Discuss.


Concentration indices are useful in that they give an 
easily computable and interpretable indication of how 
competitive the industry is. However, they have no sys- 
tematic relationship with economic variables of interest 
for assessing changes in cost, demand, or policy. Further- 
more, they are endogenous, so they do not allow simple 
observations of correlation to be interpreted in a causal 
way. 


5.6 Concluding Remarks 


Price competition among even a few firms yields, accord- 
ing to Bertrand, competitive (socially optimal) outcomes. 
However, price competition is softened when the firms 
face sharply rising marginal costs (capacity constraints, in
an extreme case), when they compete repeatedly, or when
their products are differentiated. This chapter has treated 
the first softening factor. 

If firms choose their capacities before competing in 
price, under strong assumptions, they ex post choose the 
price that an auctioneer would pick so as to clear the 
market (i.e., to adjust demand to existing capacity). This
result provides some foundations for the Cournot model, 
in which firms choose quantities and an auctioneer then 
chooses the price so as to clear the market, as long as 
quantities are reinterpreted as capacities. Thus, the Ber- 
trand and Cournot models should not be seen as two rival 
models giving contradictory predictions of the outcome 
of competition in a given market. (After all, firms almost 
always compete in prices.) Rather, they are meant to de- 
pict markets with different cost structures. The Bertrand 
model may be a better approximation for industries with 
fairly flat marginal costs; the Cournot model may be 
better for those with sharply rising marginal costs. 

One should be cautious when justifying the Cournot 
model by the existence of capacity constraints. The valid- 
ity of the justification must be checked in each model. 

Last, quantity competition can more generally be seen 
as competition in choices of scale, where a firm's choice of 
scale determines its cost function and thus the conditions 
of price competition. 


5.7 Supplementary Section: 
Quantity Competition 


5.7.1 Traditional Cournot Analysis: Existence, 
Uniqueness, and Limit Behavior 


This subsection presents an incomplete account of the 
technical analysis and research that has been conducted 
on the one-stage Cournot model (no price competition) 
considered in chapter 5. 


5.7.1.1 Existence of a Pure-Strategy Equilibrium 


Pure-strategy equilibria have attractive properties. First, 
they are simple. Second, no firm has ex post regret after 
observing the choice of the other firm. So, no further 
adjustment is desired even if firms can change their capac- 
ities. A mixed-strategy equilibrium requires firms to be 
unable to adjust their capacities (even upward), because 
one firm's realization of capacity may not be optimal 
against the other firm's. Mixed-strategy equilibria are thus more sensitive to
the possibility of adjustment. The existence of a pure- 
strategy equilibrium has attracted much attention among 
researchers. Two sets of assumptions have historically 
been considered for this purpose. We will consider exis- 
tence in the two-firm case; it is straightforward to ex- 
tend the reasoning to more firms. For simplicity, we will 
also assume that profit functions are twice continuously 
differentiable. 

The first approach assumes that each firm's profit func- 
tion is concave in its own output (see, e.g., Szidarovsky 
and Yakowitz 1977). From the analysis in the text, we 
know that a sufficient condition for this is that the cost 
functions be convex ($C_i''(q_i) \geq 0$) and the demand func-
tion concave ($P'' \leq 0$). With concave profit functions, one
can define continuous reaction functions $R_i(q_j)$.²⁰ To
make sure that these reaction functions intersect, one can 
impose the following technical conditions: 


$P(0) > C_i'(0)$


for all i (each firm would like to produce at least a small 
quantity if it were a monopoly) and 


$R_j^{-1}(0) > R_i(0) = q_i^m$

(firm i's output that induces firm j to produce nothing
exceeds firm i's monopoly output). These conditions, as
well as the strict concavity of the profit functions with
respect to own output, are satisfied for linear demand and
constant marginal costs as long as the latter are “not too
high.” The existence can be obtained from figure 5.7.²¹

20. We will assume that $\Pi^i_{ii} < 0$, for simplicity. In this case, reaction functions
are single-valued.
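As an illustration of the linear case, the following minimal sketch checks the two boundary conditions and strict concavity, then finds the equilibrium by iterating best responses. The demand form $P(Q) = a - bQ$, the parameter values, and all function names are assumptions made for the example; they do not come from the text.

```python
# Illustrative linear-demand duopoly: P(Q) = a - b*Q, constant marginal cost c.
# The parameter values are assumptions chosen for the example only.

def reaction(q_other, a=10.0, b=1.0, c=1.0):
    """Best response to rival output: maximizes q * (a - b*(q + q_other) - c)."""
    return max(0.0, (a - c - b * q_other) / (2.0 * b))

def check_conditions(a=10.0, b=1.0, c=1.0):
    """Return the two boundary conditions and the strict-concavity check."""
    price_above_cost_at_zero = a > c          # P(0) > C_i'(0)
    monopoly_output = reaction(0.0, a, b, c)  # R_i(0) = q_i^m
    shutdown_output = (a - c) / b             # R_j^{-1}(0): output that drives the rival to zero
    second_derivative = -2.0 * b              # q_i P'' + 2 P' - C_i'' with P'' = C_i'' = 0
    return (price_above_cost_at_zero,
            shutdown_output > monopoly_output,
            second_derivative < 0)

def cournot_equilibrium(a=10.0, b=1.0, c=1.0, iterations=200):
    """Iterate simultaneous best responses from (0, 0); the slope of R is -1/2."""
    q1 = q2 = 0.0
    for _ in range(iterations):
        q1, q2 = reaction(q2, a, b, c), reaction(q1, a, b, c)
    return q1, q2
```

With these assumed values each firm's equilibrium output is $(a - c)/(3b) = 3$, which the iteration approaches geometrically.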


Remark The existence of a pure-strategy Cournot equi- 
librium is not an issue for an industry with a large number 
of firms. To see this intuitively, recall that the second
derivative of $\Pi^i$ is


$q_i P''(Q) + 2P'(Q) - C_i''(q_i)$.


Suppose that $C_i'' \geq 0$, and suppose that (as in the example
in the text) one increases the number of firms keeping
demand constant, and that the aggregate output con-
verges to the competitive one. $P'(Q)$ then converges to a
strictly negative constant. If $q_i$ converges to zero (as in
the example in the text), $\Pi^i$ is clearly strictly concave and
the previous existence result can be applied. If $q_i$ does not
converge to zero, and competitive equilibrium is obtained
through the replication of the consumption side or else
through a reduction of the most efficient scale (see sub-
section 5.7.1.3), more work is required to obtain this type
of result. Novshek and Sonnenschein (1978) show that for
their model there exists a Cournot equilibrium in which
the proportion of firms using a mixed strategy converges
to zero when the economy is replicated.²²


Unfortunately, the profit function is not necessarily
concave. It may not be, in particular, if the demand function
is “sufficiently convex.” (See Friedman 1983 and Roberts
and Sonnenschein 1977 for robust counterexamples to
concavity of the profit functions and existence of pure-
strategy equilibria even with convex costs.) The reaction
functions need not be continuous (may involve jumps) if
the profit functions are nonconcave. A second approach
(McManus 1962, 1964; Roberts and Sonnenschein 1976;
Vives 1985) proves existence for symmetric firms with a
convex cost function. The key to the proof is to show
that the convex-cost assumption implies that jumps in
the reaction function (which is the same for all firms) are
jumps up.²³ As figure 5.8 indicates, only jumps down are
a problem when one is proving the existence of a firm's 
output q such that q is an optimal reaction to itself (sym- 
metric pure-strategy equilibrium). 

A more recent approach (Novshek 1985; for a related 
result see Bamon and Frayssé 1985) shows that if a firm's 
marginal revenue increases with the other firm's output, a 
pure-strategy equilibrium exists. 


5.7.1.2 Uniqueness


Even if it exists, a pure-strategy Cournot equilibrium need
not be unique (see figure 5.9). One can, nevertheless, find


21. The more general proof for $P'' \leq 0$ and $C_i'' \geq 0$ is as follows: Define the
(single-valued) function $q_i(Q)$ by the first-order condition


$P(Q) - C_i'(q_i) + q_i P'(Q) = 0$,


or by 0 if this equation has no solution. Note that $q_i(Q)$ is continuous and
nonincreasing; so is $\sum_i q_i(Q)$. A pure-strategy Cournot equilibrium is obtained
by finding an aggregate output such that $Q = \sum_i q_i(Q)$, i.e., by finding a fixed
point of the function $Q \to \sum_i q_i(Q)$. The Brouwer theorem asserts that a con-
tinuous function from a compact set into itself admits at least one fixed point,
which proves the result. (Compactness is easily obtained by assuming that
$\sum_i q_i(0) > 0$ and $\sum_i q_i(Q) < Q$ for any $Q$ such that $P(Q) = 0$.) The equilibrium
is then unique, because all the functions $q_i(Q)$ are decreasing when strictly
positive. (To see this, it may be convenient to draw a diagram.)


22. They also show in a general-equilibrium model that the demand curve,
which we naturally assume to be downward sloping in our partial-equilibrium
context, must indeed be downward sloping in equilibrium for a large economy.


23. Assume that $P'' > 0$ (the case that creates problems with the first proof of
uniqueness). Assume for simplicity that costs are nil. Suppose that for $q_1 - \varepsilon$
the optimal reaction is $q_2$, and that for $q_1 + \varepsilon$ it is $q_2' < q_2$, where $\varepsilon$ is positive
and arbitrarily small. Profit maximization requires that

$q_2 P(q_1 - \varepsilon + q_2) \geq q_2' P(q_1 - \varepsilon + q_2')$

and

$q_2' P(q_1 + \varepsilon + q_2') \geq q_2 P(q_1 + \varepsilon + q_2)$.

Adding these two inequalities and using a first-order Taylor expansion in $\varepsilon$
yields a contradiction for $P'' > 0$. Thus, there cannot be any jump down. The
reader may check the property for general convex cost functions.

sufficient conditions for uniqueness. For instance, con-
sider the two-firm case. Assume that profit functions are
strictly concave in own output. The differentiation of the
first-order condition


$\Pi^i_i(R_i(q_j), q_j) = 0$

with respect to $q_j$ gives the slope of the reaction curve:


$|R_i'(q_j)| = \frac{|\Pi^i_{ij}(R_i(q_j), q_j)|}{|\Pi^i_{ii}(R_i(q_j), q_j)|}$.


A sufficient condition for reaction curves to intersect only
once is that whenever they intersect, $R_1$ is steeper than
$R_2$ (see points A and C in figure 5.9). In turn, a sufficient
condition for this to hold is that the derivatives of the
reaction functions be less than 1 in absolute value over
the relevant range ($|R_i'| < 1$). Thus, $|\Pi^i_{ii}| > |\Pi^i_{ij}|$ is suffi-
cient for uniqueness.²⁴ This condition is satisfied for linear
demand and constant returns to scale, for which the slope
of the reaction functions is $-\frac{1}{2}$.
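The slope condition can be checked numerically on a nonlinear example. The sketch below assumes the concave demand $P(Q) = 1 - Q^2$ with zero costs (an illustrative choice, not one used in the text), for which the first-order condition gives the closed-form reaction function $R(x) = (-2x + \sqrt{x^2 + 3})/3$ on $[0, 1]$. It verifies that $|R'| < 1$ on a grid and that best-response iteration reaches the same point from different starting values.

```python
import math

def reaction(x):
    """Best response to rival output x for P(Q) = 1 - Q**2 and zero cost.
    Closed form obtained from the first-order condition; assumed example."""
    return max(0.0, (-2.0 * x + math.sqrt(x * x + 3.0)) / 3.0)

def max_abs_slope(grid_points=1000, h=1e-6):
    """Largest |R'(x)| on an interior grid of [0, 1], by central differences."""
    worst = 0.0
    for k in range(1, grid_points):
        x = k / grid_points
        slope = (reaction(x + h) - reaction(x - h)) / (2.0 * h)
        worst = max(worst, abs(slope))
    return worst

def iterate_best_responses(q1, q2, rounds=300):
    """Simultaneous best-response iteration starting from (q1, q2)."""
    for _ in range(rounds):
        q1, q2 = reaction(q2), reaction(q1)
    return q1, q2
```

In this example the symmetric equilibrium solves $5q = \sqrt{q^2 + 3}$, i.e., $q = \sqrt{1/8}$, and the iteration converges to it because the reaction function is a contraction on the relevant range.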


5.7.1.3 Convergence to the Competitive Equilibrium 


In the text, we saw in a simple example that when the 
number of firms goes to infinity, the Cournot equilibrium 
converges toward the competitive equilibrium. The rea- 
son is that a small firm is more willing to expand output 
than a big one, because the effect of such an expansion on 
the market price is small for a small firm. (Although the
global effect of a unit increase in $q_i$ on $p$ is the same as in
the monopoly or the duopoly case, most of it is a nega-
tive externality on the other firms if firm i is small.²⁵)
[Figure 5.9: Multiple Cournot equilibria.]

There are several ways of increasing the number of
firms. The one used in the text is a simple replication of
the number of firms. When the technology exhibits non-
constant returns to scale, one may want to replicate the
consumption side along with the production side to avoid
noncompetitive outcomes of the type discussed in chap-
ter 7 below. (For example, replicating the number of firms
does not make the industry more competitive if high
fixed costs and a limited consumer market restrict the
number of viable firms.) This is done in Gabszewicz and
Vial 1972, for instance. One may also allow “free entry”
from the start (the number of potential firms is infinite, 
and the number of operating firms is limited by the exis- 
tence of a fixed cost or increasing returns to scale over 
some range and by the extent of the market) and either 
replicate the consumption side or lower the minimum 
efficient scale to induce more entry. Though the extensive 
literature on this subject (see Novshek and Sonnenschein 
1978, Hart 1980, Roberts 1980, Mas-Colell 1982, and the 
symposium in the April 1980 Journal of Economic Theory) 
will not be reviewed here, it should be mentioned that 


24. With n firms, the condition

$|\Pi^i_{ii}| > \left|\sum_{j \neq i} \Pi^i_{ij}\right|$

is sufficient. This condition actually does not require goods to be perfect
substitutes; see Friedman 1983.


25. This type of result fits with the Roberts-Postlewaite (1976) result for more
abstract contexts, which demonstrates the impossibility of economic agents
manipulating the competitive process in a large economy.

much of this literature is concerned with a larger frame-
work. In particular, starting with Gabszewicz and Vial
1972, the results are often obtained in general equilib-
rium.²⁶ Here, we shall content ourselves with a simple
partial-equilibrium example due to Novshek and Son-
nenschein (1978).

Suppose that in the original economy each firm has a 
U-shaped average-cost curve C(q)/q, as in figure 5.10. 
Producing 0 costs 0. The most efficient scale (MES)—i.e., 
the output that minimizes average cost—can be set equal 
to 1 without loss of generality. Let c denote the minimum 
average cost. Let us keep the consumption side constant
(the demand is $p = P(Q)$) and reduce the MES. To
this purpose, we introduce the family of cost functions
$C_\alpha(q) = \alpha C(q/\alpha)$. The MES of $C_\alpha$ is $\alpha$, and the minimum
average cost is still $c$.²⁷

For any $\alpha$, we allow free entry. There are an infinite
number of potential firms. They all choose an output
simultaneously. Of course, for a given $\alpha$, only a finite
number of firms will enter (operate) because of initial
increasing returns to scale. A Cournot equilibrium implies,
in particular, that all operating firms (i.e., firms that choose
$q_i > 0$) make a non-negative profit and that any non-
operating firm would make a nonpositive profit were it to
enter.

When $\alpha$ tends to zero, firms can enter on a small
scale; that is, it is possible to produce a small output
at unit cost $c$. This induces more competitive behavior. In-
deed, if an equilibrium exists (as it does; see Novshek and
Sonnenschein 1978 or the remark in subsection 5.7.1.1),
the equilibrium total output $Q$ must belong to the interval
$[Q^* - \alpha, Q^*]$, where $Q^*$ is the Walrasian output of the
limit economy: $Q^* = D(c)$. To show this, suppose first
that $Q > Q^*$. Then,


$P(Q) < c \leq \frac{C_\alpha(q_i)}{q_i}$ for all $q_i > 0$.


Thus, operating firms make a negative profit. They would
be better off choosing $q_i = 0$ and making zero profit.
Now suppose that $Q < Q^* - \alpha$. Consider a nonoperating
firm i; it makes a zero profit. By entering at scale $q_i = \alpha$,
it would drive the total output to $Q + \alpha < Q^*$. Its profit
would then be


$\alpha P(Q + \alpha) - C_\alpha(\alpha) > \alpha c - C_\alpha(\alpha) = \alpha\left(c - \frac{C_\alpha(\alpha)}{\alpha}\right) = 0$,


a contradiction. Thus, one indeed gets a competitive
equilibrium in the limit when $\alpha$ converges to zero.
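The scaling argument can be illustrated with a concrete cost function. The sketch below assumes $C(q) = (c/2)(1 + q^2)$ for $q > 0$ and $C(0) = 0$, so that average cost is U-shaped with MES equal to 1 and minimum average cost $c$, together with a linear inverse demand $P(Q) = 10 - Q$. Both functional forms, the parameter values, and the function names are assumptions made for the example.

```python
def base_cost(q, c=2.0):
    """Assumed U-shaped-average-cost example: C(0) = 0, C(q) = (c/2)(1 + q^2) for q > 0.
    Average cost (c/2)(1/q + q) is minimized at q = 1, where it equals c."""
    return 0.0 if q <= 0 else 0.5 * c * (1.0 + q * q)

def scaled_cost(q, alpha, c=2.0):
    """C_alpha(q) = alpha * C(q / alpha)."""
    return alpha * base_cost(q / alpha, c)

def inverse_demand(Q):
    """Assumed linear inverse demand P(Q) = 10 - Q."""
    return 10.0 - Q

def entry_profit(Q, alpha, c=2.0):
    """Profit of an entrant producing alpha when the other firms produce Q in total."""
    return alpha * inverse_demand(Q + alpha) - scaled_cost(alpha, alpha, c)

def efficient_scale(alpha, c=2.0, steps=20000):
    """Grid search for the output minimizing C_alpha(q)/q on (0, 4*alpha]."""
    best_q, best_ac = None, float("inf")
    for k in range(1, steps + 1):
        q = 4.0 * alpha * k / steps
        ac = scaled_cost(q, alpha, c) / q
        if ac < best_ac:
            best_q, best_ac = q, ac
    return best_q, best_ac
```

With $c = 2$ the Walrasian output is $Q^* = 8$. For $\alpha = 0.1$, entry at scale $\alpha$ is strictly profitable when $Q = 7.8 < Q^* - \alpha$ and yields zero profit when $Q = 7.9 = Q^* - \alpha$, in line with the interval $[Q^* - \alpha, Q^*]$.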

Hart (1979) pointed out that having a large number of 
competitors is not always necessary to obtain a competi- 
tive outcome. What matters is that a firm be small relative 
to the market for its good so that it is little affected by the 
effect of its decisions on the market price. This point is 
best exemplified in the case of a monopoly producer. 
Suppose that the producer has exactly ten units of such a
producer's good for potential sale (i.e., he has a capacity
constraint of 10, and a marginal cost of 0 before an out-
put of 10 and infinity thereafter). If there are ten con-
sumers, with unit demands and valuations 10, 9, 8, ..., the
consumption side is replicated at least ten times (say there 
are at least ten identical islands on which the monopolist 
can sell), the monopolist can sell his ten units at price 10. 
He then introduces no distortion in the economy. Figure 
5.11 gives the intuition for the result in the case of 
a continuously increasing marginal cost. That figure re- 
plicates the consumption side; for an economy of K > 1 
islands, the quantity demanded is $q = KD(p)$. ($K$ plays
the role of $1/\alpha$ in the previous reasoning.) Thus, for
production $q$, the market price is


26. See Hart 1985 for a very useful survey of the general-equilibrium results
and methodology.


27. $\min_q \frac{C_\alpha(q)}{q} = \min_q \frac{C(q/\alpha)}{q/\alpha}$

is reached for $q/\alpha = 1$ and is equal to $c$.

[Figure 5.11: demand $P(\tilde q)$, marginal revenue $P(\tilde q) + \tilde q P'(\tilde q)$, and marginal cost $\tilde C'(\tilde q) = C'(K\tilde q)$, plotted against output per island $\tilde q = q/K$.]


$P(q/K) = P(\tilde q)$,


where $\tilde q = q/K$ is the “output per island.” The cost func-
tion per island as a function of $\tilde q$ can then be written


$\tilde C(\tilde q) = C(q)/K = C(K\tilde q)/K$.


Thus, the marginal cost $\tilde C'(\tilde q) = C'(K\tilde q)$ shifts to the
northwest. The monopolist chooses the output per island 
to equalize marginal revenue and marginal cost. When K 
is large, the firm basically operates in the upper part of the 
demand curve, and thus takes the price (approximately 
equal to the highest valuation) almost as given. 
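The ten-unit example can be reproduced by brute force. The sketch below assumes a single uniform price, zero marginal cost up to capacity, and valuations 10, 9, ..., 1 on each island; the function name and the tie-breaking rule (the smaller quantity wins) are choices made for the illustration.

```python
def best_sale(islands, capacity=10, valuations=range(10, 0, -1)):
    """Revenue-maximizing uniform price for a seller with `capacity` units facing
    `islands` copies of unit-demand consumers with the given valuations.
    Returns (quantity, price, revenue); ties go to the smaller quantity."""
    # Valuations of all consumers across islands, highest first.
    pool = sorted((v for v in valuations for _ in range(islands)), reverse=True)
    best = (0, None, 0)
    for quantity in range(1, min(capacity, len(pool)) + 1):
        price = pool[quantity - 1]   # highest price at which `quantity` units can be sold
        revenue = price * quantity
        if revenue > best[2]:
            best = (quantity, price, revenue)
    return best
```

With one island the seller restricts output to 5 units (selling 6 units at price 5 earns the same revenue); with ten islands it sells all ten units at price 10, so the capacity constraint, not market power, determines the quantity.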

Allen and Hellwig (1986) have studied the capacity- 
constrained price game when the number of firms tends 
to infinity. They assume that each firm's capacity is an
exogenous constant (there is no first stage determining
capacities) and show that the equilibrium price distribu-
tion converges (in distribution) toward the perfectly com-
petitive outcome.²⁸


5.7.2 Capacity-Constrained Price Games 


Here we informally consider part of Kreps and Scheink- 
man’s (1983) ingenious construction, which shows that 
under some circumstances the Cournot outcome holds for 


capacity-constrained price games. Specifically, we first set 
up the price game with (rigid) capacity constraints and 
efficient rationing. We show that firms sell up to capacity 
within a region where capacities are not too large, and 
that in the two-stage game the first-period choice of 
capacities ends up in this region and the outcome is 
equivalent to the Cournot outcome. We will then discuss 
the timing of the game. 


5.7.2.1 The Price Game 


Suppose there are two firms ($i = 1, 2$). Firm i has a rigid
capacity constraint $\bar q_i$; it can produce any quantity $q_i \leq \bar q_i$
at unit cost $c$. It cannot produce more than $\bar q_i$. Assume
that $c = 0$ for simplicity. The marginal production cost is
represented in figure 5.1. The firm sells up to capacity if
$q_i = \bar q_i$. Assume efficient rationing.²⁹ The demand func-
tion $P$ is concave ($P'' \leq 0$), and the firms choose their
prices simultaneously.

The analysis proceeds as follows: We first consider the 
existence of a pure-strategy equilibrium (i.e., the firms do 
not randomize in their choice of price). We show that 
such an equilibrium exists if and only if the capacities are 
“not too high” (i.e., belong to some region just above the 
origin in the capacity space). The equilibrium in this re- 
gion is such that both firms charge the price at which 
demand equals aggregate capacity. Thus, both firms ba- 
sically dump their quantities in the market, in a manner 
analogous to Cournot behavior (the only difference is 
that firms, rather than the auctioneer, quote the market 
price). The next step in the analysis is to characterize the 
(necessarily mixed-strategy) equilibrium when capacities 
are “high.” This is complex, but a lemma shows that the 
profit of the highest-capacity firm is equal to the Stackel-
berg follower profit (i.e., the profit that this firm makes
when it reacts optimally to the other firm’s output, which 
is assumed to be equal to its capacity). The analysis of the 
prior choice of capacities is then simple. It is easily seen 
that the Cournot capacities or quantities lead to a price 
equilibrium in the pure-strategy region, and that if a firm 


28. They assume proportional rationing. They find that although there is
convergence in distribution, there is no convergence in support. That is, monop-
oly prices persist for any number of firms (but their probability tends to zero).
Vives (1986) shows that for the efficient rationing rule, the support of the
equilibrium price distribution converges to the competitive price as well.


29. See section 5.3.

chooses its Cournot capacity the other firm also is best off
choosing its Cournot capacity.


LEMMA 1 In a pure-strategy equilibrium, $p_1 = p_2 = P(\bar q_1 + \bar q_2)$.
Firms sell up to capacity.


Proof Suppose first that $p_1 = p_2 = p > P(\bar q_1 + \bar q_2)$.
Then the price is too high, in that at least some firm
i cannot sell to capacity: $q_i < \bar q_i$. Now, by charging
$p - \varepsilon$, firm i gets all of the market and can sell $\bar q_i$. Thus,
for $\varepsilon$ small, firm i would gain from undercutting (i.e.,
$q_i p < \bar q_i (p - \varepsilon)$). If $p_1 = p_2 = p < P(\bar q_1 + \bar q_2)$, both
firms strictly ration their customers. By raising its price a
bit, each firm would still be able to sell its capacity and
would make more profit. Last, $p_i < p_j$ is not feasible: The
low-price firm always wants to raise its price as long as it
is capacity constrained; or else $p_i$ is firm i's monopoly
price at cost $c = 0$, and firm i supplies the entire demand
at this price. Thus, firm j makes no profit, whereas it
could make a strictly positive profit by undercutting to
$p_i - \varepsilon$. Q.E.D.


The following simple lemma conveys most of the intui- 
tion about the relationship with Cournot competition. Let 
$R_i(q_j)$ denote firm i's optimal reaction to output $q_j$ in the
one-stage simultaneous-choice-of-quantity game in the
absence of cost of accumulating capacity: $R_i(q_j)$ max-
imizes $q_i P(q_i + q_j)$. Because the demand is concave, $R_i$ is
single-valued and decreasing (see section 5.4).


LEMMA 2 In a pure-strategy equilibrium, firm i never
charges less than $P(\bar q_j + R_i(\bar q_j))$ in the capacity-constrained
price game.


That is, there is no point charging a (low) price that leads 
the firm to produce beyond the optimal reaction to the 
other firm’s capacity (if it can do so). 


Proof Let $p_i$ denote the price charged by firm i. If firm j
charges a price $p_j > p_i$, firm i must charge its monopoly
price, and firm j makes no profit (whereas it could make a
profit by charging $p_i - \varepsilon$). If


$p_i = p_j < P(\bar q_j + R_i(\bar q_j))$,

firm i can raise its price slightly and make profit

$(p_i + \varepsilon)\bar q_i > p_i \bar q_i$

if it is capacity constrained. If firm i is not capacity con-
strained, firm j must be; at least one firm must be con-
strained, because otherwise they would undercut. So firm
i's profit is

$p_i(D(p_i) - \bar q_j) = q_i P(q_i + \bar q_j) < R_i(\bar q_j) P(R_i(\bar q_j) + \bar q_j)$,

where the inequality comes from the definition of the
reaction function. If firm j charges a price $p_j < p_i$, firm i's
profit is

$p_i(D(p_i) - \bar q_j)$

(or $p_i \bar q_i$ if $\bar q_i < D(p_i) - \bar q_j$; but, as noted above, a strictly
capacity-constrained firm is better off raising its price
slightly, so we need not consider this case). Since firm i is
not capacity constrained, we can rewrite its profit as

$q_i P(q_i + \bar q_j)$.

But this is the Cournot profit for output $\bar q_j$, so

$q_i = R_i(\bar q_j)$

by definition of the reaction function. Therefore,
$p_i = P(\bar q_j + R_i(\bar q_j))$, as was stated. Q.E.D.


Lemmas 1 and 2 imply that a pure-strategy equilibrium
exists only if, for all i,


$\bar q_i \leq R_i(\bar q_j)$.

To see this, let $\bar q_i > R_i(\bar q_j)$, but suppose that a pure-
strategy equilibrium exists. By lemma 1,

$p_i = P(\bar q_1 + \bar q_2)$.

Then

$p_i < P(\bar q_j + R_i(\bar q_j))$,


which contradicts lemma 2; hence, a
pure-strategy equilibrium cannot exist. Above either reac-
tion curve, the only possible equilibrium is a “mixed-
strategy” one (see figure 5.12). Conversely, if the capac-
ities lie under both reaction curves, $p = p_1 = p_2 = P(\bar q_1 + \bar q_2)$
is an equilibrium. Lowering the price is
senseless, because firms cannot sell more. Raising the
price implies that the quantity sold is smaller than the
optimal reaction:


$p(D(p) - \bar q_j) = q_i P(q_i + \bar q_j)$

and


$q_i < \bar q_i \leq R_i(\bar q_j)$.

Because $q_i P(q_i + \bar q_j)$ is concave in $q_i$ and maximized at $R_i(\bar q_j)$, moving $q_i$ further below $R_i(\bar q_j)$ lowers profit.

[Figure 5.12: capacity space, with the Cournot capacities for marginal cost $c$ marked.]


In particular, if the capacities are the Cournot capacities
$(q_1^*, q_2^*)$ (corresponding to marginal cost $c$), the equilib-
rium price is $P(q_1^* + q_2^*)$. More generally, in the pure-
strategy region, the reduced-form profits have the exact
Cournot reduced form.
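The characterization of the pure-strategy region lends itself to a small classifier. The sketch below assumes linear inverse demand $P(Q) = 1 - Q$ and zero production cost, so that $R(q) = (1 - q)/2$; the function names and the dictionary layout are illustrative choices, not part of the original construction.

```python
def inverse_demand(Q):
    """Assumed linear inverse demand P(Q) = 1 - Q (zero production cost)."""
    return max(0.0, 1.0 - Q)

def reaction(q_other):
    """Cournot best response at zero cost: maximizes q * P(q + q_other)."""
    return max(0.0, (1.0 - q_other) / 2.0)

def price_stage(cap1, cap2):
    """Classify a capacity pair. Under both reaction curves, both firms charge
    the market-clearing price and earn price * capacity; otherwise only a
    mixed-strategy equilibrium is possible and no price is reported."""
    if cap1 <= reaction(cap2) and cap2 <= reaction(cap1):
        price = inverse_demand(cap1 + cap2)
        return {"region": "pure", "price": price, "profits": (price * cap1, price * cap2)}
    return {"region": "mixed", "price": None, "profits": None}
```

For instance, capacities (0.3, 0.3) lie under both reaction curves and clear the market at price 0.4, whereas (0.6, 0.3) puts firm 1 above its reaction curve, so the pair falls in the mixed-strategy region.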

The latter property does not hold for proportional
rationing. Suppose that the capacities are the Cournot ca-
pacities $(q_1^*, q_2^*)$ and that both firms charge $p^* = P(q_1^* + q_2^*)$.
Firm 1's profit for $p > p^*$ is


$p D(p) \left(\frac{D(p^*) - q_2^*}{D(p^*)}\right) = [p D(p)] \left(\frac{q_1^*}{q_1^* + q_2^*}\right)$.


Hence, firm 1 is best off charging the monopoly price
(maximizing $p D(p)$), which, from section 5.4, exceeds the
Cournot price $p^*$. This suggests that this analysis cannot
be extended to proportional rationing (and, indeed, it
cannot). See subsection 5.7.2.3.
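The contrast between the two rationing rules can be made concrete. The sketch below assumes $D(p) = 1 - p$ and zero costs, so that the Cournot capacities are $1/3$ each and the Cournot price is $1/3$; it computes firm 1's profit from a unilateral price increase under each rule. The names and the grid search are illustrative assumptions.

```python
Q_STAR = 1.0 / 3.0            # Cournot output per firm for D(p) = 1 - p and zero cost
P_STAR = 1.0 - 2.0 * Q_STAR   # Cournot price

def demand(p):
    """Assumed linear demand D(p) = 1 - p."""
    return max(0.0, 1.0 - p)

def deviation_profit(p, rule):
    """Firm 1's profit from charging p >= P_STAR while firm 2 stays at P_STAR
    and sells its capacity Q_STAR. Firm 1 cannot sell more than Q_STAR."""
    if rule == "efficient":
        # Firm 2 serves the highest-valuation buyers first.
        residual = demand(p) - Q_STAR
    elif rule == "proportional":
        # A fraction (D(p*) - q2*) / D(p*) of buyers at every valuation is left over.
        residual = demand(p) * (demand(P_STAR) - Q_STAR) / demand(P_STAR)
    else:
        raise ValueError("rule must be 'efficient' or 'proportional'")
    return p * min(Q_STAR, max(0.0, residual))

def best_deviation(rule, steps=2000):
    """Grid search over prices in [P_STAR, 1] for the most profitable price."""
    grid = [P_STAR + (1.0 - P_STAR) * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda p: deviation_profit(p, rule))
```

Under efficient rationing the best price on the grid is the Cournot price itself; under proportional rationing the best price is close to the monopoly price 0.5 and yields more than the Cournot profit of $1/9$.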

Outside the pure-strategy region, one must look for 
mixed-strategy equilibria. (See Dasgupta and Maskin 1986 
for general results on the existence of mixed-strategy 
equilibria with discontinuous payoffs.) We will not repro- 
duce Kreps and Scheinkman’s equilibrium construction; 
we will simply characterize equilibrium behavior in
order to show that investing in capacity beyond the pure- 


strategy region in the first stage is not in either firm's 
interest. 

A mixed strategy for firm i is a cumulative distribution
of prices $F_i(p_i)$ on some interval $[\underline{p}_i, \bar p_i]$.³⁰ In order for
such a strategy to be optimal for firm i, it must be the case 
that only prices that maximize firm i's expected profit 
are chosen by firm i (that is, all prices that are chosen 
yield the same—optimal—payoff). See the Game Theory 
User's Manual for a discussion of mixed strategies. 


LEMMA 3 In the mixed-strategy region ($\bar q_i > R_i(\bar q_j)$ for
at least some firm i), the highest-capacity firm (i, say)
makes a profit equal to its “Stackelberg follower profit”:


$\Pi^i = \Pi^F(\bar q_j) = R_i(\bar q_j) P(\bar q_j + R_i(\bar q_j))$.


The proof of lemma 3 (a sketch of which is given
below) is long and involved; it should be skipped in a first
reading.


Proof (Sketch) Let $\underline{p}_i$ and $\bar p_i$ denote the lower and upper
bounds of the support of firm i's optimal strategy. First,
let us show that $\underline{p}_1 = \underline{p}_2 = \underline{p}$, and that at $\underline{p}$ each firm
sells up to capacity or its opponent charges $\underline{p}$ with prob-
ability 0. If $\underline{p}_i < \underline{p}_j$, we know by previous arguments that
$\underline{p}_i$ must be firm i's monopoly price. As the monopoly
profit is the most firm i can get, it will charge $\underline{p}_i$ with
probability 1 and firm j will never make a profit; however,
firm j could make a strictly positive profit by under-
cutting to $\underline{p}_i - \varepsilon$, which contradicts the supposition that
$\underline{p}_j$ is the lower bound of the support of firm j's optimal
strategy. Second, if firm j charges $\underline{p}$ with positive prob-
ability (“plays an atom”), firm i will be better off charging
$\underline{p} - \varepsilon$ if at $\underline{p}$ it cannot sell to capacity. Thus, by charging
$\underline{p}$, each firm i can sell $\bar q_i$ with probability 1. Because $\underline{p}$
is an optimal price,³¹ firm i's profit is $\underline{p}\,\bar q_i$. Note that
$\underline{p} > P(\bar q_1 + \bar q_2)$.

Now consider the highest prices, $\bar p_i$ and $\bar p_j$. Suppose
that $\bar p_i > \bar p_j$, or that $\bar p_i = \bar p_j$ and that firm j charges $\bar p_j$
with probability 0. Firm i's profit is


$\bar p_i (D(\bar p_i) - \bar q_j) = q_i P(q_i + \bar q_j)$,

30. Note that $F_i$ is increasing. Technically, $F_i(\cdot)$ is required to be right con-
tinuous, i.e., for all $p_i$,


$F_i(p_i) = \lim_{p \to p_i^+} F_i(p)$.


An atom at $p_i$ is defined by
$F_i(p_i) > \lim_{p \to p_i^-} F_i(p)$.


The equilibrium distributions actually have densities, with possibly an atom at
the upper bound of the support.


31. By continuity, if $\underline{p}$ is an infimum rather than a minimum.

where $q_i$ is the quantity sold at $\bar p_i$. Hence, $q_i = R_i(\bar q_j)$,³²
and firm i's profit is


$\Pi^F(\bar q_j) = R_i(\bar q_j) P(\bar q_j + R_i(\bar q_j))$.


(The superscript F refers to the fact that firm i is the
“Stackelberg follower,” i.e., that it reacts to firm j's choice
of $\bar q_j$. See chapter 8.) But, in a mixed-strategy equilibrium,
all optimal strategies for a firm must yield the same profit;
in particular,


$\Pi^F(\bar q_j) = \underline{p}\,\bar q_i$. (5.16)


On the other hand, suppose that $\bar q_j > \bar q_i$. Then, by charg-
ing $P(\bar q_i + R_j(\bar q_i))$, firm j can guarantee itself $\Pi^F(\bar q_i)$, be-
cause we are in the mixed-strategy region, so that
$\bar q_j > R_j(\bar q_i)$, and because $\bar q_i < \bar q_j$ implies that³³


$P[\bar q_i + R_j(\bar q_i)] > P[\bar q_j + R_i(\bar q_j)]$.

So we have

$\underline{p}\,\bar q_j \geq \Pi^F(\bar q_i)$. (5.17)


Eliminating $\underline{p}$, we obtain $\Pi^F(\bar q_j)\bar q_j \geq \Pi^F(\bar q_i)\bar q_i$. Simple
algebraic manipulations³⁴ then show that $\bar q_i \geq \bar q_j$, a
contradiction.

Or, each firm plays an atom at $\bar p = \bar p_i = \bar p_j$. However,
in a mixed-strategy equilibrium, $\bar p > \underline{p}$ and, from our
earlier analysis, $\bar p > P(\bar q_i + \bar q_j)$. Thus, at $\bar p$ each firm is
unable to sell to capacity with strictly positive probabil-
ity. Hence, each firm would be better off charging slightly
less than $\bar p$ than charging $\bar p$.

We thus conclude that the firm with the highest capacity
(say, i, with $\bar q_i \geq \bar q_j$) earns profit $\Pi^F(\bar q_j)$.


To construct a mixed-strategy equilibrium, one can 
look for increasing probability distributions for each firm 
over some (coincident) interval $[\underline{p}, \bar p]$ so that each firm is
indifferent between playing any price in this interval (see 
Kreps and Scheinkman 1983). We will not need to do so. 
Given the above characterization, we need only to know 
that an equilibrium exists; we need not worry about its 
particular shape. 


5.7.2.2 Choice of Capacities 


Let us now add a prior and simultaneous choice of capac-
ities; let $c_0 > 0$ denote the unit cost of installing capacity.
Let us show that the Cournot outcome


$\bar q_1 = \bar q_2 = q^{**}$,

where $q^{**}$ maximizes

$q[P(q + q^{**}) - c_0 - c]$,

is an equilibrium (c = 0 here). 

Figure 5.13 illustrates the reaction curves when the 
capacity cost is sunk and when it is not. In the second- 
period price game, the capacity cost is already sunk and
therefore it is irrelevant (bygones are bygones). Each firm
would like to put more output on the market than it
would if capacity were yet to be paid for. Hence, the
reaction curves move upward from the first to the second
period. In particular, $R(q^{**}) > q^{**}$, where $R$ denotes the
second-period reaction function.

Suppose that firm i plays $q^{**}$. Firm j, if it plays
$q \leq R(q^{**})$, gets

32. If $q_i > R_i(\bar q_j)$, firm i could raise its price toward
$P(\bar q_j + R_i(\bar q_j))$


and make a higher profit. If $q_i < R_i(\bar q_j)$, $q_i = \bar q_i$, and $\bar p_i = P(\bar q_i + \bar q_j) = \underline{p}$, so the
equilibrium is in pure strategies, a contradiction.


33. The following inequality is due to the fact that reaction curves are identical 
and have slope < 1, as can easily be shown by differentiating the first-order 
conditions for Cournot equilibrium. 


34. Suppose that $\bar q_i < \bar q_j$, which implies $\bar q_j > R_j(\bar q_i)$, and consider


$\Delta = \Pi^F(\bar q_j)\bar q_j - \Pi^F(\bar q_i)\bar q_i$

$= \int_{\bar q_i}^{\bar q_j} \frac{d}{dq}\left[q R(q) P(q + R(q))\right] dq$

$= \int_{\bar q_i}^{\bar q_j} \left[R(q) P(q + R(q)) + q R(q) P'(q + R(q))\right] dq$,


where $R$ denotes the reaction function and use is made of the envelope theorem
($R(q)$ maximizes the profit of the firm that reacts to $q$). Using the first-order
condition for Cournot competition, we have


$\Delta = -\int_{\bar q_i}^{\bar q_j} R P' (R - q)\, dq$.


If $\bar q_i \geq R(\bar q_i)$, then for all $q \geq \bar q_i$, we have $R(q) \leq R(\bar q_i) \leq \bar q_i \leq q$ and thus
$\Delta < 0$. Suppose next that $\bar q_i < R(\bar q_i)$. From footnote 32, $\bar q_j > R^{-1}(\bar q_i)$. For $q$ in
$[R^{-1}(\bar q_i), \bar q_j]$,


$R(q) \leq \bar q_i \leq R^{-1}(\bar q_i) \leq q$.

Hence,

$\Delta < -\int_{\bar q_i}^{R^{-1}(\bar q_i)} R P' (R - q)\, dq$

$= \bar q_i \left[R^{-1}(\bar q_i) P(\bar q_i + R^{-1}(\bar q_i)) - R(\bar q_i) P(\bar q_i + R(\bar q_i))\right]$

$< 0$,


as $R(\bar q_i)$ is a better reaction to $\bar q_i$ than $R^{-1}(\bar q_i)$.

[Figure 5.13: reaction curves when the capacity cost is already sunk ($R(\cdot)$) and when the capacity cost is not yet sunk.]
$q[P(q + q^{**}) - c_0] \leq q^{**}[P(2q^{**}) - c_0]$,


where $R$ still denotes the second-stage reaction function.
If $q > R(q^{**})$, firm j gets at most


$\Pi^F(q^{**}) = R(q^{**})\{P[R(q^{**}) + q^{**}] - c_0\}$


(here $\Pi^F$ is taken net of the cost of installing capacity $R(q^{**})$; by lemma 3, capacity beyond $R(q^{**})$ adds cost but no revenue).
But, by definition of $q^{**}$, $q^{**}$ ($< R(q^{**})$) is the best first-
period reaction to $q^{**}$. So,


$\Pi^F(q^{**}) < q^{**}\{P(2q^{**}) - c_0\}$.


We conclude that the Cournot equilibrium with cost $c_0$
is the equilibrium in the first-stage capacity game. And,
from the analysis of the price game, the second-stage
price is equal to $P(2q^{**})$.

To prove uniqueness in the choice of capacities re- 
quires more work, but not much; see Kreps and Scheink- 
man 1983. 
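The deviation argument can be checked numerically on an example. The sketch below assumes $P(Q) = 1 - Q$, zero production cost, and a capacity cost $c_0 = 0.1$ (all illustrative values), and evaluates firm j's net payoff from each capacity on a grid when firm i installs $q^{**}$. In the mixed-strategy region it uses the lemma 3 characterization of the larger firm's second-stage revenue rather than an explicit equilibrium construction.

```python
C0 = 0.1  # assumed unit cost of capacity; production cost is zero; demand is P(Q) = 1 - Q

def price(Q):
    """Assumed linear inverse demand."""
    return max(0.0, 1.0 - Q)

def second_stage_reaction(q_other):
    """R(.): best response once the capacity cost is sunk (zero marginal cost)."""
    return max(0.0, (1.0 - q_other) / 2.0)

Q_COURNOT = (1.0 - C0) / 3.0  # q**: Cournot output when marginal cost is C0

def deviation_payoff(q):
    """Firm j's net payoff from capacity q when firm i installs Q_COURNOT."""
    follower_output = second_stage_reaction(Q_COURNOT)
    if q <= follower_output:
        # Pure-strategy region: both firms sell their capacity at the clearing price.
        return q * (price(q + Q_COURNOT) - C0)
    # Mixed-strategy region: the larger firm earns the follower revenue (lemma 3)
    # but pays for all installed capacity.
    return follower_output * price(follower_output + Q_COURNOT) - C0 * q

def best_capacity(steps=1000):
    """Grid search over capacities in [0, 1]."""
    grid = [k / steps for k in range(steps + 1)]
    return max(grid, key=deviation_payoff)
```

With these values $q^{**} = 0.3$ and $R(q^{**}) = 0.35$; the grid search returns 0.3, so no capacity on the grid does better against $q^{**}$ than $q^{**}$ itself.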


5.7.2.3 Discussion of the Rationing Rule 


Davidson and Deneckere (1986) argue that for virtually 
any rationing rule other than the efficient one, the Cour- 
not outcome cannot emerge as an equilibrium of the two- 
stage game. Their reasoning can be sketched as follows: 
If we let $c$ and $c_0$ denote the production cost and the
capacity-installation cost, the first-order condition for firm
2's profit maximization in the Cournot game is

$P'(q_1 + q_2) q_2 + P(q_1 + q_2) - c - c_0 = 0$

at $q_1 = q_2 = q^{**}$. Let $p^{**} = P(2q^{**})$ denote the Cournot
price, and let $D(p_2|p_1)$ denote firm 2's residual demand
when it charges price $p_2 \geq p_1$. Note that $D(p^{**}|p^{**}) = q^{**}$
if both firms have accumulated the Cournot capacity
in the first stage (a necessary condition for the Cournot
outcome to occur). Assuming that $D(p_2|p_1)$ is differen-
tiable with respect to $p_2$ to the right of $p_1$, and assuming
that the two firms have accumulated the Cournot capac-
ities and charge the Cournot price $p^{**}$, firm 2's increase
in profit associated with a slight increase in price above
$p^{**}$ is proportional to


$A = D(p^{**}|p^{**}) + (p^{**} - c) D'(p^{**}|p^{**})$.


(Recall that the investment cost is sunk in the second 
period.) 

Following Davidson and Deneckere, assume further
that, for $p_2$ just above $p^{**}$,


$D(p_2|p^{**}) \geq D(p_2) - q^{**}$.


That is, the residual demand exceeds the one obtained
with the efficient-rationing rule. The idea here is that if
rationing is instantaneous and costless, $q^{**}$ consumers are
served by firm 1 and the rest turn to firm 2. The worst
that can happen to firm 2 is that firm 1 serves the $q^{**}$
consumers with the highest valuations. This is precisely
what occurs under efficient rationing. That is, efficient
rationing yields the lowest residual-demand curve.³⁵

Let us actually make the slightly stronger assumption
that


$D'(p^{**}|p^{**}) > D'(p^{**}) = \frac{1}{P'(2q^{**})}$,

where the left-hand side refers to the residual-demand
curve and the right-hand side to the ordinary-demand
curve. Using the first-order condition for a Cournot equi-
librium, we obtain


$A > \frac{c_0}{P'(2q^{**})}$.


35. There is, of course, a need to return to the microfoundations of rationing
here. If rationing is not instantaneous and costless, one can think of residual
demands that are even worse for firm 2 than the one associated with efficient
rationing; for instance, consumers may maintain their order with firm 1 in the
hope of buying at the low price if rationing is not instantaneous.

Now, suppose that $c_0 = 0$. Then, $A > 0$. Thus, firm 2,
say, has an incentive to raise its price above the market-
clearing price at the Cournot capacities. The Cournot
outcome cannot be an equilibrium of the two-stage game.
More generally, the same holds if $c_0$ is small, and the
residual-demand curve is sufficiently above the one ob-
tained for the efficient rationing rule.
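The step from the Cournot first-order condition to the sign of $A$ can be spelled out as follows. This is a sketch that uses only the definitions of this subsection ($p^{**} = P(2q^{**})$, $D(p^{**}|p^{**}) = q^{**}$, $D'(p^{**}) = 1/P'(2q^{**})$); note that $p^{**} > c$ follows from the first-order condition because $P' < 0$.

```latex
\begin{align*}
A &= q^{**} + (p^{**} - c)\, D'(p^{**} \mid p^{**}) \\
  &> q^{**} + \frac{p^{**} - c}{P'(2q^{**})}
     && \text{since } D'(p^{**} \mid p^{**}) > \frac{1}{P'(2q^{**})} \text{ and } p^{**} > c \\
  &= \frac{P'(2q^{**})\, q^{**} + p^{**} - c}{P'(2q^{**})} \\
  &= \frac{c_0}{P'(2q^{**})}
     && \text{by the first-order condition } P'(2q^{**})\, q^{**} + p^{**} - c - c_0 = 0 .
\end{align*}
```

The lower bound is zero when $c_0 = 0$ and slightly negative when $c_0$ is small, which is why a residual-demand curve sufficiently above the efficient-rationing one is needed in the latter case.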


5.7.2.4 Discussion of the Timing 


A model of price competition occurring after capaci- 
ty competition reflects the idea that prices adjust more 
quickly than capacities. Thus, it may make sense to con- 
sider capacities as given when prices are chosen. An im- 
portant assumption of the previous analysis, though, is 
that a firm's capacity is observed by its competitor(s) 
before the price stage. It then acts as an indicator of the 
price the firm is about to charge. If capacities are imper- 
fectly observable, this feature disappears and, formally, 
everything is as if capacities and prices were chosen 
simultaneously (although they need not be). 

Gertner (1985) analyzes simultaneous quantity-price
games. Each firm i chooses a quantity q_i and a price p_i
without first observing the choices made by its competitors.
Although Gertner allows decreasing and increasing
returns to scale, let us focus on the simpler case of
constant returns to scale, in which producing q_i costs cq_i
to firm i. And, for simplicity, assume that there are two
firms.

It is clear that no pure-strategy equilibrium exists. The
logic closely follows the Bertrand-Edgeworth spirit. If a
pure-strategy equilibrium existed, the two firms would
have to sell at the same price. Otherwise, the low-price
firm (say, firm i) would have the whole market; knowing
that the other firm charges a higher price, it would
confidently be able to produce the whole demand at its lower
price. However, either p_i = c and firm i would be better
off raising its price by at least a small amount, or p_i > c
and firm j could make a strictly positive profit by
undercutting firm i. Next, in a pure-strategy equilibrium one
would need p_1 = p_2 = c; if the market price exceeded c,
each firm could increase its profit by undercutting slightly
and supplying the whole market. But the competitive
price cannot be an equilibrium either. At least one firm
would supply strictly less than D(c) (otherwise, the firms
would lose money); so the other firm could raise its price
a bit, still get customers, and make a positive profit.

Gertner shows that a (unique) mixed-strategy equilibrium
exists. It is similar to Bertrand's equilibrium in
that firms make a zero expected profit.36 It resembles
Cournot's equilibrium in that the expected price exceeds
the competitive price c. (This second result stems from the
fact that firms never charge less than c, and (c, c) is not an
equilibrium.) The qualitative difference with the case in
which quantities (capacities) are observed is that a firm
cannot commit itself not to "flood the market" by choosing
a limited capacity. This raises competitive pressure
and drives profits down, as in the Bertrand equilibrium.
The firm that ends up charging the lower price supplies
the whole market and makes a positive profit, and
the firm with the higher price makes a negative profit
(it produces and does not sell). The following exercise
demonstrates the logic of the argument.


Exercise 5.9** Consider a two-firm simultaneous quantity-price
game. Let $\bar{p}$ denote the supremum of prices at which
there is a demand: $D(\bar{p}) = 0$. Look for a mixed-strategy
equilibrium.

(i) Show that both firms make a zero profit. (Hint:
Consider the lowest and the highest price charged by
each firm.)

(ii) Suppose that each firm i plays according to some
continuous distribution $F_i(p)$ on $[\underline{p}_i, \bar{p}_i]$
(which can be demonstrated). Show that each firm produces D(p) when
it charges p, if the other firm also produces to satisfy the
demand at the price it charges.

(iii) Show that $F(p) = 1 - c/p$ for $p < \bar{p}$ and $F(\bar{p}) = 1$
is a symmetric equilibrium price distribution. Do these
results depend on the rationing rule?
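
A numerical sanity check of part (iii) may help fix ideas. The sketch below assumes a linear demand D(p) = 1 - p (so the choke price is 1) and a unit cost c = 0.2; neither value comes from the text, and the function names are purely illustrative. It evaluates the expected profit of a firm that charges p and produces D(p) while its rival prices according to F(p) = 1 - c/p, and it computes the mean of that distribution, including the atom of mass c/p̄ at the choke price.

```python
import math

def win_probability(p, c):
    """Probability that the rival charges more than p when it prices
    according to F(p) = 1 - c/p, i.e. 1 - F(p) = c/p."""
    return c / p

def expected_profit(p, c, demand):
    """Expected profit from charging p and producing D(p): the revenue
    p*D(p) is earned only if the rival's price is higher, whereas the
    production cost c*D(p) is always paid."""
    q = demand(p)
    return win_probability(p, c) * p * q - c * q

def expected_price(c, p_bar):
    """Mean of F: the density c/p**2 on [c, p_bar) contributes
    c*ln(p_bar/c), and the atom of mass c/p_bar at p_bar contributes c."""
    return c * (1.0 + math.log(p_bar / c))

# Assumed primitives (not from the text): D(p) = 1 - p, p_bar = 1, c = 0.2.
c, p_bar = 0.2, 1.0
demand = lambda p: max(0.0, 1.0 - p)

profits = [expected_profit(p, c, demand) for p in (0.2, 0.4, 0.6, 0.8, 0.99)]
mean_price = expected_price(c, p_bar)
print(profits)     # zero (up to rounding) at every price in the support
print(mean_price)  # strictly above the competitive price c
```

Under these assumptions every price in the support yields zero expected profit, while the expected price c(1 + ln(p̄/c)) lies above c, in line with the Bertrand-like and Cournot-like features described in the text.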


Both the sequential and the simultaneous quantity- 
price games involve mixed-strategy equilibria. This fea- 
ture is a bit unsatisfactory if one indeed believes that 
prices can be changed much more quickly than capacities. 
Under mixed strategies, one firm generally ends up with 
a higher price than its competitor and faces little or no 
residual demand. Clearly, this firm would like to react and 
revise its price downward to increase its market share. 


36. With decreasing returns (increasing marginal costs), firms make a profit. 
Thus, mixed strategies call for price dynamics. Indeed,
when Edgeworth introduced capacity constraints to es- 
cape the Bertrand paradox, he suggested the possibility 
of price cycles rather than the use of mixed strategies. 
Another feature worth studying when capacity precom- 
mitments take the form of production prior to sales and 
when the game in which both capacity and price decisions 
are made is repeated is the possibility of inventories. 


5.7.2.5 Competition for Inputs 


We assumed earlier that the firms on the output market 
have unrelated cost functions. In some circumstances, 
they may compete for the same input(s) for which they 
have monopsony power. A firm's cost of obtaining the 
input then depends on the other firm’s purchasing strategy.
An important feature is that each firm can overbid its 
competitor on the input market and foreclose access to 
the input supply (or at least make this access more expen- 
sive). If the inputs can be identified with capacity (think of 
wholesalers buying crops from farmers, or of final-good 
producers buying machines from suppliers), each firm can 
restrict its competitor's capacity by bidding input supplies 
up. Stahl (1985) assumes that the input-supply industry is
competitive and posits some other conditions; he shows
that the outcome of the two-stage game, in which firms bid
for inputs (capacities) in the first stage and choose prices
in the second stage, is competitive. As in
Bertrand equilibrium, even two firms producing the final 
good cannot prevent the price from falling to the level at 
which the consumers’ marginal willingness to pay is equal 
to the marginal cost of supplying the final good. 




Answers and Hints 
Exercise 5.1 


First, there cannot be an equilibrium in which both p_1 and
p_2 are strictly above c_2 (by the same reasoning as for the
symmetric case). Second, firm 2 does not charge less than
c_2 (it would make a negative profit if it sold).37 Third, firm
1 can guarantee itself a profit as close as possible to
(c_2 - c_1)D(c_2) by charging c_2 - ε (with ε small and positive).
But since the market price (the minimum of the two
prices) does not exceed c_2, this profit is also the highest
that firm 1 can obtain. There is an "openness problem":
Unless one assumes that at common price c_2 firm 1 gets
all the demand, there is no equilibrium stricto sensu; firm
1 will want to choose ε as close as possible to, but different
from, 0. Such an ε does not exist. But this is a technical
detail. One can define the equilibrium as the limit, so
p_1 = c_2 and firm 1's profit is (c_2 - c_1)D(c_2).38

When p^m(c_1) < c_2, firm 1 can charge its monopoly
price without worrying about firm 2's threat.


Exercise 5.2 
Suppose that firm 1 charges

\[
p^* = 1 - (\bar{q}_1 + \bar{q}_2) \ge \tfrac{1}{2}.
\]

Clearly, firm 2 has no incentive to charge less than p*.
Suppose that it charges p > p*. The residual demand is

\[
(1 - p)\,\frac{1 - p^* - \bar{q}_1}{1 - p^*}.
\]

Thus, firm 2's profit is

\[
p\,(1 - p)\left(\frac{1 - p^* - \bar{q}_1}{1 - p^*}\right).
\]

This implies that firm 2 must choose p = 1/2 (which is the
monopoly price in the absence of capacity constraint).
However, p* ≥ 1/2. From the concavity of its objective
function above p*, firm 2 can do no better than to charge
p*.


37. Here we are making the reasonable assumption that firm 2 does not play a
“dominated strategy.” We thus put some constraint on the Nash concept (see
the Game Theory User's Manual).


38. Show that the same result is obtained (still under the assumption that firm
2 does not play a dominated strategy) when the firms are free to supply
whatever demand they face.
Exercise 5.3


(i) q = 1/4 ⇒ Π^i = 1/16.

(ii) In duopoly, q = 1/3 ⇒ Π^i = 1/9 < 2 × 1/16.

(iii) A monopoly would clearly do better: It would
make an aggregate profit of 1/4.

(iv) In price competition, a firm that carries two substitute
products charges higher prices for these products
than two separate firms would. This is because, when
pricing a product, the firm internalizes the reduction of
demand on the other product associated with a price
decrease. Now, if there is a third firm producing a third,
differentiated product, this firm charges a higher price
for its own product if the other two products are sold
by a single firm. Merging is here a "puppy-dog
strategy," in the language of chapter 8. Merging firms
become less aggressive and, therefore, trigger a less
aggressive response from the third firm.

For more details on mergers and Cournot competition, 
see Davidson and Deneckere 1984, Salant et al. 1983, 
and Szidarowsky and Yakowitz 1982; on mergers and 
Bertrand competition, see Deneckere and Davidson 1985. 
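
The arithmetic behind parts (i) through (iii) can be checked with a few lines of code. The sketch below assumes three symmetric firms, inverse demand P = 1 - Q, and zero cost; this specification is an assumption made here for illustration (the exercise statement is not reproduced in this section), and the helper name is hypothetical. It uses the standard closed form q = 1/(n + 1) for the symmetric n-firm Cournot equilibrium.

```python
from fractions import Fraction

def cournot_symmetric(n):
    """Per-firm output and profit in an n-firm Cournot game with inverse
    demand P = 1 - Q and zero cost (closed form: q = 1/(n + 1))."""
    q = Fraction(1, n + 1)
    price = 1 - n * q
    return q, price * q

q3, profit3 = cournot_symmetric(3)  # three independent firms
q2, profit2 = cournot_symmetric(2)  # after two of the three firms merge
q1, profit1 = cournot_symmetric(1)  # after all three firms merge

print(q3, profit3)            # 1/4 1/16
print(q2, profit2)            # 1/3 1/9
print(profit1)                # 1/4
print(profit2 < 2 * profit3)  # True: the two merging firms lose
```

Under these assumptions the merged pair earns 1/9, less than the 1/8 its members earned jointly before merging, whereas a merger of all three firms raises aggregate profit from 3/16 to 1/4.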


Exercise 5.4 


(i) Let c_1 = w + r and c_2 = 2w + r denote the unit
costs. In Cournot equilibrium,

\[
q_1 = (1 - 2c_1 + c_2)/3 \quad\text{and}\quad q_2 = (1 - 2c_2 + c_1)/3
\]
or
\[
q_1 = (1 - r)/3 \quad\text{and}\quad q_2 = (1 - r - 3w)/3.
\]

(ii)
\[
\Pi^1 = \max_{q_1}\,\{q_1[1 - q_1 - q_2 - (r + w)]\}.
\]

From the envelope theorem,
\[
\frac{d\Pi^1}{dw} = q_1\left(-\frac{dq_2}{dw} - 1\right) = q_1(1 - 1) = 0.
\]

A change in w has two effects: It raises firm 1’s cost and
it weakens firm 2's strategic position. Because firm 2
is highly labor intensive, it must reduce its output
considerably. In general, either effect may dominate.
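
As a check on the envelope-theorem result, the following sketch evaluates firm 1's equilibrium profit at two wage levels using the Cournot outputs given in part (i) and the inverse demand P = 1 - q_1 - q_2 that appears in part (ii). The numerical values of w and r and the function names are illustrative assumptions.

```python
from fractions import Fraction

def cournot_duopoly(c1, c2):
    """Cournot outputs with inverse demand P = 1 - q1 - q2 and unit costs c1, c2."""
    q1 = (1 - 2 * c1 + c2) / 3
    q2 = (1 - 2 * c2 + c1) / 3
    return q1, q2

def firm1_profit(w, r):
    """Firm 1's equilibrium profit when c1 = w + r and c2 = 2w + r."""
    c1, c2 = w + r, 2 * w + r
    q1, q2 = cournot_duopoly(c1, c2)
    return q1 * (1 - q1 - q2 - c1)

r = Fraction(1, 10)                       # assumed raw-material price
low_w = firm1_profit(Fraction(1, 20), r)  # assumed low wage
high_w = firm1_profit(Fraction(1, 10), r) # assumed high wage
print(low_w, high_w)  # 9/100 9/100
```

With these cost functions the direct cost effect and the strategic effect offset exactly, so firm 1's profit equals ((1 - r)/3)^2 whatever the wage (as long as firm 2 still produces).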


Exercise 5.5 


(i)
\[
\max_{q_i}\,\bigl\{(1 - q_1 - q_2)\,q_i - q_i^2/2\bigr\} \;\Rightarrow\; 1 - 3q_i - q_j = 0.
\]
By symmetry, q_1 = q_2 = 1/4, and Π^1 = Π^2 = 3/32. Note
that the marginal cost is 1/4.

(ii) The two firms’ outputs are obtained by solving
\[
\Pi^1 = \max_{\{q_1,\,x_1\}} \bigl\{ q_1(1 - q_1 - q_2) + x_1(a - x_1) - (q_1 + x_1)^2/2 \bigr\}
\]
and
\[
\Pi^2 = \max_{\{q_2\}} \bigl\{ q_2(1 - q_1 - q_2) - q_2^2/2 \bigr\}.
\]
The first-order conditions yield the solution.
From the envelope theorem,
\[
\frac{d\Pi^1}{da} = \frac{\partial \Pi^1}{\partial a} + \frac{\partial \Pi^1}{\partial q_2}\,\frac{dq_2}{da} = x_1 - q_1\,\frac{dq_2}{da}.
\]
However, for a = 1/4, x_1 = 0; so dΠ^1/da < 0.

Interpretation: When a = 1/4, firm 1 is indifferent between
selling one unit in market 2 and not doing so
(because P(0) = a = 1/4 = MC). But the "strategic effect"
plays against firm 1: Firm 2 knows that for a = 1/4 + ε firm
1 sells a bit in the second market and therefore has a
higher marginal cost. So firm 2 increases its output. If firm
1 could commit to staying out of market 2, it would do
so (for a = 1/4 + ε). This abstention is an example of the
"lean-and-hungry-look strategy" ("stay thin to remain
aggressive") developed in chapter 8.
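
The strategic effect can be illustrated numerically. The sketch below solves the three linear first-order conditions of part (ii) by substitution and compares firm 1's profit at a = 1/4 with its profit at a slightly higher a. The value a = 13/50 and the function names are illustrative assumptions, and the closed form is valid only for a ≥ 1/4 (below that, firm 1 would not sell in market 2).

```python
from fractions import Fraction

def equilibrium(a):
    """Solve the three linear first-order conditions
         1 - 3*q1 - q2 - x1 = 0   (firm 1, market 1)
         a - q1 - 3*x1      = 0   (firm 1, market 2)
         1 - q1 - 3*q2      = 0   (firm 2, market 1)
    by substitution; valid for a >= 1/4, where x1 >= 0."""
    q1 = (2 - a) / 7
    q2 = (1 - q1) / 3
    x1 = (a - q1) / 3
    return q1, q2, x1

def firm1_profit(a):
    """Firm 1's total profit over both markets, with cost (q1 + x1)**2 / 2."""
    q1, q2, x1 = equilibrium(a)
    return q1 * (1 - q1 - q2) + x1 * (a - x1) - (q1 + x1) ** 2 / 2

base = firm1_profit(Fraction(1, 4))      # firm 1 sells nothing in market 2
higher = firm1_profit(Fraction(13, 50))  # slightly stronger demand in market 2
print(base)           # 3/32
print(higher < base)  # True
```

In this example a small improvement in market 2 lowers firm 1's total profit; if firm 1 could commit to staying out of market 2, it would keep the profit of 3/32 obtained in part (i).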


Exercise 5.6 


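(i)
\[
\frac{\Pi}{R} = \frac{\sum_i (p - c_i)\,q_i}{pQ} = \frac{\sum_i \alpha_i^2}{\varepsilon} = \frac{R_H}{\varepsilon}.
\]

(ii)
\[
L_i = \frac{p - c_i}{p} = \frac{\alpha_i}{\varepsilon}
\]
implies
\[
L = \sum_i \alpha_i L_i = \frac{R_H}{\varepsilon}.
\]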

Exercise 5.7 
Outputs and profits are derived in section 5.4. Note that

\[
Q = q_1 + q_2 = \tfrac{1}{3}(2 - c_1 - c_2) = \tfrac{2}{3}(1 - c).
\]

Thus, total output does not depend on the degree of cost
asymmetry. Let firm 1 be the low-cost firm: c_1 < c < c_2.
When c_1 decreases (and c_2 = 2c - c_1 increases), q_1 increases
and q_2 decreases. Hence, α_1 increases and α_2
decreases. Thus, any concentration index satisfying the
Lorenz criterion (like the three mentioned in the text)
increases.

Total profit is

\[
\Pi = \tfrac{1}{9}\bigl[18c_1(c_1 - 2c) + (2 - 4c + 20c^2)\bigr].
\]
Π is a convex function of c_1 with a minimum at c_1 = c.
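
These two claims can be verified directly. The sketch below holds the average cost c fixed, varies c_1, and recomputes the Cournot outcome with demand P = 1 - Q; the numerical cost values and the function name are assumptions chosen for illustration.

```python
from fractions import Fraction

def outcome(c1, c_bar):
    """Total output and industry profit in the Cournot duopoly with demand
    P = 1 - Q when the average cost c_bar is fixed, so c2 = 2*c_bar - c1."""
    c2 = 2 * c_bar - c1
    q1 = (1 - 2 * c1 + c2) / 3
    q2 = (1 - 2 * c2 + c1) / 3
    price = 1 - q1 - q2
    profit = (price - c1) * q1 + (price - c2) * q2
    return q1 + q2, profit

c_bar = Fraction(1, 5)  # assumed average unit cost
costs = [Fraction(1, 5), Fraction(3, 20), Fraction(1, 10)]  # c1 falling below c_bar
results = [outcome(c1, c_bar) for c1 in costs]
for c1, (total_output, profit) in zip(costs, results):
    print(c1, total_output, profit)
```

Total output stays at 2(1 - c)/3 in every row, while industry profit rises as the cost asymmetry grows, which is the convexity property stated above.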
Exercise 5.8 


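(i) W is equal to net consumer surplus plus industry
profit, or to gross consumer surplus minus total cost. The
change in gross consumer surplus is equal to p δQ, where
δQ = Σ_{i=1}^{n} δq_i; the change in cost is equal to Σ_{i=1}^{n} C_i' δq_i.

(ii) Maximize δW subject to the constraint

\[
\sum_{i=1}^{n} (\delta q_i)^2 \le k.
\]

δW is then called the industry performance gradient index.
The Lagrangian is equal to

\[
\sum_{i=1}^{n} \bigl[(p - C_i')\,\delta q_i - \lambda(\delta q_i)^2\bigr] + \lambda k.
\]
We get
\[
\delta q_i = (p - C_i')/2\lambda.
\]
Thus,
\[
\delta W = \Bigl(\sum_{i=1}^{n} (p - C_i')^2\Bigr)\Big/ 2\lambda.
\]
Using the aforementioned constraint, we get
\[
\sum_{i=1}^{n} (p - C_i')^2/4\lambda^2 = k.
\]

Substituting λ, and using the Cournot first-order condition
p - C_i' = p α_i/ε, gives

\[
\delta W = \sqrt{k}\,\Bigl[\sum_{i=1}^{n} (p - C_i')^2\Bigr]^{1/2} = \frac{p\sqrt{k}}{\varepsilon}\Bigl[\sum_{i=1}^{n} \alpha_i^2\Bigr]^{1/2},
\]

where ε is the elasticity of demand.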

Looking at small changes is usually (informally) justi- 
fied by the possibilities that the exogenous variables (e.g. 
cost) underlying Cournot competition do move slowly, 
and that those of these variables that are controlled by 
the government ought to be changed slowly (for informa- 
tional reasons, in particular—the tax-reform literature 
emphasizes that the elasticities of demand and supply 
functions are known only locally, which makes radical
moves hazardous). 

The manner in which changes in exogenous variables 
and changes in output are linked is an open question. In 
particular, one must look at how close the actual change 
in 6q induced by (say) a policy change comes to the 
change that maximizes 6W subject to the norm con- 
straint. Only this exercise will allow us to assess whether 
bigger potential welfare improvements are indeed asso- 
ciated with higher concentration indices. 

For more examples of types of competition and in- 
dices, see Dansby and Willig. 


Exercise 5.9 


Take an "arbitrary" rationing rule.

(i) Let $\underline{p}_i$ and $\bar{p}_i$ denote the lower and upper bounds of
the support of firm i’s mixed strategy. Suppose first that
$\bar{p}_i > \bar{p}_j$, or $\bar{p}_i = \bar{p}_j$ and firm j charges $\bar{p}_j$ with zero
probability. Then firm i makes zero profit, because when
charging $\bar{p}_i$ it sells with zero probability. Next, firm j also
makes zero profit; either $\underline{p}_j = c$, or $\underline{p}_j > c$ (in the latter
case, firm i could secure a positive profit by charging
$\underline{p}_j - \varepsilon$ and producing $D(\underline{p}_j - \varepsilon)$, a contradiction). So
we are left with $\bar{p}_i = \bar{p}_j$ and both firms playing an atom at
this price (i.e., playing this price with positive probability).
If $D(\bar{p}_i) > 0$, each firm can reduce its price slightly,
still produce $D(\bar{p}_i)$, and be able to sell the whole of $D(\bar{p}_i)$
if the other firm charges $\bar{p}_j$ (which it does with positive
probability). So $\bar{p}_i = \bar{p}_j = \bar{p}$, where $D(\bar{p}) = 0$. And,
again, neither firm makes a profit.

(ii) If firm j produces D(p_j) when it charges p_j, consumers
are never rationed by firm j and firm i’s profit
from charging p_i is

\[
\{[1 - F(p_i)]\,p_i - c\}\,q_i \quad \text{for } 0 \le q_i \le D(p_i)
\]
and
\[
[1 - F(p_i)]\,p_i D(p_i) - c\,q_i \quad \text{for } q_i \ge D(p_i).
\]

Clearly, the optimal quantity for price p_i is q_i = D(p_i) (or
zero).

(iii) If $F(p) = 1 - c/p$ for all p in $[c, \bar{p})$,

\[
\{[1 - F(p)]\,p - c\}\,D(p) = 0.
\]
Each firm plays an atom at $\bar{p}$.
Chapter 6


Dynamic Price Competition
and Tacit Collusion


The analysis in chapter 5 assumed one-shot competition; 
firms simultaneously quoted their prices, then “dis- 
appeared.” In practice, though, firms are likely to interact 
repeatedly. Durable investments, technological know- 
how, and barriers to entry promote long-run interactions 
among a relatively stable set of firms (this is especially 
true of industries with only a few firms). As was men- 
tioned in chapter 5, repeated interaction may upset the 
Bertrand outcome. With repeated interaction, a firm must 
take into account not only the possible increase in current 
profits but also the possibility of a price war and long-run 
losses when deciding whether to undercut a given price. 

Chamberlin (1929) suggested that in an oligopoly pro- 
ducing a homogeneous product, firms would recognize 
their interdependence and, therefore, might be able to 
sustain the monopoly price without explicit collusion. 
The threat of a vigorous price war would be sufficient to 
deter the temptation to cut prices. Hence, the oligopolists 
might be able to collude in a purely noncooperative man- 
ner. This possibility of tacit collusion was a challenging 
one at a time when economists were primarily concerned 
with overt collusion. Chamberlin even suggested that in 
the absence of hindering factors (to be discussed below), 
the monopoly price was the most likely outcome. 

The purpose of this chapter is to offer an introduction 
to the theory of repeated interaction. Section 6.1 briefly 
reviews the conventional wisdom on collusion and the 
factors that are supposed to hinder or facilitate collusion. 
(The reader may want to complement this section with 
the much richer description of the conventional wisdom 
in chapters 5 through 7 of Scherer 1980.) Because the 
dynamics of price behavior are hard to analyze, and the 
tool to do so (the theory of dynamic games) has been 
developed only recently, there is a considerable literature 
that attempts to formalize the dynamic aspects in a static 
context. This literature, which generally presumes some 
anticipation of one's opponent's reaction to the choice 
of one’s price, is reviewed in section 6.2. Sections 6.3 
through 6.5 use different approaches to develop full-fledged
models of dynamic price competition.1 In all
those approaches, price cutting yields short-run profits to
the undercutting firm but triggers a price war, and prices
above marginal cost (for instance, the monopoly price)
may be sustained in equilibrium. The motives for "retaliation"
differ in the three approaches, however. In the first
approach (that of the well-established supergame literature),
the price war is a purely self-fulfilling phenomenon.
A firm charges a low price because it expects the other
firms to do so ("bootstrap" behavior). The signal that
triggers such a noncooperative phase is previous undercutting
by one of the firms. The second approach presumes
short-run price rigidities; the reaction by one firm
to a price cut by another firm is motivated by its desire
to regain a market share that has been and continues
to be eroded by its rival's aggressive pricing strategy.
The third approach (reputation) focuses on (nonphysical)
intertemporal links that arise from the firms’ learning
about each other. A firm reacts to a price cut by charging
a low price itself because the previous price cut has
conveyed the information that its opponent either has
a low cost or cannot be trusted to sustain collusion and
is therefore likely to charge relatively low prices in the
future.

Thus, we will consider a variety of theories that can
explain tacit collusion. These should be thought of as
describing complementary aspects of repeated price interaction.
To paraphrase Scherer (1980, p. 151), the proliferation
of theories is mirrored by an equally rich array
of behavioral patterns actually observed under oligopoly.

Section 6.5 briefly reviews an alternative to optimizing
approaches (the evolutionary theory of price behavior),
and the supplementary section provides a further and
more advanced treatment of the dynamic-game approaches
to tacit collusion.


6.1 Conventional Wisdom (Factors Facilitating
and Hindering Collusion)


Starting with Chamberlin, several authors felt that repeated
interaction between oligopolists should facilitate collusion.
They also identified some factors that may hinder
it.


6.1.1 Collusion 


Consider a small number of identical firms producing a 
homogeneous product. Chamberlin conjectured that in 
such a situation the firms in the industry would charge the 
monopoly price, ie., the price that maximizes industry 
profit: 

If each seeks his maximum profit rationally and intelligently,
he will realize that when there are only two or a few sellers his
own move has a considerable effect upon his competitors, and
that this makes it idle to suppose that they will accept without
retaliation the losses he forces upon them. Since the result of a
cut by any one is inevitably to decrease his own profits, no one
will cut, and although the sellers are entirely independent, the
equilibrium result is the same as though there were a monopolistic
agreement between them. (1933, p. 48)


Several contributions published before World War II 
tried to formalize the discipline imposed by the possibility 
of reactions; some of them will be reviewed in section 6.2. 
By far the best-known is the story of the kinked demand
curve (Hall and Hitch 1939; Sweezy 1939): Consider two
firms with marginal cost c. Let q = D(p) denote the
demand function and let Π(p) = (p - c)D(p) denote the
industry profit when the lowest price charged is p. Start
from a situation in which firms charge the monopoly price
p^m and each makes profit Π^m/2, where Π^m = Π(p^m).
Suppose that a firm contemplates deviating from the
monopoly price and has the following conjecture about
its rival's reaction: Its rival will stay put at p^m if it raises
its price above p^m, and will follow suit (match the price) if
it cuts its price. It is clear that, under such a conjecture,
deviating from the monopoly price is not profitable. An
increase in price leads to a complete loss in market share
and a zero profit. A reduction in price to p < p^m yields
a profit of Π(p)/2 < Π^m/2. (The term “kinked demand
curve” will be explained in section 6.2.)
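
The kinked-demand conjecture can be made concrete with a small numerical example. The sketch below assumes a linear demand D(p) = 1 - p and a marginal cost c = 0.2, so that the monopoly price is (1 + c)/2 = 0.6; these values and the function names are illustrative assumptions, not part of the text. It compares the collusive payoff with the payoff from every deviation on a price grid.

```python
def industry_profit(p, c, demand):
    """Industry profit (p - c) * D(p) when the lowest price charged is p."""
    return (p - c) * demand(p)

def deviation_profit(p, p_m, c, demand):
    """Deviator's profit under the kinked-demand conjecture: a price above
    p_m is not followed (no sales); a price cut is matched (half the market)."""
    if p > p_m:
        return 0.0
    return industry_profit(p, c, demand) / 2.0

# Assumed primitives (not from the text): D(p) = 1 - p and c = 0.2.
c = 0.2
demand = lambda p: max(0.0, 1.0 - p)
p_m = (1.0 + c) / 2.0

collusive = industry_profit(p_m, c, demand) / 2.0
grid = [i / 100.0 for i in range(20, 101)]
deviations = [p for p in grid if abs(p - p_m) > 1e-9]
best_deviation = max(deviation_profit(p, p_m, c, demand) for p in deviations)
print(collusive, best_deviation)
```

On this grid the best deviation is a one-cent price cut, which is matched and leaves the deviator with slightly less than half of the monopoly profit; any price increase earns nothing.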

As Chamberlin recognized, there are factors that may 
hinder collusion. We distinguish two such factors: detec- 
tion lags and asymmetries between firms. The conse- 
quences of the first factor can be obtained easily from 
any of the dynamic-game approaches mentioned above. 
Unfortunately, efforts to formulate the second factor have 
not been as successful. 


1. Shapiro 1986 contains an excellent critical discussion of models of dynamic
price competition.
6.1.2 Detection Lags


Chamberlinian tacit collusion is enforced by the threat
of retaliation. But retaliation can occur only when it is
learned that some member of the industry has deviated. In
many industries, the prices charged by a manufacturer can
be observed fairly quickly by its competitors. In others,
however, prices may remain somewhat hidden. This may
be the case, for instance, when the manufacturers sell to a
small number of big buyers. Rather than quoting a price,
they make deals that are particular to each buyer and that
the other competitors may observe only with a lag. (They
may observe only the effects of these deals on their market
shares.) Because retaliation is delayed, it is less costly
to a price-cutting firm; therefore, tacit collusion is harder
to sustain. (In the extreme, suppose that a price cut
is never observed. Then everything is as if firms chose
prices simultaneously; there are never prices around that
competitors can react to. We know from chapter 5 that in
such a situation, absent capacity constraints, firms charge
their marginal cost and make no profit.)

Information lags make the future more distant and thus
make repeated interaction less relevant. A similar point
can be made about the existence of some large sales
situation, such as the arrival of a big order from a large
buyer. In such a situation, one would predict that collusion
will tend to break down because the short-run
private gain from undercutting is large relative to the
long-term losses associated with a subsequent price war.

Oligopolists are likely to recognize the threat to collusion
posed by secrecy, and consequently may take steps
to eliminate it.

First, they may create an industry trade association that


(among other functions) collects detailed information on 
the transactions executed by the association's members or 
allows its members to cross-check price quotations. The 
members of the industry can also give advance notice of 
their price changes. 

Second, the oligopolists may impose resale-price main- 
tenance on their wholesalers or their retailers (Telser 
1960). The idea here is that any deviation from collusive 
behavior is easily detected because a manufacturer's good 
is sold at a single price ungarbled by distribution idiosyncrasies
and price discrimination. (The “most favored
nation” clause, requiring that the seller charge a buyer a 
price no higher than what it charges any other buyer, 
serves a similar purpose and is a significant deterrent to 
price cutting; see Scherer 1980, p. 225.) 

Third, when firms sell hundreds or thousands of differ- 
ent products (as do department stores and manufacturers 
of products with a large number of attributes), rule-of- 
thumb pricing—such as a uniform margin over all pro- 
ducts or the use of representative prices—helps firms 
to quickly analyze one another's pricing behavior in the 
presence of complexity. (Consider, for example, an auto- 
repair shop that sets standard job-completion times and 
then sets an hourly rate.) Similarly, trade associations may 
impose standardization agreements to discourage price 
cutting in a multiple-attribute world, and industries pro- 
ducing products whose transportation costs are high rela- 
tive to their value (e.g. cement, steel, wood, sugar) are 
often alleged to use basing-point pricing to collude.3 An
example of basing-point pricing consists in charging a
unique mill price; prices to various destinations are then
equal to the announced mill price plus freight to those
destinations.4,5,6


2. See Scherer 1980, pp. 220-225, for some interesting examples of difficulties
of sustaining collusion under lumpiness or secrecy, and Stigler 1964 and Orr
and MacAvoy 1965 for early analyses of the topic.


3. See chapter 3 for a study of third-degree price discrimination in spatial
markets.


4. Some types of rule-of-thumb pricing require a good deal of common
information. In the case of uniform margins, for example, it would be necessary
for firms to know each other's costs to verify that the rule of thumb was being
obeyed. Basing-point pricing and pricing according to standard job-completion
times require less common information.


5. The practices reviewed in this paragraph allow a firm's rivals to infer a
change in its pricing strategy from a single price observation. They reduce
informational delays and thus hasten retaliation. It is also often alleged that
these practices (in particular, basing-point pricing) are used to aid
coordination (rather than information collection) by restricting attention to a single
price. According to this theory, the firms would have trouble focusing on a
"focal equilibrium" in a complex environment (perhaps because of bounded
rationality).

Neither the information-collection theory nor the coordination theory
explains how the rule-of-thumb or representative price is chosen (for instance,
why a particular basing-point pricing system is used rather than f.o.b. pricing,
or how an auto-repair shop sets standard job-completion times).


6. A good example of an industry in which such practices are used is
the turbine generator industry, which is dominated by General Electric and
Westinghouse. The buying process for turbine generators by electric utilities is
highly secret. In 1963, General Electric, quickly followed by its rival, announced
a new pricing policy. First it published a new price book, which contained
simplified and exhaustive pricing formulas for computing the price of a turbine
generator as a function of its specifications. (The old price book left a lot of
leeway for interpretation, in particular for some technical trade-offs and for the
price of spare parts.) The resulting price was then multiplied by a uniform
6.1.3 Asymmetries


Chamberlin’s suggestion that the likely market outcome is
the monopoly price raises the question of what happens if
the oligopolists have divergent preferences about prices
and, in particular, different monopoly prices. For instance,
their marginal costs may differ, so the lower-cost firms
would like to coordinate on a lower price than the higher-cost
firms (see chapter 1 for a proof that a low marginal
cost implies a low monopoly price). The firms may also
offer differentiated products (differentiated according to
quality, location, distribution channels, etc.). It is often felt
that heterogeneity in both costs and products may make
coordination on a given price difficult (see Scherer's
chapter 7).7

Under symmetric conditions, the price to coordinate
on seems to be naturally the monopoly price. This price
maximizes profit and involves a symmetric repartition of
profits.8 Under asymmetric costs, there is no "focal" price
on which to coordinate.9

The following exercise describes the trade-off between
maximization of industry profits and a "fair" distribution
of these profits when firms face different marginal costs.


Exercise 6.1*** Suppose there are two firms, with unit
costs c_1 < c_2. Let p^m(c) denote the monopoly price for
unit cost c; it maximizes (p - c)D(p). If the two firms
could get together and sign a contract, they would let
firm 1 produce everything and charge p^m(c_1). This would
maximize industry profit. The "pie" could then be divided
between the two firms through an arbitrary lump-sum
transfer from firm 1 to firm 2. But suppose that it is illegal
for the firms to overtly agree and use side payments. We
can determine the set of industry allocations that are
efficient for the two firms, constrained by the fact that
side payments are prohibited. (Exercise 6.5 involves the
question whether such "constrained efficient allocations"
can be sustained by equilibrium behavior in repeated
interaction.) To this purpose, we fix a profit target $\bar{\Pi}^2$
for firm 2 in the interval $(0, \Pi^m(c_2)]$ and look for
profit-sharing agreements (without transfers) in which
both firms charge the same price p. They choose market
shares s_1 and s_2 such that s_1 + s_2 = 1. The interpretation
of these market shares is that firm i produces exactly
q_i = s_i D(p); if s_i < 1/2, the consumers who go to firm i
and are rationed buy from firm j. Given a profit target of
$\bar{\Pi}^2$ for firm 2, the efficient allocation is a choice of price p
and market shares s_1 and s_2 so as to maximize Π^1:

\[
\max_{\{p,\,s_1,\,s_2\}} \Pi^1 = (p - c_1)\,s_1 D(p) \tag{6.1}
\]
\[
\text{s.t. } \Pi^2 = (p - c_2)\,s_2 D(p) \ge \bar{\Pi}^2 \ \text{ and } \ s_1 + s_2 = 1.
\]

Suppose that the profit function (p - c)D(p) is concave
for all p and c.

(i) After substituting s_1, obtain the first-order condition.
Show that

\[
p^m(c_1) \le p \le p^m(c_2).
\]
Show that the objective function is quasi-concave.

(ii) Show that

\[
(p - c_1)D'(p) + D(p) + \bar{\Pi}^2\,\frac{c_2 - c_1}{(p - c_2)^2} = 0.
\]

(iii) Conclude that as $\bar{\Pi}^2$ grows, both p and s_2 grow.

(iv) Show that the Pareto frontier is convex.

(v) What do you conclude from (iv)?

(vi) Show that if one now allows the firms’ prices to
differ, for the efficient-rationing rule (see chapter 5),
deterministic efficient market-sharing allocations indeed imply
identical prices for both firms.
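
A numerical illustration may help before attempting the exercise. The sketch below assumes D(p) = 1 - p, c_1 = 0, and c_2 = 0.2 (values chosen for illustration only; the function names are hypothetical) and uses a simple grid search, not the first-order condition, to find the common price that maximizes firm 1's profit for a given profit target of firm 2.

```python
def firm1_profit(p, target, c1, c2, demand):
    """Firm 1's profit at common price p when firm 2's share s2 is set so
    that firm 2 earns exactly `target`. Returns (profit, s2), or None when
    the target cannot be met with a share of at most 1."""
    pie2 = (p - c2) * demand(p)
    if pie2 <= 0 or target > pie2:
        return None
    s2 = target / pie2
    return (p - c1) * (1.0 - s2) * demand(p), s2

def efficient_allocation(target, c1, c2, demand, steps=10000):
    """Grid search over prices in (c2, 1) for the price that maximizes
    firm 1's profit given firm 2's profit target."""
    best = None
    for k in range(1, steps):
        p = c2 + (1.0 - c2) * k / steps
        result = firm1_profit(p, target, c1, c2, demand)
        if result is not None and (best is None or result[0] > best[1]):
            best = (p, result[0], result[1])
    return best  # (price, firm 1 profit, firm 2 share)

# Assumed primitives (not from the text): D(p) = 1 - p, c1 = 0, c2 = 0.2,
# so p^m(c1) = 0.5, p^m(c2) = 0.6, and firm 2's monopoly profit is 0.16.
c1, c2 = 0.0, 0.2
demand = lambda p: max(0.0, 1.0 - p)
allocations = [efficient_allocation(t, c1, c2, demand) for t in (0.04, 0.08, 0.12)]
for price, profit1, share2 in allocations:
    print(round(price, 3), round(profit1, 4), round(share2, 3))
```

In this example the efficient price stays between the two monopoly prices and rises with firm 2's target, as does firm 2's market share, which is the pattern parts (i) and (iii) ask the reader to establish in general.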


6.1.4 Other Factors 


Factors that weaken price competition in a static context 
might also facilitate collusion in a repeated-price-inter- 
action situation. In particular, decreasing returns to scale 
(or capacity constraints) make undercutting less profitable 
today. However, they also weaken the strength of future 


multiplier, just as in the case of an auto-repair shop that sets a uniform hourly
rate. (The two rivals used the same book prices, and even ended up using the
same multipliers.) Second, GE announced that it would not price-discriminate
between different buyers (including intertemporally; its contracts included a
six-month price-protection clause, which was enforced by a public accounting
firm hired by GE). For more on this, see Sultan 1975 and Porter 1983.


7. Of course, if the firms’ products are very differentiated, there is no real
competition anyway, and the problem of collusive coordination does not arise.
The informal literature is a bit confused concerning these differentiation effects.


8. The reasoning here is very loose; there is no formal argument justifying it.


9. For a discussion of Schelling’s (1960) theory of focal points, see Scherer 
1980, pp. 190-193. For a discussion of why collusion might be more complex 
with asymmetric firms, see Scherer 1980, pp. 156-160. 
retaliations, as they limit the output that firms can supply
in the market. Thus, the effect of decreasing returns to
scale is a priori ambiguous.10

Multimarket contact is generally thought to blunt the 
incentives for rivalry. Corwin Edwards puts it in the 
following way: 


[Firms that compete against each other in many markets] may
hesitate to fight vigorously because the prospects of local gain
are not worth the risk of general warfare.... A prospect of
advantage from vigorous competition in one market may be
weighed against the danger of retaliatory forays by the competitor
in other markets. (quoted in Bernheim and Whinston
1986)


However, as Bernheim and Whinston note, a firm might
also fight vigorously and simultaneously in all markets, so
that its short-term gains would be not local but general.
In section 6.3 we will review the multimarket contact
argument.

The number of firms in the industry is, of course,
thought of as affecting the possibility of collusion. Indeed,
Bain's (1954) original concern with market concentration
was based on an intuitive view that high concentration is
necessary (if not sufficient) for collusive outcomes.


6.2 Static Approaches to Dynamic Price Competition


6.2.1 Kinked Demand Curve 


As was mentioned in section 6.1, the story of the kinked
demand curve was designed to explain why oligopolists
shy away from frequent price cutting. Suppose that there
are two firms, $i = 1, 2$, with unit cost $c$. The demand
function is $q = D(p)$.

The simplest version of the kinked-demand-curve story
assigns a special role to a given price; we will call this the
"focal price," $p^f$. We can think of $p^f$ as the current market
price or as what firms believe is the steady-state (long-run)
price. Each firm has the following conjecture: If it
charges $p > p^f$, its rival will not follow suit (i.e., will keep
charging $p^f$). If, instead, a firm cuts its price to $p < p^f$,
its rival will exactly match the price cut. The residual
demand for a firm that increases its price is zero; the
residual demand for a firm that charges $p \le p^f$ is $D(p)/2$.
In summary, each firm envisions for its rival the "reaction
curve" shown in figure 6.1. The perceived-demand curve
for firm $i$ is shown in figure 6.2.

Figure 6.1
Reaction function for kinked demand curve.

Figure 6.2
Kinked demand curve.


10. Brock and Scheinkman (1985) consider a dynamic price model in which
firms are constrained by the same exogenously given capacity. The price game
at each period is similar to the capacity-constrained price games considered in
chapter 5. They justify the previous intuition and show that, for the equilibrium
they select, the collusive price is not necessarily a monotonic function of
industry capacity.


With such beliefs about its rival's reaction, firm $i$ maximizes
$(p_i - c)D(p_i)/2$ subject to $p_i \le p^f$. If we assume
that the profit function $(p - c)D(p)$ is increasing to the
left of the monopoly price $p^m$ and decreasing to its right
(i.e., that it is quasi-concave), the optimal price for firm $i$
is thus equal to $p^f$ if $p^f \le p^m$ and to $p^m$ if $p^f > p^m$.
We conclude that both firms charging any $p^f$ is an
"equilibrium" as long as $p^f$ lies between $c$ and $p^m$ and
each firm expects its rival to react as described above.
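
The constrained maximization above can be illustrated numerically. The following is a minimal sketch, assuming a linear demand $D(p) = a - p$ (an assumption made here for illustration only; the text leaves $D$ general). The function names and the numbers are hypothetical.

```python
def monopoly_price(a: float, c: float) -> float:
    """Monopoly price under the assumed linear demand D(p) = a - p."""
    return (a + c) / 2.0


def kinked_best_price(p_focal: float, a: float, c: float) -> float:
    """Best price for a firm holding the kinked-demand conjecture.

    The firm maximizes (p - c) * D(p) / 2 subject to p <= p_focal. With a
    quasi-concave profit function the answer is min(p_focal, p_m).
    """
    return min(p_focal, monopoly_price(a, c))


def brute_force_best_price(p_focal: float, a: float, c: float,
                           steps: int = 10_000) -> float:
    """Grid search over [c, p_focal], used only to cross-check the formula."""
    grid = [c + (p_focal - c) * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda p: (p - c) * (a - p) / 2.0)


if __name__ == "__main__":
    a, c = 10.0, 2.0  # illustrative parameters, not taken from the text
    for p_focal in (3.0, 6.0, 8.0):
        print(p_focal, kinked_best_price(p_focal, a, c))
```

Any focal price between $c$ and $p^m$ is returned unchanged, which is the multiplicity of "equilibria" the text goes on to criticize.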

The kinked-demand-curve story can be criticized on
many grounds. First, beyond the general issue of modeling
dynamic situations in a static framework (an issue
taken up below), one may worry about the embarrassment
of riches associated with the large number of equilibria.
Like the supergame theory described in section 6.3,
the kinked-demand-curve story is, in a sense, too successful
in explaining tacit collusion. Indeed, any price between
marginal cost and the monopoly price can be the outcome
of price competition. We have no indication of how firms
end up at a given focal price. Nevertheless, it can be
argued that $p^f = p^m$ is the logical outcome, because it is
the best focal price for both firms and thus the firms have
an incentive to coordinate on it. Second, does the focal
price change when costs change? Again, this question is
left unanswered. Some have assumed that, when feasible,
the focal price is invariant to small changes in cost. This
assumption has important consequences for economy-wide
price rigidities. Suppose that for initial unit cost $c$ the
focal price is $p^m(c)$. Assume that, at some point of time,
the cost is permanently increased to $c' > c$ (see figure
6.2). The price remains at $p^m(c)$. However, if the unit cost
becomes $c'' < c$, then $p^m(c)$ cannot be a focal price any
longer, since $p^m(c'') < p^m(c)$, and we can assume that the
price goes down to the new focal price, $p^m(c'')$. Thus,
the assumption of an invariant focal price yields upward
rigidity but not downward rigidity. Yet, according to
Scherer (1980, p. 168), prices tend to be at least as rigid
downward as they are upward in well-disciplined oligopolies.
However, one could make the equally likely assumption
that the focal price automatically adjusts to
the new monopoly price when the costs change. The
asymmetry between upward and downward price adjustment
would then disappear. The conclusion is that,
because we have little insight into how the focal price is
selected, the kinked-demand-curve story has little predictive
power. (For more criticism of the story, see Stigler
1947.) The most sensible aspects of the story may well be
the suggestions it makes about reactions and rivalry.


6.2.2 Conjectural Variations


Like the story of the kinked demand curve, the story of
conjectural variations (Bowley 1924) assumes that each
firm believes that its choice of price will affect the price
selected by its rival. The difference is that this story
generally anticipates a smoother, less asymmetric reaction.
The analogy is that, for instance, firm 1 believes that
firm 2 will react according to $R_2(p_1)$ and maximizes
$\Pi^1(p_1, R_2(p_1))$ over $p_1$.¹¹ As with the choice of a focal
price in the kinked-demand-curve story, we have little
insight into the choice of the conjectural variation $R_2$.


6.2.3 Discussion


The modeling of dynamic features in a static framework
is, a priori, very appealing. As we will see, dynamic
price competition is complex, and for many applications
one would like it to be subsumed in some kind
of "reduced-form" static competition. This is precisely
what the kinked-demand-curve and conjectural-variations
approaches attempt to do. However, this methodology
suffers from one major drawback: A static game is, by
definition, a game in which each firm's choice is independent
of its rivals' choices. By the very timing and information
structure of the game, firms cannot react to one another.
Thus, any conjecture about one's opponents' reaction that
differs from no reaction is irrational. We conclude that
this methodology is not theoretically satisfactory, as it
does not subject itself to the discipline imposed by game
theory.¹²


11. The profit function is nondifferentiable (when prices are equal) for perfect
substitutes. If $\Pi^1$ were differentiable (see the case of differentiated commodities
in chapter 7) as well as $R_2$, one could write the first-order condition

\[ \Pi^1_1(p_1, R_2(p_1)) + \Pi^1_2(p_1, R_2(p_1))\, R_2'(p_1) = 0, \]

where $\Pi^1_j$ denotes the partial derivative of $\Pi^1$ with respect to price $p_j$.

For a homogeneous product, use is generally made of "Cournot competition."
For instance, with two firms, firm 1 perceives that firm 2 reacts to output
$q_1$ by producing $q_2 = R_2(q_1)$ (and conversely). On the assumption that $R_2$ is
differentiable, the maximization of

\[ P(q_1 + R_2(q_1))\, q_1 - C_1(q_1) \]

yields

\[ P - C_1' + q_1 P'(1 + R_2') = 0. \]

Cournot competition corresponds to "zero conjectural variations": $R_2' = 0$.
The competitive solution is obtained for a negative conjectural variation,
$R_2' = -1$; that is, each firm perceives that any increase in its output is exactly
offset by a decrease in the rival's output, so total output (and, therefore, price)
is perceived as exogenous. The reader may check that the collusive (industry-
profit-maximizing) outcome is obtained for positive conjectural variations.
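
The first-order condition in footnote 11 for the homogeneous-product case can be checked in a simple example. The sketch below assumes a symmetric duopoly with linear inverse demand $P(Q) = a - Q$ and constant unit cost $c$; these functional forms, the function name, and the parameter `v` (standing for the conjectural variation $R_2'$) are illustrative assumptions, not part of the original text.

```python
def conjectural_variation_outcome(a: float, c: float, v: float):
    """Symmetric outcome for P(Q) = a - Q, unit cost c, conjecture v = R'.

    The first-order condition P - c + q * P' * (1 + v) = 0 with Q = 2q and
    P' = -1 gives (a - 2q - c) - q * (1 + v) = 0, i.e. q = (a - c) / (3 + v).
    """
    if v <= -3.0:
        raise ValueError("v must exceed -3 for a positive output")
    q = (a - c) / (3.0 + v)   # per-firm output
    price = a - 2.0 * q       # market price at total output 2q
    return q, price


if __name__ == "__main__":
    a, c = 10.0, 2.0  # illustrative numbers
    for label, v in (("competitive", -1.0), ("Cournot", 0.0), ("collusive", 1.0)):
        q, price = conjectural_variation_outcome(a, c, v)
        print(f"{label:12s} v={v:+.0f}  q={q:.3f}  price={price:.3f}")
```

In this particular example, $v = -1$ reproduces price equal to marginal cost, $v = 0$ the Cournot price, and $v = 1$ the monopoly price $(a + c)/2$, in line with the footnote's remark that collusion corresponds to a positive conjectural variation.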

Now, it may be that some conjectured reactions in the
static model yield the same outcome(s) as a full-fledged
dynamic price game. However, to know this, one must
have studied the dynamic game. One must also have
checked that the two approaches yield the same response
to exogenous demand and cost shocks (which is unlikely,
because the static approach cannot describe the adjustment
path following a shock). Absent such verifications, it
is not clear what is achieved by the static approaches.


We now take up the two-firm model introduced in section
5.1. The two firms produce perfect substitutes with
the same marginal cost, $c$. The lower-price firm gets the
whole market, and the firms share the market when they
charge the same price. The only difference here is that we
replicate the basic Bertrand game $T + 1$ times, where $T$
can be finite or infinite. The game is then called a repeated
game, or a supergame. Let $\Pi^i(p_{it}, p_{jt})$ be firm $i$'s profit at
date $t$ ($t = 0, \ldots, T$) when it charges $p_{it}$ and its rival
charges $p_{jt}$. Each firm seeks to maximize the present
discounted value of its profits; that is,

\[ \sum_{t=0}^{T} \delta^t\, \Pi^i(p_{it}, p_{jt}), \]

where $\delta$ is the discount factor ($\delta = e^{-r\tau}$, where $r$ is
the instantaneous rate of interest and $\tau$ is the real time
between "periods"). A value of $\delta$ close to 1 represents low
impatience, or rapid price changes.

At each date $t$, the firms choose their prices $(p_{1t}, p_{2t})$
simultaneously. There is no physical link between the
periods; the previous choice of price by one's rival is
already obsolete when one chooses a price. Nevertheless,
we will allow price choices at date $t$ to depend on the
history of previous prices. Thus, the price strategy $p_{it}$
depends on the history

\[ H_t \equiv (p_{10}, p_{20}; \ldots; p_{1,t-1}, p_{2,t-1}). \]

We require strategies to form a "perfect equilibrium" (see
the Game Theory User's Manual). That is, for any history
$H_t$ at date $t$, firm $i$'s strategy from date $t$ on maximizes the
present discounted value of profits given firm $j$'s strategy
from that date on.

First we assume that the horizon is finite: $T < +\infty$.
What is the equilibrium of the dynamic price game? As is
explained in the Game Theory User's Manual, we need to
proceed by "backward induction" to obtain the perfect
equilibrium. First, let us ask how, given the history $H_T$
of the game, the firms choose prices in the last period
$T$. Because the past prices do not affect the profits in
period $T$, each firm ought to maximize its "static profit"
$\Pi^i(p_{iT}, p_{jT})$, given its rival's price. Hence, the equilibrium
is, for any history, the Bertrand one:

\[ p_{1T} = p_{2T} = c. \]

What will the equilibrium prices in period $T - 1$ be? Since
price choices at $T$ do not depend on what happens at
$T - 1$, everything occurs as if $T - 1$ was the last period.
Therefore, the firms also choose the competitive price at
$T - 1$, regardless of the history up to this period: For any
$H_{T-1}$,

\[ p_{1,T-1} = p_{2,T-1} = c. \]

And so forth by backward induction. The outcome of the
$(T + 1)$-period price game is the Bertrand solution repeated
$T + 1$ times. Therefore, the dynamic element contributes
nothing to the model.

12. However, the conjectural-variations approach may have been a useful way
of empirically estimating the degree of noncompetitiveness in an industry;
see, e.g., Appelbaum 1982; Bresnahan 1981, 1987a; Iwata 1974; Sumner 1981.
(The empirical definition of conjectural variations does not completely coincide
with the theoretical one. See Bresnahan 1987b.) Although it is hoped that the
testing of full-fledged dynamic models will develop, it must be acknowledged
that such models are complex, and that little attention has been paid to their
testable implications.


The picture changes dramatically when the horizon is
infinite ($T = +\infty$). On the one hand, it is easy to verify
that the Bertrand equilibrium repeated infinitely is an
equilibrium of this game. To see this, consider the following
strategy: Each firm chooses a price equal to the marginal
cost in each period $t$, regardless of the history of the
game up to $t$. Given that the rival firm charges a price
equal to $c$ in this manner, each firm can do no better than
to charge $c$ itself. On the other hand, the interesting
feature of this game is that the repeated Bertrand equilibrium
is no longer the only equilibrium. Let $p^m$ denote the
monopoly price (it maximizes $(p - c)D(p)$) and consider
the following (symmetric) strategies: Each firm charges $p^m$
in period 0. It furthermore charges $p^m$ in period $t$ if in
every period preceding $t$ both firms have charged $p^m$;
otherwise it sets its price at marginal cost $c$ forever.¹³
These strategies are called trigger strategies, because a
single deviation triggers a halt in the cooperation. They
constitute an equilibrium if the discount factor is sufficiently
high: In charging $p^m$, a firm earns half the monopoly
profit in each period. By deviating from this price, a
firm can earn maximum profit, $\Pi^m$, during the period
of deviation (indeed it can earn approximately $\Pi^m$ by
slightly undercutting price $p^m$), but then it receives zero
forever more. Therefore, if

\[ \frac{\Pi^m}{2}\,(1 + \delta + \delta^2 + \cdots) \ge \Pi^m, \]

which holds if $\delta \ge 1/2$, then these trigger strategies are
equilibrium ones.
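
The comparison behind the threshold $\delta \ge 1/2$ can be sketched in a few lines. The function names below are hypothetical, and the deviation payoff is taken at its upper bound $\Pi^m$ (the text notes that the deviator earns approximately this amount).

```python
def collusive_value(pi_m: float, delta: float) -> float:
    """Present value of earning pi_m / 2 in every period, starting today."""
    return (pi_m / 2.0) / (1.0 - delta)


def deviation_value(pi_m: float) -> float:
    """Upper bound on the deviator's payoff: pi_m once, then zero forever."""
    return pi_m


def trigger_is_equilibrium(pi_m: float, delta: float) -> bool:
    """True when conforming to p^m is at least as good as undercutting."""
    return collusive_value(pi_m, delta) >= deviation_value(pi_m)


if __name__ == "__main__":
    pi_m = 16.0  # illustrative monopoly profit
    for delta in (0.4, 0.5, 0.6):
        print(delta, trigger_is_equilibrium(pi_m, delta))
```

The monopoly profit cancels from both sides, so the answer depends on $\delta$ alone.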

This result is a formalization of tacit collusion. If a firm
undercuts the monopoly price, it gains during the period
of deviation but it destroys collusion in the later periods:
the firms revert to the "grim strategy" (i.e., they play
competitively forever, which we know is an equilibrium).
Note that collusion is enforced through a purely
noncooperative mechanism.

There are many other equilibria in this game. The previous
reasoning actually implies that any price between
the competitive price and the monopoly price can be
sustained as a (time-invariant) equilibrium price as long as
the discount factor is greater than 1/2 (which implies that
any symmetric per-period profit between 0 and $\Pi^m/2$ can
be an equilibrium profit). Let $p$ belong to $[c, p^m]$, and
let each firm charge price $p$ as long as neither has yet
deviated from that price. If either firm has deviated in the
past, both charge the competitive price forever. Again,
these strategies are equilibrium strategies. By conforming
to $p$, each firm gets

\[ \frac{\Pi(p)}{2}\,(1 + \delta + \delta^2 + \cdots). \]

If a firm deviates, it gets at most $\Pi(p)$ during the deviation
period (because its rival charges $p$). Thus, it gains at
most $\Pi(p)/2$ during that period, and it loses half of the
profit at price $p$ forever after:

\[ \frac{\Pi(p)}{2}\,(\delta + \delta^2 + \cdots) = \frac{\Pi(p)\,\delta}{2(1 - \delta)}. \]

So if $\delta \ge 1 - \delta$ (that is, $\delta \ge 1/2$), deviating from price
$p$ is not privately optimal.

The preceding result is one facet of a general result,
known as the Folk theorem. For the repeated price game
under consideration, the Folk theorem asserts that any
pair of profits $(\Pi^1, \Pi^2)$ such that

\[ \Pi^1 > 0, \quad \Pi^2 > 0, \quad \text{and} \quad \Pi^1 + \Pi^2 \le \Pi^m \]

is a per-period equilibrium payoff for $\delta$ sufficiently close to
1. That is, there exist strategies

\[ \{p_{1t}(H_t),\; p_{2t}(H_t)\} \]

that form a perfect equilibrium such that, for all $i$, firm $i$'s
associated per-period payoff,

\[ (1 - \delta) \sum_{t=0}^{\infty} \delta^t\, \Pi^i(p_{it}, p_{jt}), \]

is equal to $\Pi^i$.¹⁴ This is depicted in figure 6.3.

Figure 6.3
The Folk theorem for the repeated price game. (Region labeled in the figure: attainable per-period payoffs.)


13. Formally: $p_{it}(H_t) = p^m$ if $H_t = (p^m, p^m; \ldots; p^m, p^m)$, and
$p_{it}(H_t) = c$ otherwise.


14. Multiplying the intertemporal payoff by $1 - \delta$ amounts to normalizing it
to a per-period equivalent. Notice in particular that if $\Pi^i(p_{it}, p_{jt})$ is independent
of time and equal to $\Pi^i$, then

\[ (1 - \delta) \sum_{t=0}^{\infty} \delta^t\, \Pi^i(p_{it}, p_{jt}) = (1 - \delta)\,\Pi^i\,(1 + \delta + \delta^2 + \cdots) = \Pi^i. \]


Exercise 6.2* Show that any payoff $(\Pi^1, \Pi^2)$ such that
$\Pi^1 > 0$, $\Pi^2 > 0$, and $\Pi^1 + \Pi^2 \le \Pi^m$ is an equilibrium
payoff for $\delta$ close to 1.


Exercise 6.3*** Show that for $\delta < 1/2$ the only equilibrium
profit is the competitive (zero) profit. Restrict yourself to
pure strategies. Hint: Consider the highest (supremum)
profit that is attainable by a firm in a perfect
equilibrium.


Thus when $\delta$ is close to 1, everything is an equilibrium
("everything" because aggregate profit cannot exceed $\Pi^m$,
and because equilibrium profits cannot be negative, since a
firm can always guarantee itself non-negative profits by
charging prices above marginal cost or by exiting the
market).

The Folk theorem demonstrated here was proved by 
Friedman (1971, 1977). More general versions of this 
theorem have been provided by Aumann and Shapley 
(1976), Rubinstein (1979), and Fudenberg and Maskin
(1986). See also subsection 6.7.3 below. 


Remark  The easiest way to enforce a given price (and,
possibly, given market shares) is to punish deviations as
harshly as possible. In the context of a price game with
perfect substitutes, maximal punishments take a simple
form: They correspond to the competitive Bertrand (static
and dynamic) equilibrium, in which all firms make no
profit. (There do not exist harsher punishments, because a
firm can always exit the market, or, equivalently, charge a
very high price, and guarantee itself a zero profit after it
has deviated.) Of course, such a punishment is also harsh
to its enforcers, but along the equilibrium path it has no
cost because it is not observed (deviations do not occur).
Thus, to see whether a given behavior is indeed sustainable
in equilibrium, it suffices to assume that any deviation
leads to an eternal reversion to Bertrand behavior.
This reasoning is part of a more general principle (see
Abreu 1983, 1986)¹⁵; as we will see in subsection 6.3.3,
it depends crucially on all price choices being perfectly
observable.


The supergame theory is, in a sense, too successful in
explaining tacit collusion. The large set of equilibria is an
embarrassment of riches. Somehow the firms must coordinate
on a "focal equilibrium" in order for the equilibrium
concept to remain attractive. How is this equilibrium
chosen? A selection process often used in the literature
makes the assumption that in a symmetric game the focal
equilibrium is symmetric and the assumption that the
focal equilibrium must be Pareto optimal from the viewpoint
of the two firms (i.e., must yield a payoff on the
frontier of the attainable set of per-period profits). In the
previous example, these assumptions clearly select equilibrium
strategies that yield per-period profits $\Pi^1 = \Pi^2 = \Pi^m/2$
when $\delta \ge 1/2$ (enforced, e.g., by the strategy of
charging $p^m$ as long as every firm has charged $p^m$ earlier,
and charging $c$ in case of a deviation).


6.3.2 Applications


Let us now apply the theory to obtain a formalization of
some aspects of the conventional wisdom reviewed in section
6.1. (Because the supergame framework is technically
the easiest to work with, we discuss the conventional
wisdom here.) We will content ourselves with illustrative
examples; the theories have generally been developed in
a more general context than is presumed here.¹⁶


15. [...] maximal punishment is clearly optimal when deviations are perfectly
[...] oligopoly supergames (1986), and unveils a simple stick-and-carrot
[...]onal nature of these punishments (ibid.).


16. The reader who has little familiarity with supergame theory may want to
[...] applications in a first reading. The advanced reader may study further
applications of the supergame methodology: e.g., Mookherjee and Ray 1986
(on repeated games under learning by doing or increasing returns to scale),
Rotemberg and Saloner 1985a (on strategic inventories), Rotemberg and
Saloner 1985b (on price leadership), and Slade 1985 (on price wars as an
information-gathering device when industry demand is subject to periodic and
unobserved random shocks).


6.3.2.1 Application 1: Market Concentration


As was mentioned earlier, Bain's (1956) original concern
with market concentration was based on an intuitive relationship
between high concentration and collusion. There
is a sense in which tacit collusion is easier to sustain with
a smaller number of firms: Consider a homogeneous-good
industry with $n$ firms facing the same constant marginal
cost, and look at the fully collusive outcome, in which all
firms charge the monopoly price and share the market
equally. The per-period and per-firm profit is $\Pi^m/n$, a
decreasing function of $n$. A large number of firms reduces
the profit per firm and thus the cost of being punished for
undercutting. In contrast, the short-run gain from undercutting
the monopoly price slightly is

\[ \Pi^m (1 - 1/n) - \varepsilon, \]

and thus increases with $n$. The discount factor must exceed
$1 - 1/n$ for collusion to be sustainable; in this sense,
market concentration facilitates tacit collusion.
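
The threshold $1 - 1/n$ follows from comparing the collusive stream $(\Pi^m/n)/(1 - \delta)$ with the one-shot deviation payoff $\Pi^m$. A minimal sketch, with hypothetical function names and the monopoly profit normalized to 1 by default:

```python
def critical_discount_factor(n: int) -> float:
    """Smallest delta at which full collusion among n firms is sustainable."""
    if n < 2:
        raise ValueError("need at least two firms")
    return 1.0 - 1.0 / n


def full_collusion_sustainable(n: int, delta: float, pi_m: float = 1.0) -> bool:
    """Compare conforming (pi_m / n forever) with undercutting (pi_m once)."""
    conform = (pi_m / n) / (1.0 - delta)
    deviate = pi_m  # upper bound on the deviator's one-period profit
    return conform >= deviate


if __name__ == "__main__":
    for n in (2, 3, 5, 10):
        print(n, critical_discount_factor(n))
```

The critical discount factor rises toward 1 as $n$ grows, which is the sense in which concentration facilitates collusion in this model.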


6.3.2.2 Application 2: Long Information Lags and
Infrequent Interaction


The threat of a punishment operates only if the punishment
comes fairly soon after a price cut. Punishment
might be delayed for two related reasons. First, a firm's
price cut may be learned of by its rival only with a
lag. This may happen when manufacturers contract with
a few big buyers (wholesalers or downstream manufacturers).
The secrecy of contracts may then be an obstacle
to collusion. Indeed, if price cuts were never detected,
collusion could not be sustained. Second, infrequent interaction
(due to a lumpiness in orders, for instance) delays
the punishment and makes current price cutting more
attractive.

The second reason is straightforwardly formalized in
the supergame framework. A more infrequent interaction
corresponds to a decrease in $\delta$. But we know that if $\delta$ lies
between 0 and 1/2 no collusion is sustainable, and that if $\delta$
exceeds 1/2 any outcome is possible, including collusive
ones (this supergame vindication of infrequent interaction
as a cause of breakdown of collusion is weak because
the theory only predicts that collusion can occur in
equilibrium).

The first reason is harder to formalize, unless one is
willing to make the strong assumption that profits and
demands are observed with a lag as well. For instance,
consider the duopoly model, and assume that prices are
observed two periods (instead of one) after they are
chosen. Assume further that a firm's profit and demand in
a given period are observed by this firm at least two
periods later, so that a firm cannot infer from the observation
of its past profits and demands anything it does not
know about its rival's pricing behavior. In such a situation,
a firm can deviate and cut its price for two periods
before being detected. The monopoly price (or any other
price) is sustainable in equilibrium if and only if

\[ \frac{\Pi^m}{2}\,(1 + \delta + \delta^2 + \cdots) \ge \Pi^m (1 + \delta), \]

or

\[ \delta \ge \frac{1}{\sqrt{2}}. \]

Thus, the condition is more stringent than the previous
one ($\delta \ge 1/2$), as $1/\sqrt{2} > 1/2$. In this sense, information lags
are also a cause of breakdown of collusion. However,
the assumption that profit and (especially) demand are
observed with a lag is strong. In subsections 6.3.3 and
6.7.1 we make the polar assumption that prices are completely
secret (never revealed to a firm's rival), but that a
firm learns its demand and its profit immediately (i.e., one
period after the price choices).
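
The two-period-lag condition can be checked numerically. The sketch below also treats a general detection lag of $k$ periods, which is an extrapolation made here for illustration (the text covers only $k = 1$ and $k = 2$): a deviator earns $\Pi^m$ for $k$ periods before punishment starts, which gives the condition $\delta^k \ge 1/2$. Function names are hypothetical.

```python
def critical_delta_with_lag(k: int) -> float:
    """Smallest delta sustaining collusion when deviations are seen after k periods.

    Derived from (pi_m / 2) / (1 - delta) >= pi_m * (1 - delta**k) / (1 - delta),
    which simplifies to delta**k >= 1/2.
    """
    if k < 1:
        raise ValueError("lag must be at least one period")
    return 0.5 ** (1.0 / k)


def sustainable_with_lag(delta: float, k: int, pi_m: float = 1.0) -> bool:
    """Direct comparison of the collusive stream with a k-period deviation."""
    conform = (pi_m / 2.0) / (1.0 - delta)
    deviate = pi_m * sum(delta ** t for t in range(k))
    return conform >= deviate


if __name__ == "__main__":
    for k in (1, 2, 3):
        print(k, round(critical_delta_with_lag(k), 4))
```

With $k = 1$ the threshold is 1/2 and with $k = 2$ it is $1/\sqrt{2} \approx 0.707$, matching the two cases in the text.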


6.3.2.3 Application 3: Fluctuating Demand 


Let us now consider Rotemberg and Saloner's (1986)
theory of price wars during booms. Suppose that demand
is stochastic. At each period $t$, it can be low ($q = D_1(p)$)
with probability 1/2, or high ($q = D_2(p)$) with probability
1/2. Assume that $D_2(p) > D_1(p)$ for all $p$. To simplify,
assume that the demand shock is identically and independently
distributed over time. At each period the two firms
learn the current state of demand before choosing their
prices simultaneously.

We look for a pair of prices $\{p_1, p_2\}$ such that (a) both
firms charge price $p_s$ when the state of demand is $s$, (b) the
price configuration $\{p_1, p_2\}$ is sustainable in equilibrium
(i.e., there exists an equilibrium in which deviating from $p_s$
in state $s$ is not privately optimal), and (c) the expected
present discounted profit of each firm along the equilibrium
path

\[
\begin{aligned}
V &= \sum_{t=0}^{\infty} \delta^t \left( \frac{1}{2}\,\frac{D_1(p_1)}{2}\,(p_1 - c) + \frac{1}{2}\,\frac{D_2(p_2)}{2}\,(p_2 - c) \right) \\
  &= \frac{\dfrac{1}{2}\,\dfrac{D_1(p_1)}{2}\,(p_1 - c) + \dfrac{1}{2}\,\dfrac{D_2(p_2)}{2}\,(p_2 - c)}{1 - \delta} \qquad (6.1)
\end{aligned}
\]

is not Pareto dominated by other equilibrium payoffs (i.e.,
one cannot find an equilibrium that both firms would
prefer).

We know from the maximal-punishment principle (see
the remark in subsection 6.3.1) that in order to enforce the
pair of prices $\{p_1, p_2\}$ one can assume that after a deviation
the two firms charge their competitive price $c$ forever
(and thus make no profit).

We first examine whether the "fully collusive outcome"
is sustainable in equilibrium. By "fully collusive outcome"
we mean that the two firms charge the monopoly price
$p_s^m$ in each state of demand $s$ (where $p_s^m$ maximizes
$\Pi_s(p) \equiv (p - c)D_s(p)$). We let

\[ \Pi_s^m \equiv (p_s^m - c)\,D_s(p_s^m) \]

denote the monopoly profit in state $s$. If the monopoly
profit can always be sustained, then

\[ V = \frac{\Pi_1^m + \Pi_2^m}{4(1 - \delta)}. \]

From the maximal-punishment principle, the future loss
from deviating at some date, discounted at that date, is
$\delta V$. Undercutting $p_s^m$ slightly in state of demand $s$ yields
an extra gain of almost

\[ \Pi_s^m - \frac{\Pi_s^m}{2} = \frac{\Pi_s^m}{2} \]

to the firm that deviates. Thus, for $p_s^m$ to be sustainable for
all $s$, one must have

\[ \frac{\Pi_s^m}{2} \le \delta V \qquad (6.2) \]

for all $s$. However, because $\Pi_1^m < \Pi_2^m$, this condition is
satisfied if and only if

\[ \frac{\Pi_2^m}{2} \le \delta V, \qquad (6.3) \]

or, substituting for $V$,

\[ \delta \ge \delta_0 \equiv \frac{2\Pi_2^m}{3\Pi_2^m + \Pi_1^m}. \qquad (6.4) \]

Because $\Pi_2^m > \Pi_1^m$, $\delta_0$ lies strictly between 1/2 and 2/3.
This result already offers some insight. When demand
is high, the temptation to undercut is important. The
punishment entails the loss of an average of high and low
profit and is therefore less severe than it would be if the
high demand were to persist with certainty in the future.
Thus, when $\delta$ lies between 1/2 and $\delta_0$, full collusion cannot
be sustained in the high-demand state, contrary to the
case of a deterministic demand.
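
The threshold in 6.4 can be evaluated for a concrete specification. The sketch below assumes linear demands $D_s(p) = a_s - p$ with $a_2 > a_1$, so that $\Pi_s^m = (a_s - c)^2/4$; the demand form, function names, and numbers are illustrative assumptions rather than part of the original model.

```python
def monopoly_profit(a: float, c: float) -> float:
    """Monopoly profit under the assumed linear demand D(p) = a - p."""
    return (a - c) ** 2 / 4.0


def delta_zero(pi1_m: float, pi2_m: float) -> float:
    """Threshold discount factor of condition 6.4."""
    return 2.0 * pi2_m / (3.0 * pi2_m + pi1_m)


if __name__ == "__main__":
    c = 2.0
    pi1_m = monopoly_profit(6.0, c)    # low-demand state
    pi2_m = monopoly_profit(10.0, c)   # high-demand state
    print(pi1_m, pi2_m, delta_zero(pi1_m, pi2_m))
```

As the two states become identical the threshold falls back to 1/2, and as the low-state profit vanishes it approaches 2/3, consistent with the bounds stated in the text.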

The interesting case is that of a discount factor in the
interval $[1/2, \delta_0)$. We must choose $p_1$ and $p_2$ so as to maximize
the firms' expected payoff subject to the incentive
(no undercutting) constraints:

\[ \max_{\{p_1, p_2\}} \; \frac{\dfrac{1}{2}\,\dfrac{\Pi_1(p_1)}{2} + \dfrac{1}{2}\,\dfrac{\Pi_2(p_2)}{2}}{1 - \delta} \qquad (6.5) \]

subject to

\[ \frac{\Pi_1(p_1)}{2} \le \delta \left( \frac{\dfrac{1}{2}\,\dfrac{\Pi_1(p_1)}{2} + \dfrac{1}{2}\,\dfrac{\Pi_2(p_2)}{2}}{1 - \delta} \right) \qquad (6.6) \]

\[ \frac{\Pi_2(p_2)}{2} \le \delta \left( \frac{\dfrac{1}{2}\,\dfrac{\Pi_1(p_1)}{2} + \dfrac{1}{2}\,\dfrac{\Pi_2(p_2)}{2}}{1 - \delta} \right). \qquad (6.7) \]

Intuitively, the binding constraint should be 6.7, because
the temptation to undercut is higher when demand is
high. Indeed, it can easily be shown that this is the case:
Note that the program is equivalent to

\[ \max \{\Pi_1(p_1) + \Pi_2(p_2)\} \qquad (6.5') \]

subject to

\[ \Pi_1(p_1) \le K\,\Pi_2(p_2) \qquad (6.6') \]

and to

\[ \Pi_2(p_2) \le K\,\Pi_1(p_1), \qquad (6.7') \]

where

\[ K \equiv \delta/(2 - 3\delta) \ge 1. \]

Ignore 6.6' and maximize 6.5' subject to 6.7'. Clearly,
choosing $p_1 = p_1^m$ increases the objective function and
relaxes constraint 6.7' as much as possible. And $p_2$ is
then chosen under $p_2^m$ so that $\Pi_2(p_2) = K\,\Pi_1(p_1^m) = K\,\Pi_1^m$.¹⁷


17. Such a $p_2$ exists because, by the definition of $\delta_0$,

\[ K\,\Pi_1^m = \delta\,\Pi_1^m/(2 - 3\delta) < \Pi_2^m \]

for $\delta < \delta_0$. We must choose $p_2 < p_2^m$ rather than the other root of the equation
$\Pi_2(p_2) = K\,\Pi_1(p_1^m)$ because otherwise a firm could undercut to $p_2^m$ in the high
state of demand. Note also that the ignored constraint 6.6' is satisfied, as
$K\,\Pi_2(p_2) = K^2\,\Pi_1^m \ge \Pi_1^m$ from $K \ge 1$.

18. Maximizing 6.5' with respect to 6.6', ignoring 6.7', would similarly yield
$p_2 = p_2^m$ and $\Pi_1(p_1) = K\,\Pi_2^m$. However, because $K \ge 1$ and $\Pi_1^m < \Pi_2^m$, no such
$p_1$ exists, so this cannot be the solution.


We thus conclude the following: For $\delta$ in $[1/2, \delta_0)$, some
collusion is sustainable. In the low state of demand, firms
charge the monopoly price in that state: $p_1 = p_1^m$. In the
high state of demand, firms charge below the monopoly
price in that state: $p_2 < p_2^m$. ($p_2$ can be higher or lower
than $p_1$, depending on the demand function.) Rotemberg
and Saloner interpret this as demonstrating the existence
of a price war during booms, i.e., as representing a situation
in which firms are forced to lower the amount of [...]

[...] Rotemberg-Saloner analysis seems plausible. At
[...] (industry) level, cartels tend to break down
[...] big order arrives. For instance, Scherer (1980,
[...] cement, a good produced by an oligopolistic industry,
tends to move countercyclically, which is consistent with
(but, as observed, not necessarily predicted by) their
theory. See also their discussion of R. Porter's (1983a)
study of the railroad cartel on the Chicago-New York
route in the 1880s and Bresnahan's (1981) analysis of the
American automobile industry in the mid-1950s.
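
The constrained solution for $\delta$ in $[1/2, \delta_0)$ can be computed explicitly in an example. The sketch below again assumes linear demands $D_s(p) = a_s - p$ (an illustrative assumption, as are the function name and the numbers), sets $p_1 = p_1^m$, and picks the lower root of $\Pi_2(p_2) = K\,\Pi_1^m$.

```python
import math


def rotemberg_saloner_prices(a1: float, a2: float, c: float, delta: float):
    """Return (p1, p2, K) for assumed demands D_s(p) = a_s - p, with a2 > a1 > c."""
    pi1_m = (a1 - c) ** 2 / 4.0
    pi2_m = (a2 - c) ** 2 / 4.0
    delta_0 = 2.0 * pi2_m / (3.0 * pi2_m + pi1_m)
    if not 0.5 <= delta < delta_0:
        raise ValueError("delta must lie in [1/2, delta_0)")
    k = delta / (2.0 - 3.0 * delta)
    p1 = (a1 + c) / 2.0            # monopoly price in the low state
    target = k * pi1_m             # profit allowed in the high state
    # Lower root of (p - c) * (a2 - p) = target, so that p2 < p2_m.
    p2 = ((a2 + c) - math.sqrt((a2 - c) ** 2 - 4.0 * target)) / 2.0
    return p1, p2, k


if __name__ == "__main__":
    p1, p2, k = rotemberg_saloner_prices(a1=6.0, a2=10.0, c=2.0, delta=0.55)
    print(p1, p2, k)
```

For these particular numbers the high-state price comes out below the low-state price, but as the text notes, the ranking of $p_1$ and $p_2$ depends on the demand function.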




Exercise 6.4* Consider an $n$-firm supergame framework.
The firms have constant marginal cost $c$. The demand
function at date $t$ is $q_t = \mu^t D(p_t)$, where $\mu\delta < 1$ ($\delta$ is the
discount factor).²⁰ Derive the set of discount factors such
that full collusion (i.e., the monopoly solution) is sustainable
as an equilibrium of the supergame. What would this
model predict about the relative ease of sustaining collusion
in expanding and declining industries?


6.3.2.4 Application 4: Cost Asymmetries 


In section 6.1 we saw that two firms with unit costs
$c_1 < c_2$ have conflicting preferences on the price to be
charged. Efficient market-sharing arrangements may, for
instance, call for firm 2 to supply less than the demand
it faces in exchange for firm 1's charging above its monopoly
price, or they may even involve the two firms
taking turns supplying the market. The following exercise
illustrates that a variety of market-sharing patterns can be
sustained in a supergame when the firms are not too
impatient.


Exercise 6.5** Consider the deterministic efficient market-
sharing allocations derived in exercise 6.1. Let $\{p^*, s_1^*\}$
denote an efficient market-sharing allocation, and let

\[ \Pi^1 \equiv s_1^*\, D(p^*)(p^* - c_1) \]

and

\[ \Pi^2 \equiv (1 - s_1^*)\, D(p^*)(p^* - c_2) \]

denote the corresponding per-period profits. Consider the
following strategies: Each firm $i$ charges $p^*$ and produces
$s_i^* D(p^*)$ as long as both have complied with this rule
earlier. If anyone has deviated in the past, both firms
revert to Bertrand behavior forever.²¹ Determine the set
of efficient market-sharing allocations that can be enforced
in this manner.


One issue that was alluded to earlier is the choice of an
equilibrium allocation. Even if one accepts the idea of
selecting a Pareto-optimal allocation, one has little clue as
to how to choose $p^*$ (or $s_1^*$) in the feasible range. In
the symmetric case, one could invoke the idea that a
symmetric equilibrium is focal; but whatever worth this
argument may have, it is not applicable to rivalry under
asymmetric costs. The difficulty of picking a focal equilibrium
has been perceived by some as hindering tacit
collusion.


19. We will identify another factor leading to price wars between booms
[...] stemming [...] price rigidities. On the other hand, price wars during recessions
are likely to occur if the capital market is imperfect (see the long-purse story in
chapter 9).


20. This is the same as the situation discussed by Rotemberg and Saloner,
except for two things: Shocks are perfectly anticipated, and there is a trend (the
market grows if $\mu > 1$ and shrinks if $\mu < 1$).


21. Under asymmetric payoffs, the reversion to Bertrand behavior is not the
maximal punishment for the low-cost firm. The worst equilibrium outcome is a
zero profit. See section 6.7. The qualitative analysis would not be affected by
this consideration.


6.3.2.4 Application 5: Multimarket Contact 


To unHerstand why multimarket contact may help, recall 
the inkentive constraint for a constant price (e.g., the 


monopoly price) to be sustained in a single market: 
nm mm 
meen fans sey, 
z Po (L+6 +>"), 
or 
6 
1< 


‘t 
As bal been noted, 5 = 4 is sufficient for full collusion to 
be sustainable. A higher 6 is “overkill”; the punishment is 
then more than sufficient to keep market discipline. 

Now suppose that there are two identical and independent
markets and that both firms participate in both
markets. Assume further that market 1 “meets more
frequently” than market 2, i.e., either orders arrive more
rapidly or the information lag is shorter. To be concise, let
us assume that market 1 meets every period and market 2
meets every even period. If the discount factor between
periods is $\delta$, the implicit discount factor for market 2 is
equal to $\delta^2$. Suppose that $\delta^2 < 1/2 < \delta$. Then we know
from subsection 6.3.1 that, in the absence of multimarket
contact, collusion is sustainable in market 1 but not in
market 2. On the other hand, with multimarket contact,
full collusion on both markets is sustainable if

$$2 \cdot \frac{\Pi^m}{2} \leq \frac{\Pi^m}{2}\,(\delta + \delta^2 + \delta^3 + \cdots) + \frac{\Pi^m}{2}\,(\delta^2 + \delta^4 + \cdots), \tag{6.8}$$

or

$$0 \leq 4\delta^2 + \delta - 2 \quad (\text{or } \delta \geq 0.593). \tag{6.9}$$


Let us derive 6.8. The highest temptation to undercut
occurs every even period, when both markets are open.
A firm that undercuts might as well undercut on both
markets simultaneously, given that it will be punished in
both markets for its deviation anyway. The gain from
undercutting the monopoly price is $\Pi^m/2$ per market and,
hence, $\Pi^m$ overall. The right-hand side of 6.8 represents
the future loss of collusive profits on both markets
associated with the reversion to Bertrand behavior (the maximal
punishment).

So, for instance, for $\delta = 0.6$, full collusion on both
markets can be sustained under multimarket contact, whereas
no amount of collusion is sustainable in market 2 under
single-market contact.
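
The numbers in this example can be checked with a few lines of code. The sketch below is only an illustration under stated assumptions: the monopoly profit is normalized to 1, punishment is permanent reversion to Bertrand pricing, and the function names are hypothetical labels rather than notation from the text.

```python
import math


def single_market_ok(delta_eff: float) -> bool:
    """One market in isolation: the deviation gain (1/2) must not exceed
    the discounted loss (1/2) * d / (1 - d), where d is the discount
    factor between two consecutive meetings of that market."""
    return 1.0 <= delta_eff / (1.0 - delta_eff)


def multimarket_ok(delta: float) -> bool:
    """Pooled constraint 6.8 with the monopoly profit normalized to 1.

    Market 1 meets every period (discount factor delta); market 2 meets
    every even period (implicit discount factor delta**2)."""
    loss = 0.5 * (delta / (1.0 - delta) + delta**2 / (1.0 - delta**2))
    return 1.0 <= loss


def multimarket_threshold() -> float:
    """Positive root of 4 d^2 + d - 2 = 0, i.e., condition 6.9."""
    return (-1.0 + math.sqrt(33.0)) / 8.0


if __name__ == "__main__":
    delta = 0.6
    print("market 1 alone:", single_market_ok(delta))
    print("market 2 alone:", single_market_ok(delta**2))
    print("both markets pooled:", multimarket_ok(delta))
    print("threshold:", round(multimarket_threshold(), 3))
```

At $\delta = 0.6$ the pooled constraint holds even though market 2 on its own fails its constraint, which is the slack-pooling effect described above.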

The intuition for this result is that the loss of collusion 
on market 1 may be so large as to deter deviations not 
only on market 1 but also on market 2. (Technically, 
the incentive (no-undercutting) constraints on the two 
markets are pooled into a single constraint—i.e., 6.8. If 
they are both satisfied, then 6.8 is also satisfied. However, 
the converse does not hold, as has just been shown. The 
set of sustainable allocations cannot be smaller under 
multimarket contact than under single-market contact.) 

Bernheim and Whinston (1986) offer a much more
complete treatment of multimarket contact and tacit
collusion.22


Exercise 6.6" Consider two firms interacting in two iden- 
tical and independent markets. The markets differ in that 
in market 1 a firm's price at time $t$ is observed at $t + 1$,
whereas in market 2 it is learned only at $t + 2$. Thus,
although each of the markets meets every period, market
2 has longer information lags.

(i) Argue that in the absence of multimarket contact,
collusion in market 2 would be sustainable if and only if
$\delta \geq 1/\sqrt{2} \simeq 0.71$.

(ii) Show that under multimarket contact, collusion in
both markets is sustainable if (and only if) $\delta \geq \bar{\delta}$, where
$\bar{\delta} \simeq 0.64$.


6.3.3 Secret Price Cuts


In subsections 6.3.1 and 6.3.2 it was assumed that after
some information lag, a firm's past price choice is
perfectly observed by its rival. However, one can think of
instances in which prices are not observable. As was
mentioned in the supplementary section of chapter 4, a firm
may offer a price discount to a customer, or may increase
the quality of its services without raising the price. If
its rival's price is unobservable, a firm must rely on the
observation of its own realized market share or demand
to detect any price undercutting by the rival. However, if
the demand function is random and shocks are
unobservable, this inference process is garbled. A low market
share may be due to the aggressive behavior of one’s rival
or to a slack in demand. Hence, when demand is very
random, price cuts are hard to detect. As Stigler (1964)
noted, this tends to hinder collusion.


22. Multimarket contact may raise the firms’ profits, but need not raise the
price in both markets. As Bernheim and Whinston show, the price may be
higher in one market and lower in the other.

E. Green and R. Porter (1984; see also Porter 1983b)
have developed a supergame model that formalizes the
issue of secret price cutting.23 As was mentioned above,
the maximal-punishment principle need not apply to such
a situation. When price choices are perfectly observable,
it makes sense to resort to extreme punishments because
such punishments are never observed on the equilibrium
path and therefore are costless to the firms (they are
just threats). Under uncertainty, mistakes are unavoidable
and maximal punishments (eternal reversion to Bertrand
behavior) need not be optimal.

As the analysis of collusion under price secrecy is more
complicated than that under perfectly observable prices
(because optimal punishments must be determined), it is
postponed until the supplementary section. There we will
content ourselves with a simple example: Firms charge
the monopoly price as long as their profit has been high
in the past. If a firm observes a low profit, which a priori
could be due to a price cut by its rival or to a low demand,
or if it itself has undercut the monopoly price in the last
period, it charges a low price for some period of time $T$,
and so does its rival (this is a punishment phase). The
firms revert to the collusive phase (i.e., charge the
monopoly price) after the punishment phase is completed, until
the next deviation or slump in demand. The increase in
the probability of reversion to a punishment phase makes
price cutting unprofitable for the firms. The model then
predicts periodic price wars, contrary to the
perfect-observation models of subsections 6.3.1 and 6.3.2. Price
wars are involuntary, in that they are triggered not by
a price cut but by an unobservable slump in demand.
(Indeed, in the collusive phase both firms charge the
monopoly price until their profit is adversely affected by
a demand shock.) Note also that price wars are triggered
by a recession, contrary to the Rotemberg-Saloner model.
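
The mechanics of this trigger strategy can be illustrated with a small simulation. This is a rough sketch rather than the model of the supplementary section. It assumes that both firms follow the strategy, so that no one actually undercuts, and that a demand slump (and hence a low profit) occurs independently in each collusive period with probability `slump_prob`. The names `slump_prob`, `T`, and `seed` are hypothetical parameters introduced for the illustration.

```python
import random


def simulate_trigger(periods, slump_prob, T, seed=0):
    """Return the sequence of phases: 'C' (collusive) or 'P' (punishment).

    Assumption: nobody deviates, so a punishment phase of length T is
    triggered only by a demand slump observed in a collusive period."""
    rng = random.Random(seed)
    path = []
    punish_left = 0
    for _ in range(periods):
        if punish_left > 0:
            path.append("P")          # both firms charge a low price
            punish_left -= 1
        else:
            path.append("C")          # both firms charge the monopoly price
            if rng.random() < slump_prob:
                punish_left = T       # low profit observed: price war starts
    return path


def count_price_wars(path):
    """Number of collusive periods immediately followed by a punishment period."""
    return sum(1 for a, b in zip(path, path[1:]) if a == "C" and b == "P")


if __name__ == "__main__":
    path = simulate_trigger(periods=40, slump_prob=0.2, T=3, seed=1)
    print("".join(path))
    print("price wars started:", count_price_wars(path))
```

With `slump_prob = 0` the path is collusive forever; with a positive slump probability, price wars recur even though no firm ever cuts its price, which is the sense in which they are involuntary.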

Under imperfect information, the fully collusive out- 
come cannot be sustained. It could be sustained only if the 
firms kept on colluding (charging the monopoly price) 
even when making small profits, because even under col- 
lusion small profits can occur as a result of low demand. 
However, a firm that is confident that its rival will con- 
tinue cooperating even if its profit is low has every 
incentive to (secretly) undercut—price undercutting yields 
a short-term gain and creates no long-run loss. Thus, 
full collusion is inconsistent with the deterrence of price 
cuts.24 


Exercise 6.7" A procurement agency repeatedly pur- 
chasing supplies for a federal, state, or local government 
uses sealed competitive auctions. Price quotations are 
opened publicly at a predetermined date. What are the 
potential adverse effects of this procedure? 


6.3.4 Discussion 


As we have seen, the supergame framework is simple 
and lends itself to many applications. This subsection 
identifies the main features of the supergame model and 
discusses its methodology. 


6.3.4.1 Synchronic Timing 


We assume that firms always choose their prices simul- 
taneously. That is, a firm's current profit is no longer 
affected by its rival's previous price choices when it 
chooses its own price. As is easily seen, the synchronicity 
assumption is not crucial to the main supergame result 
(the Folk theorem for low impatience). However, it
implies specific conduct. The firms’ strategies are bootstrap
strategies in the following sense: At any point in time,
past prices do not affect current (or future) profits. Hence,
the only reason a firm conditions its pricing behavior (in
collusive equilibria) on previous price choices is that the
other firms do so. The achievement of collusion stems
from a subtle self-fulfilling expectation. Firms do not
follow simple business strategies, such as regaining one’s
market share when the latter has been (and still is) eroded
by one’s rival's aggressive price behavior. In a supergame,
the market share is no longer eroded by the time the firm
reacts. If we eliminate the bootstrap equilibria, we are left
with noncollusiveness (repeated Bertrand behavior).


23. See Porter 1983a for a test of the existence of price wars (an implication
of the Green-Porter model) using data from the U.S. railroad industry in the
1880s.


24. For an example of a supergame showing that full collusion between two
parties may be impossible even for $\delta$ close to 1 if these parties’ actions are
observable only with noise (i.e., under “bilateral moral hazard”), see Maskin et
al. 1986. Fudenberg, Levine, and Maskin (1988) provide general conditions for
full collusion to be sustainable.


6.3.4.2 Infinite Horizon


As we saw, collusion cannot be sustained in the
supergame framework, even for a long but finite horizon. Thus,
an infinite horizon is crucial to the results. This raises
the possibility that the results are not robust to
finite-length price interaction, a reasonable assumption. The
infinite-horizon assumption need not be taken too
seriously. Suppose that at each period there is a probability $x$
in (0, 1) that the market “survives,” i.e., that the firms keep
competing on this market (one can think of $1 - x$ as the
probability that the good becomes obsolete or intense
competition emerges). The game then ends in finite (but
random) time with probability 1. However, everything
is as if the horizon were infinite and the firms’ discount
factor were equal to $\hat{\delta} = \delta x$, as is easily checked on the
objective functions (explanation: The future is discounted
with discount factor $\delta$, but exists only with probability $x$).
Thus, if both $\delta$ and $x$ are sufficiently high, supergame
collusion can be enforced. Note, however, that this result
relies on probability $x$ being constant over time.
Unfortunately, little is known about the set of equilibria
when, for instance, a time-varying $x_t$ declines sharply at
some point in time (although one would suspect that
supergame collusion would be hard to sustain in such an
environment).
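
The equivalence with a lower discount factor can be spelled out in one line. The following is a sketch under simplifying assumptions that are ours, not the text's: the market is active at date 0 with certainty, survival draws are independent across periods, and the per-period payoff $\pi$ is constant as long as the market is active.

```latex
\[
\mathbb{E}\left[\sum_{t=0}^{\infty} \delta^{t}\,
   \mathbf{1}\{\text{market active at } t\}\,\pi\right]
 = \sum_{t=0}^{\infty} \delta^{t} x^{t}\,\pi
 = \frac{\pi}{1-\delta x}.
\]
```

The right-hand side is the payoff of an infinite-horizon game discounted at the product $\delta x$. The probability that the market is still active at date $t$ is $x^{t}$, which tends to 0, so the game ends in finite time with probability 1.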


6.3.4.3 Multiplicity of Equilibria


As was noted earlier, the supergame story is too successful
in explaining tacit collusion. The multiplicity of equilibria
is an embarrassment of riches. We must have a reasonable
and systematic theory of how firms coordinate on a
particular equilibrium if we want the theory to be predictive
and allow for comparative statics. One natural method is
to assume that the firms coordinate on an equilibrium that
yields a Pareto-optimal point in the set of the firms’
equilibrium profits. One can further cut the set of remaining
equilibria by choosing a symmetric equilibrium if the
game is symmetric. This is a useful methodology, but it
raises two problems. First, the game may be asymmetric
(e.g., because of intrinsic cost differences), or it may
become asymmetric when a richer context is studied (e.g.,
when firms make investment decisions).25 Second, the
choice of an efficient equilibrium raises the issue of
“renegotiation.” Suppose that the firms initially coordinate
on the monopoly-price equilibrium, and that some firm
deviates by undercutting in the first period. The
equilibrium strategies specify some punishment from period 2
on. For instance, we considered the maximal punishment
in which firms charge the competitive price forever after
a deviation. But the firms, who expect no profit from
period 2 on, have an incentive to renegotiate to avoid the
punishment phase and reach an efficient equilibrium anew.
And, indeed, there is no reason why the firms could not
coordinate on an efficient equilibrium at date 2 if they
were able to do so at date 1.26

The possibility of renegotiating undermines the strength 
of punishments and, therefore, the incentive not to under- 
cut. (To impose discipline, firms would prefer to com- 
mit to not renegotiate; but such a commitment is not 
credible.) Farrell and Maskin (1986), Pearce (1987), and 
van Damme (1986) offer analyses of supergames that 
account for the possibility of renegotiation. 


6.4 Price Rigidities 


As was discussed in subsection 6.3.1, the above super- 
game description of repeated price interaction is very 
special in that firms never react to variables that currently 
affect their profits. In reality, prices cannot be adjusted
continuously. Firms incur costs in deciding price changes,
sending new price lists and catalogs to retailers, changing
price tags, advertising price cuts to consumers, and so on.
These “menu costs” are generally small, so firms are able
to change their prices frequently if they elect to do so.
However, changing one’s price every day or every minute
would often be prohibitively expensive, so prices are likely
to exhibit short-run rigidities. Beyond price rigidity, there
are other channels through which past price choices affect
current profits. On the demand side, past prices may
affect the firms’ current goodwill through consumers’
learning about the good or switching costs. On the supply
side, past prices affect current inventories (or current
workload, if orders take time to be filled).


25. Even if the firms are symmetric and make symmetric investment
decisions in equilibrium, they must contemplate the outcome of the repeated
price game under asymmetric investments when they make their investment
decisions.


26. Of course, the managers of a firm that has been undercut often comment
that “the managers of the other firm cannot be trusted to cooperate anymore.”
But such a comment refers to the firm's managers learning information about
the other firm or its managers. Hence, it relates more to the situation described
in section 6.5.

The presence of price rigidities raises the possibility
that price reactions are not bootstrap reactions but are
simply attempts to regain or consolidate market share.
The simplest way to (roughly) formalize short-run
rigidities and reactions to payoff-relevant prices is to assume
that firms choose their prices asynchronously.27 For
simplicity, let us consider two firms producing perfect
substitutes. At odd (respectively, even) periods, firm 1
(respectively, firm 2) chooses its price. A price $p_{i,t}$ chosen
by firm $i$ at date $t$ is fixed for two periods: $p_{i,t+1} = p_{i,t}$. In
period $t + 2$, firm $i$ may choose a new price, which again
will be locked in for two periods. (Our exogenous
asynchronicity assumption will be motivated later on.)

Firm $i$’s objective is to maximize the present
discounted value of its profit:

$$\sum_{s=0}^{\infty} \delta^{s}\,\Pi^{i}(p_{1,t+s}, p_{2,t+s}).$$

Now the model is identical to the supergame model,
except that asynchronicity replaces synchronicity.

We look for a perfect equilibrium in which the firms’
price choices are simple in that they depend only on
the “payoff-relevant information.” More precisely, at date
$2k + 1$, firm 2 is still committed to the price ($p_{2,2k}$) it
chose a period earlier. This price affects firm 1's profit at
date $2k + 1$ and, therefore, will be termed payoff-relevant.
We assume that $p_{1,2k+1} = R_1(p_{2,2k})$; that is, firm 1's
strategy is conditioned on as little information as is
consistent with rationality (no bootstrapping). And similarly
for firm 2: $p_{2,2k+2} = R_2(p_{1,2k+1})$. $R_1(\cdot)$ and $R_2(\cdot)$ are
called Markov reaction functions. A Markov perfect
equilibrium is a perfect equilibrium in which the firms use
Markov strategies. For any current price $p_{2,2k}$ at time
$2k + 1$, firm 1's reaction must maximize its objective
function given that the firms will react according to $R_1(\cdot)$ and
$R_2(\cdot)$ in the future. Mathematically, firm 1’s intertemporal
profit from date $2k + 1$ on when it reacts to $p_{2,2k} = p_2$
by choosing $p_{1,2k+1} = p_1$ is

$$V^1(p_2) = \max_{p_1}\,\bigl[\Pi^1(p_1, p_2) + \delta\,\Pi^1(p_1, R_2(p_1)) + \delta^2\,\Pi^1(R_1(R_2(p_1)), R_2(p_1)) + \cdots\bigr],$$

since firm 2 will react with $R_2(p_1)$ in the next period, and
then firm 1 will react in two periods to $R_2(p_1)$ with
$R_1(R_2(p_1))$, and so on. (Here we are appealing to the
“one-period-deviation criterion.” It is clearly necessary for
an equilibrium that no firm wants to deviate from the
reaction rule for one period and then conform to this rule.
Conversely, this one-period-deviation criterion is
sufficient for equilibrium, as an intertemporal deviation from
the rule can be decomposed into sequential one-period
deviations. See section 6.7.) In equilibrium, $p_1 = R_1(p_2)$
must maximize the expression in brackets for all $p_2$. Firm
2 behaves similarly.

In subsection 6.7.2 we will derive the conditions to be
satisfied by an equilibrium pair of reaction functions; here,
we content ourselves with a simple example and
consider some implications of the model.


6.4.1 A Kinked-Demand-Curve Example 


Let $D(p) = 1 - p$, and let $c = 0$ for the two firms. The
price grid is discrete: $p_h = h/6$, where $h = 0, 1, \ldots, 6$.
Now, $p_0 = 0$ is the competitive price, and $p_3 = 1/2$ is the
monopoly price. Consider the symmetric reaction
function $R_1(\cdot) = R_2(\cdot) = R(\cdot)$ in table 6.1. (The right-most
column indicates the industry profit when the lowest price
charged is $p$.) These strategies can be shown to form an
equilibrium for any discount factor sufficiently close to 1
(rapid price adjustments). See section 6.7.

Table 6.1

$p$      $R(p)$                                $36\,\Pi(p)$
$p_6$    $p_3$                                 0
$p_5$    $p_3$                                 5
$p_4$    $p_3$                                 8
$p_3$    $p_3$                                 9
$p_2$    $p_1$                                 8
$p_1$    $p_3$ with probability $\alpha$,      5
         $p_1$ with probability $1 - \alpha$
$p_0$    $p_3$                                 0

Note: $\alpha$ depends on $\delta$.

This equilibrium is highly reminiscent of the
kinked-demand-curve story. The focal price (the steady state) is
the monopoly price $p_3$ here. Starting from this price, if a
firm raises its price, its rival does not follow suit; it stays
at the focal price. If the firm undercuts to $p_2$, its rival
reacts with a price war. In this particular equilibrium, the
price war has two stages: In turn, the rival undercuts to
$p_1$. At that low price, the firms then engage in a “war of
attrition.” Both firms want the price to go back up to the
focal price. However, each firm would like the other firm
to move first, because the relenting firm loses market
share in the short run. The outcome is a mixed-strategy
behavior, in which each firm either continues the price
war or cracks and raises its price. The difference with the
kinked-demand-curve (KDC) story is that the reactions
are real and fully rational. (A small difference lies in
the fact that the price-war phase here differs from the
matching behavior of the KDC story.)


27. The following exposition follows Maskin and Tirole 1988. Eaton and
Engers 1987 study this model with differentiated products.

Let us, for instance, check that at the focal price no firm
wants to undercut. Given the equilibrium strategies, firm
1, say, gets an intertemporal profit (all profits are
multiplied by 36) of

$$V(p_3) = (1 + \delta + \delta^2 + \delta^3 + \cdots)\,4.5 = 4.5/(1 - \delta).$$

By undercutting to $p_2$, it gets a profit of 8 today. In the
next period, it is undercut to $p_1$ by firm 2 and, therefore,
gets 0. Two periods from now, it is firm 1’s turn to choose
a price. According to the equilibrium strategy, one
optimal action is to revert to the focal price. (To compute a
payoff, one can take any equilibrium action when
the firm plays a mixed strategy, because the equilibrium
condition is that all actions played with positive
probability yield the same payoff.) Firm 1 makes no profit in that
period, and brings the industry back to the focal price
from the following period on. Thus, undercutting to $p_2$
yields

$$8 + \delta \cdot 0 + \delta^2 \cdot 0 + (\delta^3 + \delta^4 + \cdots)\,4.5 < V(p_3)$$

for $\delta$ close to 1. This computation illustrates the trade-off
between the short-run gain from undercutting and the
longer-run loss due to the price war. Firm 1's profit is
increased by (8 − 4.5) = 3.5 today, but is reduced by 4.5
in each of the following two periods (if we suppose that firm 1
elects to relent at price $p_1$).
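
The arithmetic of this check is easy to reproduce. The sketch below is only an illustration, and the function names are hypothetical. It recomputes the profit column of table 6.1 from $D(p) = 1 - p$ with zero cost and compares the two payoff streams discussed above. It verifies this single deviation only; it does not establish that the strategies form an equilibrium.

```python
def industry_profit_x36(h):
    """36 * p * (1 - p) at p = h/6 with zero marginal cost."""
    return h * (6 - h)


def value_at_focal(delta):
    """Each firm earns half of the industry profit 9 (i.e., 4.5) forever."""
    return 4.5 / (1.0 - delta)


def value_from_undercut(delta):
    """8 today, 0 in the next two periods, then 4.5 per period forever."""
    return 8.0 + delta**3 * 4.5 / (1.0 - delta)


if __name__ == "__main__":
    print([industry_profit_x36(h) for h in range(7)])
    for delta in (0.5, 0.6, 0.9):
        unprofitable = value_from_undercut(delta) < value_at_focal(delta)
        print(delta, "undercutting unprofitable:", unprofitable)
```

The inequality reduces to $8 < 4.5\,(1 + \delta + \delta^2)$, so this particular deviation is already unprofitable for moderate discount factors; the requirement that $\delta$ be close to 1 comes from the other equilibrium conditions, which are not checked here.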


Remark 1 The above equilibrium suggests a “strategic”
theory of excess capacity. We have assumed to this point
that the firms can satisfy their demand. Now, let us
assume that they must first install their capacities before
competing over prices (as in chapter 5, but with dynamic
price competition as the “second stage”). In the stationary
state, the firms need capacity equal to half the total
market demand at $p_3$, that is, 1/4. However, a firm's threat to
charge $p_1$ if the other firm charges $p_2$ is no longer credible
if the capacity of the former firm is 1/4. At price $p_1$,
the former firm is far from able to satisfy demand (5/6).
The outcome at $p_3$ is then no longer an equilibrium when this
level of capacity is chosen. In fact, it can be shown that
the firms benefit from installing capacities which they will
not use, but which they would use if the other firm
became more “aggressive.”28


28. See Maskin and Tirole 1985 for an example of strategic excess capacity; see
Benoit and Krishna 1987 and Davidson and Deneckere 1985 for examples of
similar behavior in a supergame context. Rotemberg and Saloner (1985a) make
a similar point with respect to inventories. A nonstrategic reason for
maintaining excess capacity is the fluctuation in demand over time (either in a seasonal
manner or over the business cycle). See Arrow, Beckmann, and Karlin 1988 and
(for a related idea in an inventory context) the contributions in Arrow, Karlin,
and Scarf 1958, Mills 1962, and Zabel 1972. See also the discussion of
inventories in chapter 1 of the present volume.


Remark 2 There also exist equilibria in which the price
never settles. For instance, in the previous example, the
symmetric strategies {$R(p_6) = R(p_5) = p_4$; $R(p_4) = p_3$;
$R(p_3) = p_2$; $R(p_2) = p_1$; $R(p_1) = p_0$; and $R(p_0) = p_0$
with probability $\beta$ and $p_5$ with probability $1 - \beta$} (where
$\beta$ depends on $\delta$) form another equilibrium for $\delta$ close
to 1. At this equilibrium, market dynamics consist of
a price war followed by a rapid relenting phase followed
by a new price war, and so on. A market onlooker
thus observes a cyclical path of market prices. Each firm
undercuts because, with good reason, it anticipates that
maintaining its price will not prevent the other firm from
being aggressive. In that sense, mistrust is a self-justifying
attitude.29
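
The cyclical path can be traced mechanically. The sketch below is an illustration under stated assumptions: prices are represented by their grid index $h$ (price $h/6$), the reaction rule is our reading of the strategy list in Remark 2, and `beta` is a free hypothetical parameter, since the text says only that the mixing probability depends on $\delta$.

```python
import random

# Reaction rule of Remark 2 on grid indices h (price = h/6), as read
# from the strategy list: undercut by one step, except from p_6.
CYCLE_RULE = {6: 4, 5: 4, 4: 3, 3: 2, 2: 1, 1: 0}


def react(h, beta, rng):
    """Price index chosen in response to the rival's current index h."""
    if h == 0:
        # At the competitive price: stay with probability beta,
        # otherwise relent by jumping up to p_5.
        return 0 if rng.random() < beta else 5
    return CYCLE_RULE[h]


def price_path(start, steps, beta, seed=0):
    """Most recently chosen price index in each period (firms alternate)."""
    rng = random.Random(seed)
    path = [start]
    for _ in range(steps):
        path.append(react(path[-1], beta, rng))
    return path


if __name__ == "__main__":
    print(price_path(start=5, steps=20, beta=0.5, seed=3))
```

Each run shows gradual undercutting down to the competitive price, a random waiting time there, and a jump back up, after which a new price war begins.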


competitive profit (which is 0). For instance, the average
industry profit in a symmetric equilibrium is equal to at
least half of the monopoly profit for $\delta$ close to 1. The
intuition here is that if firms were stuck in the competitive
price region, with the prospects of small profits in the
future, a firm could raise its price dramatically and lure
its rival to charge a high price for at least some time
(the rival would not hurry back to nearly competitive
prices). Thus, tacit collusion is not only possible (as in the
supergame approach) but necessary. Furthermore, it can
be shown that there exists only one pair of equilibrium
strategies that sustain industry profits close to the
monopoly profit. These strategies (which are described in
subsection 6.7.2) form a symmetric kinked-demand-curve
equilibrium at the monopoly price, and they are the only
symmetric “renegotiation-proof” equilibrium strategies
(whatever the current price, the firms cannot find an
alternative Markov perfect equilibrium that they both prefer).

Price rigidities suggest the possibility that price
adjustments are more sluggish during booms than during busts.
The point is that during a bust in demand, price
adjustments are downward and thus result in a temporary
increase in the market share of the firm initiating the
price cut. During a boom, each firm is reluctant to
adjust (upward) because this results in a temporary loss of
market share.30

Asynchronicity was exogenously imposed in order to
allow the formulation of the notion of reaction to
payoff-relevant variables. But it can be shown that if firms can
choose their prices whenever they want, subject to the
constraint that their price is locked in for two periods
once it is chosen (short-run commitment), the firms
effectively end up moving asynchronously. This vindicates the
use of alternating timing. Of course, the two-period
commitment is a conceit. Optimally, one would wish to
describe the price frictions evoked at the beginning of this
section in more detail. Gertner (1986) takes this route, and
assumes that there exists a fixed menu cost of changing
prices.31 His results reaffirm the previous ones (in
particular, that the monopoly price is sustainable by some
Markov-perfect-equilibrium strategies, and that equilibrium
profits are bounded away from the competitive profit).

Price rigidities cannot formalize tacit collusion in
situations of repeated auctions (see page 222 of Scherer 1980
for a description of collusion and breakdown of collusion
in the antibiotic industry, where the federal government
was the major buyer). Then new prices are quoted in each
auction, and the timing is constrained to be synchronous.
(Ortega-Reichert (1967) has developed a theory of tacit
collusion for first-bid auctions, which is similar to the
theory of tacit collusion developed in section 6.5 below.
The idea is that by charging a high price in the current
auction, each firm tries to signal to the other firms that it
is less cost effective than it really is, and thereby to
indicate that it will not price aggressively in the next
auction. It thus induces its rival to price less
aggressively in the future. The consistency of this strategy with
rational behavior will be examined in section 6.5 and in
chapter 9 in related contexts.)


6.5 Reputation for Friendly Behavior 


Section 6.4 shows how short-run price rigidities can
sustain price collusion. More generally, this approach
presumes that firms react to payoff-relevant variables that
are determined by past price choices (the payoff-relevant
variables can be the prices themselves, but they can also
stem (less directly) from consumer inertia or from
technological intertemporal links). It thus emphasizes the role
of tangible variables. In reality, history matters not only
through its effect on tangible variables but also through
the information it conveys about competitors.
Oligopolists are concerned with many variables that they cannot
estimate precisely. Among these are the cost structure (or,
more generally, the objective function) of their rivals, the
state of demand, and the potential of the market. Beyond
individual or industry surveys, firms rely on market
information (such as past price choices and observed demand)
to estimate these variables. Thus, history also matters
through intangible variables (beliefs). This raises the
possibility that the firms may want to manipulate their rivals’
information in order to derive benefits later on.


29. Such an equilibrium price cycle arises in the absence of capacity constraints.
(Edgeworth suggested the possibility of price cycles as a consequence of the
potential nonexistence of a pure-strategy equilibrium in the static price game
with capacity constraints; see chapter 5 above.)


30. See Maskin and Tirole 1988 for a preliminary example of such behavior
based on unforeseen demand shocks and using the renegotiation-proof
equilibrium behaviors.


31. The literature on menu costs stems from Barro 1972, Sheshinski and Weiss
1977, and Sheshinski and Weiss 1983. Important recent research in this area (in
monopoly or competitive frameworks) has been done by Caplin and Spulber
(1987) and Bénabou (1985). In macroeconomics there exists a large literature on
costly price adjustments. Of particular relevance to the dynamic study of effects
of menu costs is Blanchard’s (1983) analysis of asynchronic price movements.

Chapter 9 is devoted to the general study of strategic
interaction under incomplete information. The purpose of
the present section is to suggest why asymmetries in
information are likely to induce firms to raise their prices
in a situation of repeated price interaction.

The basic intuition (which will be systematically
developed in chapters 8 and 9) is the following: In a one-shot
interaction, a firm is more likely to charge a high price if
it knows that its rival is likely to charge a high price (this
step must be taken for granted for the time being, but it
is quite intuitive). Hence, because a firm likes its rival to
charge high prices, it would like to convince its rival that
it is likely to charge high prices itself. In a one-shot
situation, there is little that a firm can do to convince its
rival of this. Talk is cheap and cannot be trusted to supply
useful information. In a dynamic framework, however, a
firm can send costly signals. For instance, in a two-period
price game, a firm can in the first period charge a price
that exceeds its first-period expected-profit-maximizing
price in order to signal that it is likely to charge a high
price in the second period. (This signal may be credible
precisely because it is costly: it implies a first-period
sacrifice of profit.) Why should a high first-period price
convey the information that the firm is likely to charge
a high second-period price? Suppose that the firm has
private information about its production cost (or about
the demand curve).32 In a one-shot relationship, the firm
tends to charge a high price if its marginal cost is high (or
if it has information that demand is high) and a low price
otherwise. Thus, absent strategic manipulation of
information, a high first-period price reveals that the firm has
a high cost (or that the demand is high). If costs (or
demand) are positively correlated over time (in particular,
if they are time invariant), the firm will also charge a high
price in the second period. Hence, a rival ought to use the
information conveyed by the firm’s first-period price to
predict its second-period behavior. Of course, a rational
firm realizes this and generally wants to manipulate its
rival's information. To do so, it charges a price above
the first-period expected profit-maximizing price given its
marginal cost (or its information about demand). In turn,
the rival ought to anticipate the firm's incentive to charge
high prices, and modify its inference process accordingly.
The proper extension of the Nash perfect-equilibrium
concept to this type of situation is described in section
11.5 below. The study of equilibrium indeed confirms
the intuition just developed. In a repeated price game with
asymmetric information about marginal cost or demand, each
firm sacrifices short-run profit by raising its price in order to
build a reputation for charging high prices. (Of course, in a
two-period game this effect operates only in the first
period. In the second period, each firm maximizes its
expected second-period profit given its second-period
information, because there is no future to carry reputation
over to.)

As was mentioned in the previous section, this rea- 
soning may be applicable to repeated auctions. Suppose 
that the government procures its supplies (concrete, weapons,
antibiotics, etc.) through first-bid auctions.33 In a
single auction, a firm generally does not charge a price 
equal to its marginal cost. By charging above its marginal 
cost, it makes a positive profit in case the other firms bid 
even higher. (This departure from the Bertrand outcome 
can be attributed to the possibility of a cost differential 
between the firms. See the Game Theory User's Manual 


‘operate. 1{ can also jam its rivals’ information through the choice of unobsery 
able actiofs (here, secret price adjustments), as in Riordan’s (1985) model. See 


32.4 ie not have private information for the phenomenon described to 
chapter 9 


low. 


33. A first-bid auction has all the bidders announce a price at which they are
willing to supply the good; the lowest-price firm is picked and receives its bid
to supply the good. A first-bid auction is thus equivalent to Bertrand
competition with perfect substitutes (a minor difference is that the quantity
bought by the buyer, e.g., the government, is often fixed, i.e., inelastic,
rather than due to a downward-sloping demand).


The Nash equilibrium is (F, F). Although the prisoner's
dilemma is not a faithful representation of a price game, it
contains some elements that illustrate price competition
and the Bertrand paradox. In particular, both players would
be better off cooperating (charging a high price) rather
than finking (undercutting), but each player is individually
better off finking. It also shares with the Bertrand price
game the property that repeating the game T times does
not help sustain collusion: In the last period, both players
will fink; so they will in the penultimate period, and so on.
Thus, unless the game is repeated an infinite number of
times (in which case, if the discount factor is sufficiently
large, collusion, i.e., (C, C), can easily be sustained by
means of threats to revert to (F, F) in the future if one of
the players does not cooperate today), the (F, F) equilibrium
is still the only noncooperative equilibrium in each
period.
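
The dominance argument can be checked mechanically. The following is a minimal sketch, not the book's own material: the payoff numbers are an assumption inferred from the figures used in the surrounding discussion (3 for mutual cooperation, 4 for an unanswered fink, -1 for being cheated, 0 for mutual finking), and the function names are hypothetical.

```python
# Stage-game payoffs (player 1, player 2). ASSUMPTION: values are inferred
# from the numbers quoted in the text, not read off figure 6.4 itself.
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "F"): (-1, 4),
    ("F", "C"): (4, -1),
    ("F", "F"): (0, 0),
}
ACTIONS = ("C", "F")


def best_responses(player, rival_action):
    """Actions maximizing `player`'s payoff (0 or 1) against rival_action."""
    def payoff(action):
        profile = (action, rival_action) if player == 0 else (rival_action, action)
        return PAYOFFS[profile][player]

    best = max(payoff(a) for a in ACTIONS)
    return {a for a in ACTIONS if payoff(a) == best}


def is_dominant(player, action):
    """True if `action` is the unique best response to every rival action."""
    return all(best_responses(player, r) == {action} for r in ACTIONS)


def nash_equilibria():
    """Pure-strategy profiles in which each action is a best response."""
    return [
        (a1, a2)
        for a1 in ACTIONS
        for a2 in ACTIONS
        if a1 in best_responses(0, a2) and a2 in best_responses(1, a1)
    ]
```

Under these assumed payoffs, `nash_equilibria()` returns only the profile in which both players fink, and `is_dominant` confirms that finking is dominant for each player. This is what drives the backward-induction unraveling in the finitely repeated game.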

The basic insight of Kreps et al. 1982 is that a small 
uncertainty about the preferences of the players (techni- 
cally, about the above payoff matrix) can have a sig- 
nificant amount of influence on the players’ behavior if 


Figure 6.4 


the game is repeated long enough (but not necessarily 
repeated an infinite number of times). To see this, suppose 
that with probability 1 − α each player is “sane,” meaning
that his payoff is then given by the payoff matrix in 
figure 6.4 (so that, for instance, the payoff corresponding 
to (F, C) is 4 for player 1). With probability α, each player 
is “crazy.” His preferences are not given by the previous 
payoff matrix; rather, he behaves in the following way: 
He starts cooperating at date 1. He continues cooperating 
at date t as long as his opponent has cooperated up to 
that date; otherwise he finks. (One can think of this crazy 
type as representing a preference for cooperation—or a 
distaste for finking—coupled with a strong desire to 
punish a noncooperative opponent.) “Craziness” does not 
reflect a value judgment, but refers to behavior (or pref- 
erences) that deviates from the norm (which is associated 
with preferences defined by the payoff matrix). One can 
think of α as small if one feels that craziness is relatively 
unlikely. 

Let us briefly return to our metaphor, in which a firm 
can charge one of two prices (cooperating corresponds to 
charging the high price, and finking to charging the low 
price). There, one might think of the “sane type” as facing 
a low production cost (and therefore benefiting from an 
unanswered price cut), and the “crazy type” as facing a 
high enough cost (say, above the low price) that it never 
pays to undercut the rival's high price.³⁴,³⁵ 

Let us assume that the game is repeated from t = 0 to 
t = T, and, to simplify computations, that the discount 
factor δ is 1. Each player knows his preferences (“sane” or 
“crazy”), but does not know his opponent's. Because the 


34. On this interpretation, α need not be small.

35. The metaphor is imperfect as it stands, in that it is not clear why a
high-cost firm should charge a low price after a deviation. The reason why this
prisoner's dilemma is useful in building some intuition about the price game will
be given in chapter 9.
behavior of a crazy type is determined, we will derive 
player i's strategy assuming that he is sane.

Suppose that at date 0 player 1 finks. Because a crazy
player never takes the initiative to fink first, player 1 must
be sane. Both players finking from date 1 on (up to the
end of the game) is an equilibrium: By the very definition
of craziness, player 2, if he is crazy, retaliates for the
date-0 deviation from cooperation. If he is sane he might
as well fink, given that player 1 will fink no matter what.
Similarly, player 1, who is sane, cannot do better than to
fink. This actually is the only equilibrium from date 1
on³⁶ after a deviation from cooperation at date 0. This
means that if player 1 is sane and finks at date 0, his
payoff is at most 4 (he gets at most 4 at date 0, and gets
0 at dates 1, ..., T). Now, consider the strategy that
consists for player 1 in cooperating up to T unless player 2
deviates at some date t, in which case player 1 finks from
t + 1 to T. This strategy need not be (and actually is not)
optimal for player 1 if he is sane; however, as will now be
shown, it dominates finking at date 0 for T sufficiently
large, and therefore finking at date 0 cannot be optimal
even for a sane player. If player 2 is crazy, player 1 gets
3(T + 1) from this strategy. If player 2 is sane, player 1
gets at worst −1, because he gets caught cooperating
while his opponent finks for at most one period, after
which he behaves noncooperatively. Hence, player 1's
payoff from this strategy is at least


$$\alpha[3(T + 1)] + (1 - \alpha)(-1) > 4$$


for T sufficiently large. However small α may be, there
exists T₀ such that for T > T₀ finking at date 0 cannot
be optimal for player 1 even if he is sane. And similarly
for player 2. This means that, for a sufficiently large horizon,
each player cooperates at the beginning of the horizon,
even when from a myopic (static) perspective finking is
a dominant strategy for a sane player. More generally,
cooperation by both players is observed as long as
t < T − T₀.³⁷
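
To get a feel for how long the horizon must be, the inequality above can be solved numerically. The sketch below is illustrative only: the function names are hypothetical, the payoff values 3, -1, and 4 are those quoted in the text, and exact rational arithmetic is used so that boundary cases are not blurred by rounding.

```python
from fractions import Fraction


def cooperation_payoff_bound(alpha, T):
    """Lower bound alpha*3(T+1) + (1-alpha)*(-1) on player 1's payoff
    from cooperating until cheated, as derived in the text."""
    return alpha * 3 * (T + 1) + (1 - alpha) * (-1)


def minimal_horizon(alpha, fink_payoff=4):
    """Smallest integer T at which the bound strictly exceeds the
    (at most) 4 obtained by finking at date 0."""
    if alpha <= 0:
        raise ValueError("alpha must be strictly positive")
    T = 0
    while cooperation_payoff_bound(alpha, T) <= fink_payoff:
        T += 1
    return T
```

For instance, `minimal_horizon(Fraction(1, 10))` gives 16 and `minimal_horizon(Fraction(1, 100))` gives 166: the required horizon grows roughly like 5/(3α) as the probability of craziness shrinks, but it is finite for every α > 0.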


The intuition for this result is straightforward. Each
player, by cooperating, exposes himself to the risk that 
the other player finks, and that he obtains a low profit 
during one period (after which, having learned his lesson, 
he will no longer try to cooperate). However, by finking, 
he reveals that he is not of a cooperative (crazy) type and, 
therefore, loses future gains from cooperation if the other 
player is of a cooperative type. If the horizon is long 
enough, the loss of future cooperation exceeds the loss from 
being cheated on. In a sense, at the beginning of their 
relationship, the players want to keep their reputation for 
possibly being willing to cooperate; that is, they do not want 
to reveal that they are noncooperative. 

Because we have obtained our main result (that co- 
operation can be sustained for sufficiently large horizons 
even for small probabilities of craziness), we do not give 
a complete description of the equilibrium of this game. A 
sane player cooperates for some period of time. Then, 
near the end of the horizon, he starts milking his reputa- 
tion; the loss of future cooperation becomes comparable 
to that of being cheated upon. Hence, collusion breaks 
down at the end of the relationship (if at least one of the 
players is not cooperative).³⁸ 


6.5.2 Discussion 


A striking feature of the model in subsection 6.5.1 is the 
powerful effect of small uncertainties about each other's 
objective function on equilibrium behavior if the horizon 
is sufficiently long and the players are not very impatient. 
As Kreps et al. (1982) show, cooperation is sustainable for 
a small probability α of being crazy, whereas it is not for 
α = 0. 

Of course, the possibility of sustaining collusion or 
another outcome depends on the type of craziness one is 
willing to assume. For instance, if we assumed that a crazy 
player likes to cooperate no matter what (that is, is not 
“revenge hungry” after his opponent cheats), collusion 
would not be sustainable for any α.³⁹ Finking would 
imply no future loss of cooperation with a crazy player, 


36. At date T, everybody finks (the sane players do so because finking is a
dominant strategy in the last period; a crazy player 2 does so because he still
retaliates for the period-0 deviation). The same is true at date T − 1, and at any
date t ≥ 1, by backward induction.


37. If both types of players' best strategy is to cooperate at dates 0, ..., t − 1,
each player still has posterior beliefs given by (1 − α, α) at date t when both
have cooperated until then.
38. The equilibrium and its derivation are formally similar to the model of a
monopolist's reputation for quality. See the supplementary section of chapter 2.

39. By backward induction, finking is a “dominant” strategy for the sane type
at each date. Thus, the sane players play F in each period, and not only at the
end of the horizon.




and therefore nothing would deter a sane player from 


playing “tit for tat.” (The tit-for-tat strategy is to cooperate
at date 0 and then do whatever the other 


th sensitivity of equilibrium behavior to beliefs 


Fudenberg and Maskin (1986a) show that a kind of Folk theorem
holds, i.e., “any outcome” can be sustained as an equilibrium
of a sufficiently long and little discounted game with incomplete
information, as long as one is willing to assign a probability
α > 0 that each player is crazy in a particular way (see
subsection 6.7.3 for more details). The set of equilibria is
then as large as the set of equilibria for the infinitely
repeated game (supergame) between the sane types only.
Thus, we again face an embarrassment of riches. While
the supergame approach forces us to choose between a
large multiplicity of equilibria of a single model, the
reputation approach with long horizon (which, because of the
finite horizon, reduces the set of equilibria dramatically
for a given model) offers a large multiplicity of models
(descriptions of uncertainty) and associated outcomes to
choose from.

The Fudenberg-Maskin criticism suggests two routes
that can yield predictive power to the reputation model.
The first consists of being reasonably confident about the
sort of craziness that is likely to arise in practice.⁴⁰ The
second is more consistent with the general neoclassical
approach of presuming full rationality (in the sense of
profit-maximizing behavior) to obtain predictive power.
According to this approach, each player views his opponent
as rational and is rational himself; however, he does
not know some parameters, such as the opponent's marginal
cost or estimate of demand (see the above metaphor).


Besides sticking to the rationality axiom, this approach 
has the advantage of looking at a type of asymmetric 
information which is more likely to be “large” (as opposed
to the probability of craziness, which we would 
presume to be “small”). (The equilibria of dynamic games 
of incomplete information tend to be less sensitive to the 
exact specification of uncertainty when the magnitude of 
the uncertainty is non-negligible.) In chapter 9, we will 
generally take this second approach. 

It should, however, be pointed out that the approach 
outlined in subsection 6.5.1 was motivated by the many 
experiments that show that collusion is likely to be sustained
in long but finite games. For instance, Axelrod 
(1980) proposed a prisoner's dilemma (such as the one 
illustrated by figure 6.4) to game theorists in economics, 
psychology, sociology, political science, and mathe- 
matics. The game was supposed to be repeated T = 200 
times. Axelrod ran the strategies submitted by the game 
theorists against one another in a round-robin tourna- 
ment. The highest average score was realized by the 
tit-for-tat strategy rather than the “rational” strategy of 
finking at each period. 
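
The flavor of such a tournament can be conveyed with a toy simulation. This is only a sketch under stated assumptions: the payoffs are the ones inferred from this section's discussion of figure 6.4 (not necessarily those used in Axelrod's tournament), the three strategies are an arbitrary illustrative population rather than the actual submissions, and all function names are hypothetical.

```python
# ASSUMPTION: stage payoffs inferred from the text (3, 4, -1, 0).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "F"): (-1, 4),
    ("F", "C"): (4, -1),
    ("F", "F"): (0, 0),
}


def tit_for_tat(own_history, rival_history):
    """Cooperate first, then copy the rival's previous move."""
    return "C" if not rival_history else rival_history[-1]


def always_fink(own_history, rival_history):
    return "F"


def always_cooperate(own_history, rival_history):
    return "C"


def play(strategy1, strategy2, rounds=200):
    """Total undiscounted payoffs of two strategies over `rounds` periods."""
    h1, h2 = [], []
    score1 = score2 = 0
    for _ in range(rounds):
        a1 = strategy1(h1, h2)
        a2 = strategy2(h2, h1)
        p1, p2 = PAYOFFS[(a1, a2)]
        score1 += p1
        score2 += p2
        h1.append(a1)
        h2.append(a2)
    return score1, score2


def round_robin(strategies, rounds=200):
    """Total score of each named strategy against every entrant,
    including a copy of itself."""
    return {
        name: sum(play(s, other, rounds)[0] for other in strategies.values())
        for name, s in strategies.items()
    }
```

With the three entrants above, `round_robin` ranks tit for tat first, even though “always fink” beats it in their head-to-head match (4 against -1). The ranking depends on the population of entrants, which is exactly the caveat raised about evolutionary arguments later in the section.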

Such experiments point to the following ideas: Under
such circumstances, collusion does not seem to be
explained by the second approach: in contrast with
industrial-organization environments, there is no asymmetric
information about cost, demand, etc. If one assumes
that each participant's preferences are increasing with his
final wealth, there is no real asymmetry of information
about payoffs between the participants. Thus, the failure
to get defection (F) in each period seems to be traceable
to the rationality axiom. And, indeed, it is reasonable to
presume that at least a small fraction of the participants 
(the “crazy” players) have not performed the backward- 
induction reasoning that yields defection in each period, 
or are unable to do it, or believe that with some proba- 
bility some other participants have not performed this 
computation. The important contribution of the approach 
of Kreps et al. is to show that if the game is repeated long 


40. An interesting result due to Fudenberg and Levine (1987) states that under
certain conditions an informed player will, in the long run, succeed in convincing
his rivals that he is “crazy” and that his form of craziness is the (sane) informed
player's “preferred” craziness. Informally, this preferred type of craziness is the
one that, in a one-shot game, would induce the rival to take an action that is

player is a long-run player who faces a (long) sequence of short-run
players (i.e., players who play for a single period each), that the short-term
players have observed the long-term player's past behavior, and that they put
a positive prior probability on the preferred type of craziness. The intuition
behind the result is that, if the informed player is very patient and the horizon
is very long, it is worth taking the time and incurring the losses associated with
building the best possible reputation. The time frame to build such a reputation
is, however, longer than the one in the approach of Kreps et al., in which
craziness can take only one form, and the opponents do not have to distinguish
among various types of craziness.




enough, even a sane player who has performed all the
required computations may want to behave like a crazy
player, and do better than he would do by finking.


6.5.3 The Evolutionary Approach 


The reputation approach described in subsection 6.5.1
relies on a small probability of craziness. But it does not
disassociate itself totally from the rational approach, in
that a large fraction of the population attempts to maximize
total payoff. The evolutionary approach goes all
the way by not requiring maximizing behavior at all.
However, it recognizes that, first, economic agents cannot,
in the long run, use totally suboptimal rules (because
they will notice it or else disappear⁴¹) and, second,
allowing all kinds of irrationality implies a complete loss of
predictive power. This approach, which was pioneered,
in the tradition of Darwin, by (among others) Alchian
(1950), Hirshleifer (1977), and especially Nelson and
Winter (1982) in economics and Maynard Smith (1974,
17a}) in biology, looks at strategies, or (better terminology)
rules, that are “robust” in the sense that they
do fairly well against a variety of other rules.⁴² For
instance, Axelrod (1981) points out that in the repeated
prisoner's dilemma, tit for tat tends to be a robust rule
because it cooperates, is provoked into retaliation by the
defection of the opponent, and yet is forgiving after it
takes its one retaliation. (In contrast, the “always fink”
rule forbids gains from cooperation with cooperative types,
as was seen in subsection 6.5.2, while the “always fink
after a deviation” rule is not forgiving enough in case of
a “mistake.”⁴³)

The evolutionary approach assumes that, in the long
run, only actors who use robust rules will stay around.
The actors who use fragile rules will die (in a biological
context) or go bankrupt (in an economic one), or will
experiment with new rules; the more successful actors will
see their rules imitated by new generations:


The biological motivation for this approach is based on the
interpretation of the payoffs in terms of fitness (survival and
fecundity). All mutations are possible, and if any could invade
a given population it would have had the chance to do so. Thus
only a collectively stable strategy is expected to be able to
maintain itself in the long-run equilibrium as the strategy used
by all. Collectively stable strategies are important because they
are the only ones which an entire population can maintain in
the long run if mutations are introduced one at a time. (Axelrod
1981, p. 310)


The predictive power of the evolutionary theory re- 
mains to be determined. What is meant by “robust rule” 
is a priori contingent on the set and the probability distri- 
bution of mutations, and on the set of strategies the 
robust rule is allowed to be compared to. 


6.6 Concluding Remarks 


The full-fledged dynamic models of repeated price inter- 
action are complex; however, they provide the best 
approach for formalizing tacit collusion. Game theory 
imposes discipline by forcing the economist to think 
about and give a precise description of the strategic en- 
vironment (including, here, the frequency and timing of 
price changes and the information structure). In principle, 
this discipline allows for a better assessment of the rele- 
vance of the models; few would believe in a model simply 
because it formalizes some of the conventional wisdom 
(tacit collusion for high market concentration, kinked 
demand curve, price secrecy as a hindering factor, and 
so on). In the absence of conclusive econometric evi- 
dence of the predictive power of various models, precious 
information can be gained from looking at whether the 
very mode of competition presumed by the model (the 
“extensive form” of the game), and not only its outcome 
(the equilibrium prices or profits, say), fits the informal 
descriptions of the business press or those of the industry 
case studies. 

Why should we be interested in three (or more) ap- 
proaches to price collusion? The cynical view is that 
the absence of rigorous tests makes it difficult to reject 
an approach as irrelevant. But it may be the case that 
“theoretical heterogeneity” is needed here. As was men- 


41. E.g., by going bankrupt if there is sufficient experimentation or mutation.


42. The recent literature on repeated games played by automata (e.g., Abreu
and Rubinstein 1986, Aumann and Sorin 1986, Kalai and Stanford 1986, and
Rubinstein 1986) should also be mentioned.


43. Another advantage of tit for tat in an evolutionary context (in which
successful strategies survive and grow) is that it allows the survival of other
cooperative strategies (such as “always cooperate”). For a general overview of
Axelrod's approach to cooperation, see his book The Evolution of Cooperation
(1984) and Milgrom's 1984 review thereof.
tioned in the introduction, the rich array of behavioral
patterns across industries may justify a proliferation of
theories. Thus, the approaches developed in this chapter,
and other approaches, may be complements rather than
competitors.

This brings us to three important questions about
“comparative modeling” that have been only partially
addressed in this chapter and in the literature: How would
one go about testing the various approaches? Which one
is most relevant in a particular industry? Do they generate
different policy conclusions? These difficult questions
should be given high priority on the research agenda.


44. The fact that different modes of competition have different informational
contents is more general. For instance, Milgrom (1985) makes the point that
collusion may be easier to sustain for infinitely repeated “descending” auctions
than for infinitely repeated sealed-bid auctions. Suppose, e.g., that two firms
facing the same cost c to supply a buyer quote prices. In a sealed-bid auction
they quote prices simultaneously. In a descending auction the price goes down
by itself until one of the firms gives up, in which case the other firm supplies at
the current price. In a single auction or a finite number of auctions, the
equilibrium in both types of auctions is the competitive price (the buyer pays c). A
6.7 Supplementary Section: 
Dynamic Games and Tacit Collusion 


6.7.1 Secret Price Cuts 
6.7.1.1 Price Competition 


To formalize the intuition of section 6.3.3, we will now 
develop a straightforward model of collusion under im- 
perfect observability of prices. Because the Green-Porter 
model is slightly ad hoc in that it considers quantity com- 
petition and therefore assumes the presence of an auc- 
tioneer (if we are to take the informational assumption 
seriously), we shall depart from it by presuming direct 
price competition.⁴⁴ Thus, we shall make Stigler's (1964) 
assumption that firms do not observe their rivals' prices 
but rather infer them (imperfectly) from their own demand. 
The analysis, however, follows in the spirit of the Green- 
Porter model. 

The framework is that of the basic supergame. Two 
firms choose prices every period. The goods are perfect 
substitutes and are produced at constant marginal cost c, 
so the consumers all buy from the low-price firm. The 
demand is split in halves if both firms charge the same 
price. In each period, there are two possible states of 
nature. With probability α, there is no demand for the 
product sold by the duopolists (the “low-demand state”). 
With probability 1 − α, there is a positive demand, D(p) 
(the “high-demand state”). The monopoly price and the 
monopoly profits in the high-demand state are denoted 
by pᵐ and Πᵐ. We assume that the realizations of demand 
are independently and identically distributed (i.i.d.) over 
time. 

A firm that does not sell at some date is unable to 
observe whether the absence of demand is due to the 
realization of the low-demand state or to its rival’s lower 
price. It is always common knowledge that at least one 
firm makes no profit. If this occurs because of the absence 


repeated sealed-bid auction is similar to the repeated price game studied in
section 6.3. Collusion can be sustained only if the horizon is infinite and the
discount factor is sufficiently high. In a repeated descending auction, a deviation
is discovered immediately (instead of one period later) and thus can be made
unprofitable, even from a static point of view (the deviator's rival can refuse to
give up before the price equals c in the current auction). Collusion is feasible for
any discount factor. The market organization thus affects detection lags and the
amount of feasible collusion.
of demand, neither firm makes a profit. If this occurs 
because one firm undercuts the monopoly price, this firm 
knows that the other firm makes no profit. 

The all-or-nothing demand function is clearly contrived.
However, it allows the firms to face a nontrivial 
signal-extraction problem: A firm cannot say for sure 
whether a low demand stems from its rival's price cut. 
The model thus follows Stigler's approach. (With general 
demand functions for differentiated products, a firm could 
not perfectly infer a price cut.) 

In a nonrepeated game or a finitely repeated game,
both firms charge the competitive price c (the Bertrand
outcome). In the infinitely repeated version of the game,
we look at an equilibrium with the following strategies:
There is a collusive phase and a punishment phase. The
game begins in the collusive phase. Both firms charge pᵐ
until one firm makes a zero profit (as was noted above,
such an event is observed by both firms, even though a
firm does not observe its rival's profit).⁴⁵ The occurrence
of a zero profit triggers a punishment phase. Both firms
charge c for exactly T periods, where T can a priori be
finite or infinite. At the end (if any) of the punishment
phase, the firms revert to the collusive phase and charge
pᵐ as long as they both make positive profits.

Let us look for a length of the punishment phase such
that the expected present discounted value of profits of
each firm is maximal subject to the constraint that the
associated strategies form an equilibrium.

The strategies are always optimal in the punishment
phase. Given that its rival charges the competitive price
no matter what during T periods, each firm cannot improve
on the competitive price itself.

Let V⁺ (respectively, V⁻) denote the present discounted
value of a firm's profit from date t on, assuming
that at date t the game is in the collusive phase
(respectively, starts the punishment phase). By stationarity, V⁺
and V⁻ do not depend on time. By definition, we have


$$V^+ = (1 - \alpha)(\Pi^m/2 + \delta V^+) + \alpha(\delta V^-) \tag{6.10}$$

and

$$V^- = \delta^T V^+. \tag{6.11}$$


Equation 6.10 says that in the collusive phase, both firms
charge pᵐ. With probability 1 − α, demand is high, firms
make current profit Πᵐ/2 each, and the game remains in
the collusive phase, so that both firms have valuation V⁺
again next period. With probability α, there is no demand
today, and the game will be in the punishment phase
tomorrow. Equation 6.11 yields the present discounted
value of profits at the beginning of the punishment phase.
During T periods, both firms make zero profit, after which
they revert to the collusive phase.

Last, we must add the “incentive constraint,” which 
states that no firm would wish to undercut in the collusive 
phase: 


$$V^+ \geq (1 - \alpha)(\Pi^m + \delta V^-) + \alpha(\delta V^-). \tag{6.12}$$


This equation expresses the trade-off for each firm. If
a firm undercuts, it gets Πᵐ > Πᵐ/2. However, undercutting
automatically triggers the punishment phase, which
yields valuation V⁻ instead of V⁺ (these effects exist
only if demand is high; for a low demand, nothing
changes). Thus, to deter undercutting, V⁻ must be sufficiently
lower than V⁺. This means that the punishment
must last long enough. But because punishments are
costly and occur with positive probability, T should be
chosen as small as possible given that equation 6.12 must
be satisfied. Mathematically, equation 6.12 is equivalent
to (using equation 6.10)

$$\delta(V^+ - V^-) \geq \Pi^m/2. \tag{6.13}$$


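On the other hand, equations 6.10 and 6.11 yield

$$V^+ = \frac{(1 - \alpha)\Pi^m/2}{1 - (1 - \alpha)\delta - \alpha\delta^{T+1}} \tag{6.14}$$

and

$$V^- = \frac{(1 - \alpha)\delta^{T}\Pi^m/2}{1 - (1 - \alpha)\delta - \alpha\delta^{T+1}}. \tag{6.15}$$

Substituting equations 6.14 and 6.15 into equation 6.13
yields, after some computations,

$$1 \leq 2(1 - \alpha)\delta + (2\alpha - 1)\delta^{T+1}. \tag{6.16}$$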
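
For readers who wish to check the omitted computations, one possible route is sketched here. The shorthand D for the common denominator is introduced only for this sketch and is not part of the text's notation; note that D = (1 − δ) + αδ(1 − δ^T) is positive for δ < 1, so multiplying through by it preserves the inequality.

```latex
\begin{align*}
D &\equiv 1 - (1-\alpha)\delta - \alpha\delta^{T+1} > 0, \\
\delta\,(V^+ - V^-)
  &= \frac{(1-\alpha)\,\delta\,(1-\delta^{T})\,\Pi^m/2}{D}
  \;\geq\; \frac{\Pi^m}{2} \\
\iff\quad (1-\alpha)\bigl(\delta - \delta^{T+1}\bigr)
  &\geq 1 - (1-\alpha)\delta - \alpha\delta^{T+1} \\
\iff\quad 1 &\leq 2(1-\alpha)\delta + (2\alpha - 1)\,\delta^{T+1}.
\end{align*}
```

The second line uses the fact that the two valuations share the same denominator and differ only by the factor δ^T in the numerator; the last line collects the δ^{T+1} terms.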


Because the game starts in the collusive phase, the 
highest profit for the firms is obtained by solving the 
following program: 


45. This model possesses the same “bang-bang” property as its
complete-information counterpart, i.e., there is no reason for the firms to try
to implement any price less than pᵐ (in contrast with the Green-Porter quantity
model; see exercise 6.9).
$$\max_{T} V^+$$

subject to equation 6.16.

As is easily verified from equation 6.14, V⁺ is a decreasing
function of T; that is, longer punishments reduce
the expected profits. So we must choose the lowest possible
T consistent with equation 6.16. (The fact that equation
6.16 is not satisfied for T = 0 confirms our remark
that non-negligible punishments are required.) The right-hand
side of equation 6.16 increases with T if and only if
α < 1/2. Thus, if α ≥ 1/2, there exists no T satisfying
equation 6.16. The intuition for this is that the temptation to
undercut increases when the expected gains from future
collusion decrease. If α < 1/2, then the right-hand side of
equation 6.16 is maximal for T = +∞.

Assuming that (1 − α)δ ≥ 1/2 guarantees that the high
price is sustainable using maximal punishments (T = +∞).
It generalizes the condition for deterministic demand,
which corresponds to α = 0 (see section 6.3).

To maximize V⁺ subject to the incentive constraint
6.16, it thus suffices to choose the smallest T that satisfies
the incentive constraint. We thus obtain a (finite)
optimal length of punishment.⁴⁶
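
The search for the smallest admissible punishment length is easy to carry out numerically. The following sketch assumes only equations 6.14 and 6.16; the function names are hypothetical, the monopoly profit is normalized to 1 by default, and integer punishment lengths are imposed, so the randomization issue of the footnote is ignored. Passing `fractions.Fraction` arguments keeps boundary cases exact.

```python
from fractions import Fraction


def constraint_holds(alpha, delta, T):
    """Incentive constraint 6.16:
    1 <= 2(1 - alpha) delta + (2 alpha - 1) delta^(T+1)."""
    return 1 <= 2 * (1 - alpha) * delta + (2 * alpha - 1) * delta ** (T + 1)


def optimal_punishment_length(alpha, delta):
    """Smallest integer T satisfying 6.16, or None if no finite T does."""
    if not 0 < delta < 1:
        raise ValueError("delta must lie strictly between 0 and 1")
    # The right-hand side of 6.16 rises toward 2(1 - alpha) delta as T grows
    # (when alpha < 1/2), so a finite T exists only if that limit exceeds 1.
    if alpha >= Fraction(1, 2) or (1 - alpha) * delta <= Fraction(1, 2):
        return None
    T = 0
    while not constraint_holds(alpha, delta, T):
        T += 1
    return T


def collusive_value(alpha, delta, T, monopoly_profit=1):
    """V+ from equation 6.14."""
    denominator = 1 - (1 - alpha) * delta - alpha * delta ** (T + 1)
    return (1 - alpha) * monopoly_profit / 2 / denominator
```

As an illustration with values chosen here for convenience, α = 1/4 and δ = 9/10 give an optimal length of three periods, while δ = 2/3 sits exactly on the boundary (1 − α)δ = 1/2, where no finite punishment suffices. Comparing `collusive_value` at T = 3 and T = 4 confirms that lengthening the punishment beyond the minimum lowers V⁺.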


Exercise 6.8** Work your way through the Green-Porter
model with price competition and two i.i.d. states of
demand (no demand or positive demand). Rederive the
optimal length of punishment. Show that for a probability 
α = 1/4 of no demand, δ must exceed 2/3 in order for collusion
to be sustainable; show also that at least two periods 
of punishment are required. 


6.7.1.2 Quantity Competition 


Exercise 6.9*** Consider the Green-Porter model with
quantity as a choice variable. Assume that the market price
at time t is

$$p_t = \theta_t P(q_{1t} + q_{2t}),$$

where θₜ is an i.i.d. multiplicative demand shock at date t,
distributed according to the c.d.f. F. Firm i observes only
pₜ, not θₜ or qⱼₜ. Let

$$\Pi(q) = E[(\theta P(2q) - c)q]$$

denote the expected per-period profit when both firms
produce q, and let qᶜ denote the symmetric Cournot output
(which we assume is unique). Thus, qᶜ maximizes

$$\Pi(q_i, q^c) = E[(\theta P(q_i + q^c) - c)q_i].$$

Suppose that a price war is triggered when pₜ falls under
some threshold level p⁺. Thus, when the firms' outputs
are q₁ and q₂, the probability of a price war is

$$\alpha(q_1, q_2) = F\!\left(\frac{p^+}{P(q_1 + q_2)}\right).$$

Let q⁺ and qᶜ denote the per-firm outputs in the collusive
and punishment phases (q⁺ < qᶜ). Note the constraint
that punishments must be Cournot.
(i) Using the same notation as in the text, show that

$$V^+ = \frac{\Pi(q^c)}{1 - \delta} + \frac{\Pi(q^+) - \Pi(q^c)}{1 - (1 - \alpha^+)\delta - \alpha^+\delta^{T+1}},$$

where α⁺ = α(q⁺, q⁺).

(ii) Again using the same notation, show that the optimal
output q⁺, trigger price p⁺, and length of punishment
T are given by

$$\max_{\{q^+,\,p^+,\,T\}} V^+$$

subject to

$$\frac{\partial \Pi}{\partial q_1}(q^+, q^+) = \delta\,\frac{\partial \alpha}{\partial q_1}(q^+, q^+)\,\frac{\bigl[\Pi(q^+) - \Pi(q^c)\bigr](1 - \delta^{T})}{1 - (1 - \alpha^+)\delta - \alpha^+\delta^{T+1}}.$$

(The optimal T can be finite or infinite.)

(iii) Use intuitive reasoning to argue that q⁺ > qᵐ,
where qᵐ maximizes the expected value of (θP(2q) − c)q.
(Hint: Explain in terms of first- and second-order effects
starting at q⁺ = qᵐ.) Show that q⁺ converges to qᵐ when
the amount of noise tends to zero (the distribution of θ
becomes degenerate).


The Green-Porter equilibrium (described in exercise 
6.9) is optimal in a limited class of strategies. First, punish- 
ments are restricted to Cournot punishments, while worse 
punishments exist (some intuition about this may be 


46. A residual issue is that the root of equation 6.16 may not be an integer.
One may then take the lowest integer above the root, or use a
randomizing device to determine whether the length of punishment is this
integer or the highest integer under the root (where the probabilities are chosen
so that the constraint 6.16 is exactly binding).
gained from section 6.7.3). Second, the price war is based
on a “tail test.” It is triggered by a price pₜ lower than
some threshold level p⁺, so no use is made of the actual
level of pₜ beyond its comparison with p⁺. Thus, the
model presumes special forms for the collusive and punishment
phases. Abreu, Pearce, and Stacchetti (1985, 1986)
look at optimal collusive strategies without restrictions
on the class of strategies. Two assumptions that allow
these authors to obtain very crisp results are that the
firms' objective functions are concave and that the conditional
distribution of the market price pₜ, given aggregate
output Qₜ, satisfies the “monotone likelihood ratio property”
(MLRP). Informally, the latter condition means that
a low pₜ is more likely to result from a high Qₜ than from
a low one.⁴⁷

Abjeu et al. show that one can indeed restrict attention 
to a cbllusive phase and a punishment phase, characterized 
by payoffs V* and V~, where V* and V~ are now the 
best fw worst elements in the set of symmetric perfect- 
equilibrium payoffs. Furthermore, the collusive phase and 
the phinishment phase take simple forms. In the collu- 
sive phase, the firms produce output q*. The punishment 
phase is triggered by a tail test, i.e., it starts if the mar- 
ket ptice falls under some threshold level p*. Thus, the 
collus}ve phase is qualitatively similar to that presumed 
in Porter 1983b and in Green and Porter 1984. The 
pin phase, however, does not have a fixed length; 
rather} it resembles the collusive phase. The two firms 
produke (presumably high) output q~ each. If the market 
price pxceeds a threshold price p~, the game remains in 
the Ppa phase; if it lies below p~, the game goes 
back tp the collusive phase. Thus, the evolution between 
the two phases follows a Markovian process. The reader 
may be surprised by the “inverse tail test” in the punish- 
me hse The idea is that a harsh punishment requires 
a high output (higher than is even privately desirable); to 
ensurd that the firms produce a high output, it is specified 
that im the case of a high price (which signals a low 
eaoah the game remains in the punishment phase.** 
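The Markovian switching between the two phases can be illustrated with a two-state chain. This is only a sketch: the switching probabilities `a` (the tail test is triggered in the collusive phase) and `b` (the price falls below the punishment threshold, so the game returns to collusion) are hypothetical inputs, not quantities derived in the text, and the function names are invented for the illustration.

```python
def collusive_share(a, b, periods=10_000):
    """Long-run fraction of periods spent in the collusive phase.

    a: assumed probability that the tail test is triggered in the
       collusive phase (price below the collusive threshold).
    b: assumed probability that the price falls below the punishment
       threshold, which sends the game back to the collusive phase.
    """
    share = 1.0  # start in the collusive phase with probability 1
    for _ in range(periods):
        # stay collusive with prob. 1 - a, or return from punishment with prob. b
        share = share * (1.0 - a) + (1.0 - share) * b
    return share


def stationary_share(a, b):
    """Closed-form stationary probability of the collusive state."""
    return b / (a + b)
```

With these assumed inputs, the iteration and the closed form agree: a smaller `a` or a larger `b` raises the share of collusive periods.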


6.7.2 Price Rigidities and the Kinked Demand 
Curve 


The following is a technical analysis of the alternating-
move price game of section 6.4.


6.7.2.1 The One-Period-Deviation Criterion 


The reader may be puzzled by the statement in the text that
no single-period deviation from equilibrium behavior being
profitable is sufficient to ensure that strategies
(R_1, R_2) are optimal; indeed, when computing V^1(p) at
date t, we assumed that firm 1 would react at dates t + 2,
t + 4, ... according to the prescribed reaction function,
R_1(·). Could it be possible that a single deviation from
R_1(·) at date t is not profitable but that a sequence of
deviations from R_1(·) at dates t, t + 2, ... is? The answer
is No. (The following reasoning applies to much more general
games and, in particular, to the supergames considered in
section 6.3.) First, a finite sequence of deviations during
n periods cannot be profitable for the deviating firm if a
single-period deviation is not, for the nth deviation is
unprofitable because it amounts to a single-period
deviation. Thus, if the firm benefits from the n deviations,
it a fortiori benefits from the n − 1 first deviations. If
we eliminate the last deviation, the (n − 1)th deviation
becomes a single-period deviation and therefore is not
optimal, and so forth. Next, consider a profitable infinite
sequence of deviations from R_1(·). If it yields an extra
payoff of ε > 0 relative to following R_1(·) all the time,
then (because δ < 1 makes the distant future almost
insignificant in terms of payoffs) the first n deviations
alone yield an extra payoff of ε' > 0 for n sufficiently
large. However, we know that finite deviations will not do.
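The truncation step in this argument can be made explicit. The following is a sketch under an added assumption: per-period profits are bounded in absolute value by a constant M (a symbol introduced here, not in the text), and N counts periods from the date of the first deviation.

```latex
\[
\Bigl|\sum_{t \ge N} \delta^{t}\,
  \bigl(\pi_t^{\mathrm{dev}} - \pi_t^{\mathrm{trunc}}\bigr)\Bigr|
\;\le\; \frac{2M\,\delta^{N}}{1-\delta}
\;\longrightarrow\; 0
\quad\text{as } N \to \infty .
\]
```

Here the "dev" profits come from the infinite sequence of deviations and the "trunc" profits from deviating only during the first N periods. Choosing N such that 2Mδ^N/(1 − δ) < ε/2 gives the truncated (finite) sequence an extra payoff ε' ≥ ε/2 > 0, which is the contradiction used above.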


6.7.2.2 The Dynamic Programming Equations 


To simplify the notation, we look for the necessary and
sufficient conditions that correspond to a symmetric
equilibrium, that is, to conditions for which R_1 = R_2 = R.
Consequently, we no longer index the variables by the name
of the firm. We assume that there is a finite number of
possible prices p_h. R will be interpreted as a Markov
chain. That is, there is a finite number of possible states,
where the state refers to the price to which one of the
firms is currently committed. The reaction function defines
a transition from the current state to a new state, which is
the price chosen by the other firm in reaction to the
current state. Let α_hk ≥ 0 be the transition probability
that the firm (1 or 2) reacts to price p_h by charging
price p_k:

\[ \sum_k \alpha_{hk} = 1. \]

Finally, let Π(p_k, p_h) be the instantaneous profit of the
firm (1 or 2) when its price is p_k and the price of its
rival is p_h.

We introduce the valuation functions from dynamic
programming. V_h is the discounted value of the profit of a
firm that chooses its price when the other firm has chosen
p_h in the preceding period. The discounted value of the
profit of the second firm is denoted W_h. Using our
notation, we have

\[ V_h = \max_k \,[\Pi(p_k, p_h) + \delta W_k]. \]

This yields the following set of equations:

\[
\begin{aligned}
V_h &= \sum_k \alpha_{hk}\,[\Pi(p_k, p_h) + \delta W_k], \\
W_k &= \sum_l \alpha_{kl}\,[\Pi(p_k, p_l) + \delta V_l], \\
(V_h - \Pi(p_k, p_h) - \delta W_k)\,\alpha_{hk} &= 0, \\
V_h &\ge \Pi(p_k, p_h) + \delta W_k, \\
\sum_k \alpha_{hk} &= 1, \\
\alpha_{hk} &\ge 0.
\end{aligned}
\tag{6.17}
\]

The first two of these equations simply use the definitions
of V and W. The third is a relation of complementary
slackness. In order for price p_k to be an optimal reaction
to price p_h in the maximization exercise associated with
the first equation (α_hk > 0), the discounted value
corresponding to p_k, Π(p_k, p_h) + δW_k, must achieve a
maximum of V_h with respect to p_k.

The set of equations 6.17 has as unknowns {V_h, W_k, α_hk}.
We are interested only in α_hk, which determines the
reaction functions. (The system is known mathematically as a
complementary bilinear program.)

To check that the kinked-demand-curve strategies of
subsection 6.4.1 form an equilibrium, it suffices to compute
V_h and W_h for all h from the presumed strategies and check
that equations 6.17 are satisfied.

47. ... a similar use of this condition. Formally, let F(p_t | Q_t) denote the
cumulative distribution of the price conditional on the aggregate output, with
density f(p_t | Q_t). It satisfies the MLRP if, for Q' > Q, the likelihood
ratio f(p_t | Q')/f(p_t | Q) is decreasing in p_t.

48. If one restricted punishments to be of the Cournot type, the optimal length
of punishment would be T = +∞, from the Abreu-Pearce-Stacchetti result on
the harshest possible punishment V^-. This is to be contrasted with Porter's
(1983b) demonstration that if the previous assumptions do not hold, the
optimal length of Cournot punishments can be finite. This also differs from the
above price game, where (Bertrand) punishments are finite. The price game and
the quantity game are technically different: the quantity game exhibits a
commonly observed variable (the market price) whereas the price game does not.
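The check described above (compute V and W from presumed strategies, then verify equations 6.17) can be sketched numerically. Everything specific in this example is an assumption made for illustration: the demand D(p) = 1 − p, zero marginal cost, the price grid, and the function names `bertrand_profit`, `solve_values`, and `satisfies_617` are hypothetical and do not come from the text.

```python
def bertrand_profit(own, rival, cost=0.0):
    """Per-period profit with homogeneous goods and assumed demand D(p) = 1 - p."""
    demand = max(0.0, 1.0 - own)
    if own < rival:
        share = 1.0      # undercutting firm serves the whole market
    elif own == rival:
        share = 0.5      # equal prices split the market
    else:
        share = 0.0
    return share * (own - cost) * demand


def solve_values(prices, alpha, delta, profit=bertrand_profit, iters=2000):
    """Iterate the first two equations of the system for given alpha[h][k]."""
    n = len(prices)
    V = [0.0] * n
    W = [0.0] * n
    for _ in range(iters):
        # W_k: value of the firm committed to p_k while its rival reacts
        W = [sum(alpha[k][l] * (profit(prices[k], prices[l]) + delta * V[l])
                 for l in range(n)) for k in range(n)]
        # V_h: value of the firm reacting to p_h according to alpha
        V = [sum(alpha[h][k] * (profit(prices[k], prices[h]) + delta * W[k])
                 for k in range(n)) for h in range(n)]
    return V, W


def satisfies_617(prices, alpha, delta, profit=bertrand_profit, tol=1e-9):
    """Check optimality and complementary slackness for every pair (h, k)."""
    V, W = solve_values(prices, alpha, delta, profit)
    n = len(prices)
    for h in range(n):
        for k in range(n):
            payoff = profit(prices[k], prices[h]) + delta * W[k]
            if payoff > V[h] + tol:                      # V_h >= Pi + delta W_k fails
                return False
            if alpha[h][k] > 0 and payoff < V[h] - tol:  # slackness fails
                return False
    return True
```

As a usage example, on the grid p_h = h/6 the naive strategy "always charge p_3 = 1/2" yields V_3 = 0.125/(1 − δ) but fails the check, because undercutting to p_2 is a profitable deviation.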


Exercise 6.10** Compute the valuation functions for the
strategies described in subsection 6.4.1, compute α, and
check that the strategies are in equilibrium for δ close to 1.


6.7.2.3 Profits Are Bounded away from Zero 


In the text, it was stated that profits cannot be close to
the competitive profit in a Markov perfect equilibrium. Let
us show that in a symmetric equilibrium the average industry
profit per period must exceed Π(p^m)/2 (where
Π(p) = (p − c)D(p)) for δ close to 1. (A basically similar
proof shows that, even in an asymmetric equilibrium, at
least one firm must earn an average profit of no less than
Π(p^m)/4.)

Let V(p) and W(p) denote the present discounted value of
profits of the firm whose turn it is to choose a price and
of its rival. The price grid is assumed discrete⁴⁹ with size
k, where k is "small" (e.g., prices are denominated in
cents). Let p^m denote the monopoly price, and consider
price p^m + k. Let p* be the smallest price that solves

\[
\max\Bigl\{ \max_{p < p^m + k} [\Pi(p) + \delta W(p)],\;
\frac{\Pi(p^m + k)}{2} + \delta W(p^m + k),\;
\max_{p > p^m + k} \delta W(p) \Bigr\}.
\]

Then a firm's reaction to p^m + k is not lower than p*.

Case a: p* > p^m

In this case, starting from any price, each firm's payoff
when it plays is at least

\[ \delta^2 [\Pi(p^m - k) + \delta W(p^m - k)], \]

since it could raise its price to p^m + k and then, after
its rival's reaction (which exceeds p^m), undercut to
p^m − k. For the same reason, we have

\[ W(p^m - k) \ge \delta^3 [\Pi(p^m - k) + \delta W(p^m - k)]. \]

Thus,

\[ W(p^m - k) \ge \frac{\delta^3\,\Pi(p^m - k)}{1 - \delta^4} \]

and, hence,

\[ \Pi(p^m - k) + \delta W(p^m - k) \ge \frac{\Pi(p^m - k)}{1 - \delta^4}. \]

Thus, each firm's intertemporal profit is at least

\[ \frac{\delta^2}{1 + \delta + \delta^2 + \delta^3}\;\frac{\Pi(p^m - k)}{1 - \delta} \]

when it charges a price, and at least δ times this when it
has chosen its price last period. For δ close to 1, this
profit is approximately

\[ \frac{1}{4}\left(\frac{\Pi(p^m - k)}{1 - \delta}\right), \]

which amounts to a per-period profit

\[ \frac{\Pi(p^m - k)}{4} \]

close to

\[ \frac{\Pi(p^m)}{4} \]

for a fine price grid.

49. The reason for taking a discrete-price grid is technical. Even in a static
framework, a firm's best reaction to its rival's price is not well defined for
perfect substitutes and a continuous price grid (because this firm generally
would like to come as close as possible to its rival's price while strictly
undercutting it).
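The limit used in case a can be checked numerically. This is a minimal sketch; the function name `case_a_per_period_bound` and its arguments are hypothetical. It evaluates (1 − δ) times the lower bound δ²Π/(1 − δ⁴), i.e., the bound expressed as a per-period profit.

```python
def case_a_per_period_bound(delta, profit):
    """Per-period value of the bound delta^2 * profit / (1 - delta^4).

    `profit` stands for Pi(p^m - k); multiplying the discounted bound by
    (1 - delta) converts it into an average per-period profit.
    """
    return (1.0 - delta) * delta**2 * profit / (1.0 - delta**4)
```

For a discount factor such as 0.9 the value lies somewhat below one-fourth of the profit, and it approaches one-fourth as the discount factor tends to 1.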
Case b: p* ≤ p^m

In this case we have

\[ \Pi(p^*) + \delta W(p^*) \ge \Pi(p^m) + \delta W(p^m). \]

On the other hand,

\[ W(p^m) \ge \delta\,\frac{\Pi(p^*)}{2} + \delta^2 W(p^*). \]

The latter inequality holds because a firm that is committed
to p^m is at worst undercut to p* and can react to p* by
charging p* itself. (If its rival charges p > p*, the firm
makes at least

\[ \delta\,\Pi(p^*) + \delta^2 W(p^*) \]

by undercutting to p*.) Multiplying the first inequality by
δ and adding the two inequalities yields

\[
(1 - \delta)\,W(p^m) \ge \frac{\delta}{1 + \delta}
\left( \Pi(p^m) - \frac{\Pi(p^*)}{2} \right).
\]

From the definition of the monopoly price, we have

\[ \Pi(p^m) - \frac{\Pi(p^*)}{2} \ge \frac{\Pi(p^m)}{2} \]

and

\[ \frac{\delta}{1 + \delta} \simeq \frac{1}{2} \]

for δ close to 1. Thus, again, the average profit per period
and per firm exceeds one-fourth of the monopoly profit.
It can further be shown that in a kinked-demand- 
curve equilibrium, each firm’s per-period profit exceeds 
$11(p™) for 6 close to 1. Conversely, the set of focal 
prices (steady states for some kinked-demand-curve equi- 
librium) is exactly the set of prices p under p™ such that 
T(p) > $1(p™) and above p™ such that II(p) > 4 11(p™). 
Thus, even though there is no approximately competitive 
equilibrium, there is a large number of equilibria. 


6.7.2.4 The Renegotiation-Proof Equilibrium 

Let p̲ denote the price under p^m such that

\[
(1 + \delta)\,\Pi(\underline{p}) \ge \frac{\delta\,\Pi(p^m)}{2}
> (1 + \delta)\,\Pi(\underline{p} - k)
\]

(where k is, as before, the size of the price grid). Note
that for δ close to 1

\[ \Pi(\underline{p}) \simeq \Pi(p^m)/4, \]

and consider the following symmetric reaction function:

\[
R^*(p) =
\begin{cases}
\underline{p} & \text{if } \underline{p} < p < p^m, \\
p^m & \text{otherwise.}
\end{cases}
\]


Exercise 6.11** Show that (R*, R*) forms a Markov perfect
equilibrium for δ close to 1.

(R*, R*) turns out to be the unique pair of equilibrium
reaction functions that yield average industry profit close
to Π(p^m) for δ close to 1 (see Maskin and Tirole 1988).
This property trivially implies that this is the unique
symmetric renegotiation-proof equilibrium for δ close to 1.
(It is renegotiation-proof because any other equilibrium
yields lower aggregate profit, and therefore a lower profit
for at least one firm; furthermore, starting from any price,
any other symmetric Markov perfect equilibrium yields less
profit than this one to both firms, who therefore would have
an incentive to "renegotiate" to move to this equilibrium.)


6.7.3 Folk Theorems


In this subsection we recall the existing versions of the
Folk theorem.


6.7.3.1 Infinitely Repeated Games of Complete 
Information 


Consider an n-person "static" game defined by action spaces
A_i for each player i = 1, ..., n, and payoff functions
Π^i(a_1, ..., a_i, ..., a_n) for each player i, where a_i
belongs to A_i. Suppose for simplicity that the set of pure
strategies is finite (for instance, in a price game, prices
have to be denoted in cents, are non-negative, and are
bounded above by a large number). We do not distinguish
between pure and mixed strategies, so that we can think of
A_i as the set of probability distributions (mixed
strategies) over the pure strategies available to player i.
(It is also technically convenient to assume that the
players can play correlated strategies, i.e., that they can
make their actions contingent on the outcome of a public
randomizing device; but we will ignore this here.) The
static game is often called the "constituent game." We will
use the notation

\[ a_{-i} \equiv (a_1, \ldots, a_{i-1}, a_{i+1}, \ldots, a_n) \]

and Π^i(a_i, a_{-i}) for player i's profit.

We define player i's reservation utility as the worst
outcome player i can be forced to take in this game:

\[ \Pi^{i*} = \min_{a_{-i}} \max_{a_i} \Pi^i(a_i, a_{-i}). \]

Anticipating actions a_{-i} by its rivals, player i
maximizes Π^i(a_i, a_{-i}) in a static framework. Clearly,
player i cannot get less than Π^{i*} in the constituent game
(or less than Π^{i*} "on average" if the constituent game is
repeated over time).

A payoff vector Π = (Π^1, ..., Π^n) is individually rational
if, for all i, Π^i ≥ Π^{i*}. It is feasible if there exist
feasible strategies a = (a_1, ..., a_i, ..., a_n) such that,
for all i, Π^i = Π^i(a).

For instance, in the Bertrand price game or the Cournot
quantity game, the individually rational profits are equal
to zero. (Firms cannot be forced to make negative profits;
on the other hand, one's opponents' charging a zero price or
producing quantities that make the price fall below one's
marginal cost prevents the realization of profit.) It is
easy to check that any set of profits whose sum does not
exceed the monopoly profit is feasible.

Consider the infinitely repeated version of the constituent
game. Let δ denote the discount factor. Player i then has
payoff

\[ V^i \equiv \sum_{t=0}^{\infty} \delta^t\,\Pi^i(a_1(t), \ldots, a_n(t)) \]

and average payoff

\[ v^i = (1 - \delta)\,V^i, \]

where a_i(t) denotes the action chosen by i at t (which is a
function of past history).

Our first Folk theorem is due to Friedman (1971). It states
that any average payoff vector that is better for all
players than a Nash-equilibrium payoff vector of the
constituent game can be sustained as the outcome of a
perfect equilibrium of the infinitely repeated game if the
players are sufficiently patient. More precisely, let

\[ \Pi^{iN} = \Pi^i(a_1^N, \ldots, a_n^N) \]⁵⁰

and let v = (v^1, ..., v^n) such that v is feasible and
v^i > Π^{iN} for all i. Then there exists δ_0 < 1 such that,
for all δ ≥ δ_0, v is an equilibrium payoff vector.

The proof of this Folk theorem is basically the same as that
in the text. For expositional simplicity, suppose that there
exist pure strategies a = (a_1, ..., a_n) such that
v^i = Π^i(a_1, ..., a_n) for all i. Specify the following
behaviors:


Each player plays a_i as long as all players have stuck to
strategies a before. If someone has deviated in the past,
the player plays a_i^N. Thus, collusion on a is enforced by
Nash threats, i.e., threats to revert to Nash behavior
forever. By deviating today, a player gains at most a
bounded amount; on the other hand, he loses the gain from
future cooperation:

\[ (v^i - \Pi^{iN})(\delta + \delta^2 + \cdots), \]

which tends to infinity as δ tends to 1.

50. That is, Π^i(a_i^N, a_{-i}^N) ≥ Π^i(a_i, a_{-i}^N) for all i and a_i in A_i.
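The comparison between the one-shot gain and the lost future cooperation pins down the threshold discount factor when punishment is Nash reversion. The sketch below assumes stationary per-period payoffs; the function name and argument names are hypothetical. The gain from deviating, deviation − collusive, must not exceed (collusive − nash)·δ/(1 − δ), which rearranges to δ ≥ (deviation − collusive)/(deviation − nash).

```python
def critical_discount_factor(collusive, deviation, nash):
    """Smallest delta for which Nash reversion deters a one-shot deviation.

    collusive: per-period payoff on the collusive path
    deviation: best one-period payoff from deviating
    nash:      per-period payoff in the Nash equilibrium used as the threat
    """
    if not (deviation >= collusive > nash):
        raise ValueError("expected deviation >= collusive > nash")
    return (deviation - collusive) / (deviation - nash)
```

Two standard illustrations (both are assumptions, not taken from the text): a symmetric Bertrand duopoly sharing a monopoly profit normalized to 1 has payoffs (1/2, 1, 0) and threshold 1/2; a Cournot duopoly with demand P = 1 − Q and zero cost has payoffs (1/8, 9/64, 1/9) and threshold 9/17.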


For the Bertrand game (which is potentially one of the most
interesting applications of the theory of long repeated
interaction, because prices are locked in for short periods
of time), this theorem gives a full description of the set
of equilibria for δ close to 1. This is because if firms
have identical marginal costs, the Nash equilibrium of the
constituent game yields zero profits and thus yields the
reservation payoffs. The previous theorem then shows that
all individually rational and feasible payoffs are
equilibrium payoffs for δ close to 1. For other constituent
games (e.g., Cournot competition), the Nash points do not
yield reservation values (see figure 6.5).

Figure 6.5
Nash threats. (Panels: Bertrand competition; Cournot
competition. Marked region: equilibrium payoffs enforceable
through Nash threats.)

In games in which Nash equilibria lie above the reservation
values, the question arises whether other equilibrium payoff
vectors beyond those given by the previous theorem can be
enforced. The answer is that every individually rational and
feasible payoff vector can be enforced in perfect
equilibrium. Aumann and Shapley (1976) and Rubinstein (1979)
show this in the case δ = 1.⁵¹ The intuition as to why any
payoffs above reservation values can be sustained is as
follows:

As long as everyone has previously conformed, players
continue to play their a_i's, leading to payoff v_i. If some
player j deviates, he is, as before, minimaxed but, rather
than forever, only long enough to wipe out any possible gain
that he obtained from this deviation. After this punishment,
the players go back to their a_i's. To induce the punishers
to go through with their minimaxing, they are threatened
with the prospect that, if any one of them deviates from his
punishment strategy, he in turn will be minimaxed by the
others long enough to make such a deviation not worthwhile.
Moreover, his punishers will be punished if any one of them
deviates, etc. Thus, there is a potential sequence of
successively higher-order punishments where the punishment
at each level is carried out for fear the punishment at the
next level will be invoked. (Fudenberg and Maskin 1986a,
p. 538)


Fudenberg and Maskin (1986) show that, under a mild
regularity condition, lower hemi-continuity holds at δ = 1:
All individually rational and feasible payoffs can be
sustained in perfect equilibrium for δ sufficiently close
to 1.


6.7.3.2 Finitely Repeated Games of Complete
Information with Multiple Equilibria in the
Constituent Game


We saw in the text that even with a long horizon, collu- 
sion cannot be sustained in the finitely repeated Bertrand 
price game (or prisoner's dilemma). However, when the 
constituent game has several Nash equilibria, it is possible 
to play with the various Nash equilibria and obtain equi- 
libria other than the Nash equilibria in the repeated version 
of the game. This is illustrated for the constituent game 
(game of coordination) in figure 6.6. 

51. In the absence of discounting, the sum of a player's payoffs over time may
not be finite (may be infinite). One can then use either the limit (infimum) of
the average payoff

\[ \frac{1}{T} \sum_{t=1}^{T} \Pi^i(a(t)) \]

when T tends to infinity, or the so-called "overtaking criterion" (see
Rubinstein 1979).

Figure 6.6

This game has two pure-strategy Nash equilibria: (U, L) and
(D, R). Suppose that this game is repeated three times and
that there is no discounting. Then (D, L), yielding zero
payoff to both players in the first period, can be sustained
in that period by the promise of coordinating on (U, L) in
the two subsequent periods if both players comply, and by
the threat of coordinating on (D, R) in the two subsequent
periods if anyone deviates in the first period. Because

\[ 5 + 1 + 1 < 0 + 5 + 5, \]

(D, L) can thus be sustained in the first period.

Benoit and Krishna (1985) show that, under some conditions,
the set of equilibria of a repeated game with multiple
equilibria in the constituent game converges to the set of
individually rational and feasible outcomes.
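The three-period argument can be checked mechanically. Note the assumption: the payoff matrix of figure 6.6 is not reproduced here, so the numbers below are hypothetical values chosen only to be consistent with the inequality 5 + 1 + 1 < 0 + 5 + 5 (payoff 5 at (U, L), 1 at (D, R), 0 elsewhere). The function name is also invented for the illustration.

```python
# Hypothetical payoffs (row player, column player); an assumption, not figure 6.6.
PAYOFFS = {
    ("U", "L"): (5, 5), ("U", "R"): (0, 0),
    ("D", "L"): (0, 0), ("D", "R"): (1, 1),
}


def first_period_play_sustainable(periods, target=("D", "L"),
                                  reward=("U", "L"), punish=("D", "R")):
    """True if `target` can be played in period 1 of an undiscounted game.

    After period 1 the players coordinate on `reward` if both complied and on
    `punish` otherwise; both continuation profiles are Nash equilibria of the
    assumed constituent game.
    """
    row, col = target
    later = periods - 1
    # Best one-period payoff each player can get by deviating unilaterally.
    best_row = max(PAYOFFS[(r, col)][0] for r in ("U", "D"))
    best_col = max(PAYOFFS[(row, c)][1] for c in ("L", "R"))
    for player, best in ((0, best_row), (1, best_col)):
        comply = PAYOFFS[target][player] + later * PAYOFFS[reward][player]
        deviate = best + later * PAYOFFS[punish][player]
        if deviate > comply:
            return False
    return True
```

Under these assumed payoffs, three repetitions are enough to sustain (D, L) in the first period, whereas two are not (the row player would compare 5 + 1 with 0 + 5).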


6.7.3.3 Finitely Repeated Games of Incomplete
Information


Here we consider the generalization of the model in
subsection 6.4.1. Consider a finitely repeated game in
which, with probability 1 − α, player i is "sane" and is
described by the action space A_i and the payoff
Π^i(a_1, ..., a_n) in each period, and in which player i is
"crazy" with probability α. (This player's preferences or
strategy are then at the disposal of the modeler. See
below.)

Fudenberg and Maskin (1986) show that, under a mild
condition, the Folk theorem applies, in the sense that any
individually rational and feasible payoff vector of the
constituent game (for sane players) can be sustained as a
perfect Bayesian equilibrium of the finitely repeated game
for arbitrarily small probability α, as long as the horizon
is sufficiently large and the discount factor is
sufficiently close to 1.

The proof that any payoff that exceeds a Nash payoff can be
sustained is, as usual, simple (see also section 6.5.1). Let
us sketch it. Let a^N = (a_1^N, ..., a_n^N) denote a Nash
equilibrium of the constituent game when players are sane
with probability 1. Let Π^{iN} denote the corresponding
payoffs, let

\[ v^i = \Pi^i(a_1, \ldots, a_n) > \Pi^{iN}, \]

and let the crazy player i play a_i as long as all players
have done so in the past and a_i^N if anyone has deviated in
the past. The one-period payoff from deviating for a sane
player is bounded above. At the same time, the loss of
future cooperation with the crazy players is
α^{n−1}(v^i − Π^{iN})T if T is the horizon and if, for
simplicity, there is no discounting (δ = 1). This loss goes
to infinity as T tends to infinity. Hence, for T ≥ T_0 it
cannot be optimal for player i to deviate from a_i even if
he is sane. More generally, collusion on a is sustained
during at least T − T_0 periods, which means that the
average payoff for a sane player i tends to v^i when T tends
to infinity.

As usual, this proof yields the full Folk theorem for the
Bertrand game. For more general games, the proof of the Folk
theorem is more involved; see Fudenberg and Maskin 1986a.


Answers and Hints 


Exercise 6.1⁵²


(i) Let Φ_i(p) ≡ (p − c_i)D(p). Note that Φ_i is (by
assumption) concave, increases up to the monopoly price
p_i^m, and decreases thereafter. After substituting s_2, one
gets

\[ \Pi^1 = \max_p\; \Phi_1(p)\left(1 - \frac{\Pi^2}{\Phi_2(p)}\right). \]

The first-order condition is

\[
\Phi_1'(p)\left(1 - \frac{\Pi^2}{\Phi_2(p)}\right)
= -\,\frac{\Pi^2\,\Phi_1(p)\,\Phi_2'(p)}{\Phi_2^2(p)}.
\]

This implies that Φ_1' and Φ_2' have opposite signs. Because
p_1^m < p_2^m, we have Φ_1' < 0 < Φ_2' and
p_1^m < p < p_2^m. The second derivative of the objective
function is

\[
\Phi_1''(p)\left(1 - \frac{\Pi^2}{\Phi_2(p)}\right)
+ \frac{\Phi_1(p)\,\Pi^2\,\Phi_2''(p)}{\Phi_2^2(p)}
- \frac{2\,\Phi_1(p)\,\Pi^2\,(\Phi_2'(p))^2}{\Phi_2^3(p)}
+ \frac{2\,\Phi_1'(p)\,\Phi_2'(p)\,\Pi^2}{\Phi_2^2(p)}.
\]

The first three terms in this expression are negative. The
fourth term is negative when the first-order condition is
satisfied. Hence, the objective function is quasi-concave,
and we do have an optimum.

(ii) This is straightforward.

(iii) Take the derivative of the first-order condition with
respect to p and Π^2. Writing

\[ s_2 = -\,\frac{(p - c_2)\,\Phi_1'(p)}{(c_2 - c_1)\,D(p)} \]

and taking the derivative shows that s_2 is an increasing
function of p and therefore of Π^2. When the profit target
for firm 2 is zero, the highest profit for firm 1 is its
monopoly profit and is obtained for p = p^m(c_1) and
s_2 = 0. Conversely, in order for Π^2 = Π^m(c_2) to be
obtained, the price must be p = p^m(c_2), and firm 2 must
serve the whole market. More generally, there is a trade-off
between aggregate efficiency (which requires p = p^m(c_1)
and s_2 = 0) and sharing of profits. The higher the profit
target for firm 2, the higher the market price and the
higher firm 2's market share. See figure 6.7.


Figure 6.7
Efficient market-sharing allocations. (Panels: price-market
share relation; (constrained) Pareto frontier.)

(iv) From the envelope theorem,

\[ \frac{\partial \Pi^1}{\partial \Pi^2} = -\,\frac{\Phi_1(p)}{\Phi_2(p)}. \]

Using the chain rule,

\[
\frac{\partial^2 \Pi^1}{\partial (\Pi^2)^2}
= \left( \frac{-\Phi_1'(p)\,\Phi_2(p) + \Phi_1(p)\,\Phi_2'(p)}{\Phi_2^2(p)} \right)
\frac{dp}{d\Pi^2} > 0,
\]

because

\[ \Phi_1'(p) < 0 < \Phi_2'(p), \]

and from question iii,

\[ \frac{dp}{d\Pi^2} > 0. \]

Schmalensee (1987) uses axiomatic bargaining theory to
select a point on this convex Pareto frontier. He observes
that, if the leading firm's cost advantage is substantial,
its likely gains from collusion are relatively small. (In
the extreme case, if the low-cost firm's monopoly price is
lower than its rival's marginal cost, the low-cost firm
cannot gain from colluding.)

(v) Question iv implies that our "efficient market-sharing
allocation" is optimal only in the class of deterministic
allocations. The firms could obtain higher expected payoffs
by "tossing a coin" to decide who will be the monopolist.
More formally, the firms could achieve any point on the
straight line between A and B in figure 6.7 by letting one
of them be a monopolist depending on the realization of a
random variable. Alternatively, in a repeated-game context
with little impatience (so that the firms' objective is
approximately the average of their profits; see section
6.3), the firms could take turns being the monopolist.

(vi) Suppose that firm 1 sells q_1 at price p_1 < p_2.
Firm 2's residual demand is D(p_2) − q_1 under efficient
rationing. This demand would be left unaffected if firm 1
raised its price. Hence, firm 1 could continue selling q_1
at a price higher than p_1 without hurting firm 2.

52. This exercise is derived from Bishop 1960 and Schmalensee 1987.


Exercise 6.2


We content ourselves with showing that any payoff
(Π^1, Π^2) can be approximated as closely as desired for δ
close to 1. Choose a price p in [c, p^m] such that
Π(p) = Π^1 + Π^2, and let Π^1 = αΠ(p) and
Π^2 = (1 − α)Π(p). Consider the ratio α/(1 − α). We know
that any real number can be approximated as closely as is
wished by a rational number. Let m/n denote a rational
approximation of α/(1 − α). Consider the following
strategies: "During the first m periods, firm 1 charges p
and firm 2 charges a price strictly exceeding p; for the n
subsequent periods, firm 2 charges p and firm 1 charges a
price strictly exceeding p; during the m subsequent periods,
it is firm 1's turn to have the market at price p; and so
forth. If anyone deviates, the firms charge the marginal
cost forever." These strategies clearly form an equilibrium
for δ close to 1. Furthermore, the per-period payoff for
firm 1 is

\[
(1 - \delta)\bigl[(1 + \delta + \cdots + \delta^{m-1})
+ (\delta^{m+n} + \cdots + \delta^{2m+n-1}) + \cdots\bigr]\Pi(p)
= \frac{1 + \delta + \cdots + \delta^{m-1}}{1 + \delta + \cdots + \delta^{m+n-1}}\,\Pi(p)
\simeq \frac{m}{m+n}\,\Pi(p) \simeq \alpha\,\Pi(p)
\]

for δ close to 1.
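The per-period payoff of the turn-taking scheme can be verified numerically. This is a sketch with hypothetical function names: one function sums the discounted profits period by period over a long but finite horizon, the other uses the closed form (1 − δ^m)/(1 − δ^(m+n)), which equals the ratio of geometric sums in the answer above.

```python
def turn_taking_average(m, n, delta, profit=1.0, horizon=5000):
    """(1 - delta) * discounted sum of firm 1's profits, period by period.

    Firm 1 serves the market during the first m periods of every cycle of
    length m + n; the horizon truncates the infinite sum.
    """
    cycle = m + n
    total = 0.0
    for t in range(horizon):
        if t % cycle < m:          # firm 1's turn to have the market
            total += delta**t * profit
    return (1.0 - delta) * total


def turn_taking_closed_form(m, n, delta, profit=1.0):
    """Closed form of the same average payoff."""
    return (1.0 - delta**m) / (1.0 - delta**(m + n)) * profit
```

With m = 2 and n = 3, for example, the average payoff approaches 2/5 of the per-period profit as the discount factor approaches 1.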
Exercise 6.3⁵³


The highest possible per-period profit, Π̄, is the same for
both firms (because the game is symmetric, the set of
attainable per-period profits is also symmetric). Consider
an equilibrium in which firm 1, say, gets a per-period
profit Π̄ − ε (where ε is positive and small), and consider
a price p such that Π(p) ≥ Π̄ − ε, price p is the lowest
price charged for some period t, and firm 1 makes profits
s_1Π(p) ≥ Π̄ − ε in that period. (Such a price and period
must exist; otherwise firm 1 could not make Π̄ − ε "on
average.") At t firm 2 ought to deviate and charge slightly
less than p. By doing so, it would gain s_1Π(p) at date t
(because it would capture the whole market). But the loss of
future collusion costs at most

\[ \bar{\Pi}\,(\delta + \delta^2 + \cdots) = \frac{\delta\,\bar{\Pi}}{1 - \delta}. \]

It suffices to choose ε such that

\[ \frac{\bar{\Pi} - \varepsilon}{\bar{\Pi}} > \frac{\delta}{1 - \delta} \]

to obtain the contradiction.

53. The following proof is from Bernheim and Whinston 1986.
Exercise 6.4 


The monopoly price is sustainable if

\[
\Pi^m\left(1 - \frac{1}{n}\right)
\le \frac{\Pi^m}{n}\,(\delta\mu + \delta^2\mu^2 + \cdots).
\]

(The LHS is the gain from deviating; the RHS is the
long-term loss.) That is,

\[ \delta\mu \ge 1 - \frac{1}{n}. \]

For a given δ, this condition is more easily satisfied if
the market is expanding. (The intuition is that the future
weighs more heavily in such circumstances.)
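The sustainability condition can be evaluated directly. The sketch below assumes that demand grows by a constant factor μ per period (so that μ > 1 is an expanding market), that n firms share the monopoly profit equally, and that deviations are punished by zero profits forever; these assumptions and the function name are illustrative, and the monopoly profit is normalized to 1.

```python
def monopoly_price_sustainable(delta, mu, n):
    """Compare the one-period gain from undercutting with the discounted loss.

    delta: discount factor; mu: assumed per-period growth factor of demand;
    n: number of firms sharing the market equally.
    """
    growth = delta * mu
    if growth >= 1.0:
        raise ValueError("delta * mu must be below 1 for the sum to converge")
    gain = 1.0 - 1.0 / n                          # capture the rivals' shares today
    loss = (1.0 / n) * growth / (1.0 - growth)    # forgone future collusive profits
    return gain <= loss
```

For a duopoly, a discount factor of 0.4 does not sustain the monopoly price in a stationary market, but it does if demand grows by 30 percent per period (0.4 × 1.3 = 0.52, which exceeds 1/2).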


Exercise 6.5 


Let {p*, s_1*} denote an efficient market-sharing
allocation, and let

\[ \Pi^{1*} = s_1^*\,D(p^*)(p^* - c_1) \]

and

\[ \Pi^{2*} = (1 - s_1^*)\,D(p^*)(p^* - c_2) \]

denote the corresponding per-period profits. Consider the
following strategies: "Each firm i charges p* and produces
s_i*D(p*) as long as both have complied with this rule
earlier. If anyone has deviated in the past, both firms
revert to Bertrand behavior forever."

Look at the most profitable deviation from equilibrium. For
firm 1 it consists of undercutting to its monopoly price.
Thus, firm 1 makes a short-term gain equal to
Π^{1m} − Π^{1*}, where Π^{1m} = max_p [D(p)(p − c_1)]. The
long-term loss is

\[ \delta\,(\Pi^{1*} - (c_2 - c_1)D(c_2))/(1 - \delta), \]

where (c_2 − c_1)D(c_2) is firm 1's profit in the Bertrand
equilibrium. Thus, the market-sharing allocation must
satisfy

\[ \Pi^{1m} - \Pi^{1*} \le \delta\,(\Pi^{1*} - (c_2 - c_1)D(c_2))/(1 - \delta). \tag{1} \]

Note that, for a given δ, this is satisfied if and only if
Π^{1*} exceeds some given level, or s_1* ≥ s_1(δ) > 0, where
s_1(δ) is defined by (1) (recall that Π^{1*} is a linear
function of s_1*).

Firm 2's optimal deviation from p* is to undercut slightly
and capture the whole market (because p* < p^m(c_2)). It
then realizes a short-term gain of almost
s_1*D(p*)(p* − c_2). The long-term loss is
δΠ^{2*}/(1 − δ), because the firm makes no profit in the
Bertrand equilibrium. Thus, we must have

\[ s_1^*\,D(p^*)(p^* - c_2) \le \delta\,(1 - s_1^*)\,D(p^*)(p^* - c_2)/(1 - \delta); \]

that is,

\[ s_1^*/(1 - s_1^*) \le \delta/(1 - \delta), \]

or

\[ s_1^* \le \delta. \]

Hence, the efficient market-sharing allocation is
sustainable in equilibrium if s_1(δ) ≤ s_1* ≤ δ.⁵⁴ This
naturally means that an efficient market-sharing arrangement
can be sustained only if it is "not too unfair" to one of
the firms.

As was noted in section 6.2, the firms could do even better
by taking turns being monopolists, because the Pareto
frontier in the payoff space is convex. For instance, firm 1
could supply the whole demand at p_1^m in odd periods and
firm 2 the whole demand at price p_2^m in even periods. (In
case of deviation, the firms would revert to competitive
behavior.) The per-period profits would then be
approximately Π_1^m/2 and Π_2^m/2, respectively, for δ close
to 1.


Exercise 6.6 


(i) The implicit discount factor in market 2 is δ²: a firm
can deviate during two consecutive periods without being
detected.

(ii) The optimal deviation is to start with market 2 and
then deviate on both markets in the following period (a
deviation in market 1 triggers punishment in the following
period). Thus, the maximal gain from deviating is

\[ \frac{\Pi^m}{2}\,(1 + 2\delta). \]

The loss is

\[ \frac{\delta^2\,\Pi^m}{1 - \delta}, \]

because the deviation is detected with a two-period lag, and
collusive profits are Π^m/2 per market.
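Comparing the gain and the loss gives the discount factor above which collusion on both markets is sustainable. The rearrangement below is a worked step added for illustration, with hypothetical function names: (Π^m/2)(1 + 2δ) ≤ δ²Π^m/(1 − δ) is equivalent to 4δ² − δ − 1 ≥ 0, whose positive root is (1 + √17)/8.

```python
import math


def deviation_gain(delta, monopoly_profit=1.0):
    """Maximal gain: deviate in market 2 today, then in both markets next period."""
    return monopoly_profit / 2.0 * (1.0 + 2.0 * delta)


def deviation_loss(delta, monopoly_profit=1.0):
    """Collusive profits on both markets lost from two periods ahead onward."""
    return delta**2 * monopoly_profit / (1.0 - delta)


def critical_delta():
    """Positive root of 4*delta**2 - delta - 1 = 0."""
    return (1.0 + math.sqrt(17.0)) / 8.0
```

The threshold is roughly 0.64, above the value 1/2 that applies to a single market with immediate detection.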


Exercise 6.7 


Making price quotations public reduces the scope for 
bribery and favoritism. However, it may help the partici- 
pating firms to tacitly collude by giving the industry 
information about price cuts. At least, so runs the conven- 
tional wisdom: “It would ... be hard to find a device 
less calculated to foster open and aggressive competi- 
tion among [oligopolistic] sellers.” (Cook 1963) (See also 
Scherer 1980, p. 224.) 

Things are more complicated in fact. The theory devel- 
oped in this section and in subsection 6.7.1 assumes a 
random and unobservable demand, whereas in the case of 
public bids the demand is generally known to all parties 
(so firms may learn that undercutting took place by 
observing who wins, even if bids are kept secret). More 
work seems warranted to assess this facet of the conven- 
tional wisdom. 


54. Such s_1* may not exist. But, as can be checked, it does exist as long as
Π^{1m} > (c_2 − c_1)D(c_2)
and δ is sufficiently close to 1.

Exercise 6.8

See section 6.7. For α = 1/4, T is the lowest time such that
3δ − δ^{T+1} ≥ 2. Hence, δ > 2/3 if this condition is to be
satisfied for some T. For T = 1, it is not satisfied unless
δ = 1.
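The lowest admissible punishment length can be found by direct search. This sketch takes the condition in the form 3δ − δ^(T+1) ≥ 2 (the case α = 1/4) as given; the function name and the search cap `max_T` are hypothetical.

```python
def shortest_punishment(delta, max_T=10_000):
    """Smallest integer T with 3*delta - delta**(T + 1) >= 2, or None if none exists."""
    if delta <= 2.0 / 3.0:
        return None  # even an infinite punishment would not satisfy the condition
    for T in range(1, max_T + 1):
        if 3.0 * delta - delta**(T + 1) >= 2.0:
            return T
    return None
```

The required length falls as firms become more patient: it is 4 periods for a discount factor of 0.8 and 3 periods for 0.9.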

Exercise 6.9 


See Green and Porter 1984.
Exercise 6.10 


(All payoffs are multiplied by 36.)

\[
\begin{aligned}
V_6 = V_5 = V_4 &= 9 + \delta\,\frac{4.5}{1 - \delta} = 4.5\,\frac{2 - \delta}{1 - \delta}, \\
V_3 &= \frac{4.5}{1 - \delta} = W_3, \\
V_2 &= 5 + \delta W_1, \\
\frac{4.5\,\delta}{1 - \delta} &= W_6 = W_5 = W_4 = W_0, \\
W_1 &= \alpha\left(5 + \frac{4.5\,\delta}{1 - \delta}\right) + (1 - \alpha)(2.5 + \delta V_1).
\end{aligned}
\]

The probability α is such that V_1 = 2.5 + δW_1 (each firm
is indifferent between staying at p_1 and relenting to the
monopoly price). Thus,

\[ \alpha = (4\delta + 9\delta^2 - 5)/(5\delta + 9\delta^2) \]

(that is, α is approximately equal to 4/7 for δ close to 1).

Checking that the strategies form an equilibrium is a bit
tedious. We saw in the text that undercutting from p_3 to
p_2 is not profitable. Let us simply show that at p_2 a firm
prefers to undercut to p_1 rather than to revert to the
monopoly price. By undercutting, it gets

\[
5 + \delta W_1 = 5 + (V_1 - 2.5) = 2.5 + V_1
= 2.5 + \frac{4.5\,\delta}{1 - \delta} > \frac{4.5\,\delta}{1 - \delta},
\]

which is what it would get by relenting.
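The formula for α can be checked against the indifference condition at p_1. The sketch below relies on assumptions about the strategies of subsection 6.4.1, which is not reproduced here: payoffs are scaled by 36, relenting from p_1 to the monopoly price p_3 yields δ·4.5/(1 − δ), and a firm committed to p_1 earns 5 if its rival relents and 2.5 if its rival stays at p_1. The function names are hypothetical.

```python
def mixing_probability(delta):
    """alpha = (4*delta + 9*delta**2 - 5) / (5*delta + 9*delta**2)."""
    return (4 * delta + 9 * delta**2 - 5) / (5 * delta + 9 * delta**2)


def indifference_gap(delta):
    """Difference between staying at p_1 and relenting to p_3 (should be zero)."""
    v3 = 4.5 / (1 - delta)            # both firms share the monopoly profit forever
    v1_relent = delta * v3            # no profit today, then the shared monopoly profit
    a = mixing_probability(delta)
    # value of being committed to p_1 while the rival mixes between p_3 and p_1
    w1 = a * (5 + delta * v3) + (1 - a) * (2.5 + delta * v1_relent)
    v1_stay = 2.5 + delta * w1
    return v1_stay - v1_relent
```

Under these assumptions the gap vanishes for any discount factor at which α is a valid probability, and α equals 4/7 in the limit δ = 1.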
Exercise 6.11


See Maskin and Tirole 1988.
Chapter 7


Product Differentiation:
Price Competition and
Non-Price Competition


One of the crucial assumptions behind the Bertrand para- 
dox (chapter 5) is that firms produce a homogeneous 
product. Hence, price is the only variable of interest to
consumers, and no firm can raise its price above marginal 
cost without losing its entire market share. In practice, 
such an assumption is unlikely to be satisfied. Some 
consumers will prefer buying the firm's brand even 
at a small premium because it is available at a closer 
store, can be delivered sooner, or comes with superior 
post-sale services; some other consumers will remain 
faithful to the high-price firm because they are unaware 
of the existence of other brands; still other consumers 
will be concerned that alternative brands do not have 
the same quality or will not satisfy their preferences 
as well. In short, the products are differentiated; the 
cross-elasticity of demand is not infinite at equal prices.
As was noted in chapter 5, it is clear that a price above 
marginal cost can (and therefore will) be sustained under 
product differentiation. Diversity prevents unbridled 
competition for customers, even in a nonrepeated rela- 
tionship. This chapter addresses two main issues: the 
determination of prices under differentiated products 
(assuming a nonrepeated interaction) and the choice of 
products in oligopoly. This chapter thus extends the anal- 
ysis of chapter 2 by including the effects of strategic 
interaction. 

Prices can often be adjusted faster than product charac- 
teristics can. To formalize this we will assume that firms 
competing in prices take these characteristics as given. 
Consequently, firms will choose their products anticipat- 
ing that their location in product space will affect the 
intensity of price competition. In making these assump- 
tions, we conform to the literature, which envisions a 
world where products are chosen first and prices second 
(similar to the way capacities were chosen before prices in 
chapter 5). 

The main points will be illustrated by means of two 
classic models. The first is the so-called location or spatial- 


differentiation model,¹ in which different consumers are
located at different places (see Hotelling 1929). Another
interpretation of this model is that consumers have het- 


ity.) 


looks at the firms’ entry and location decisions (non-price
competition). At this stage, it suffices to work with the
reduced-form profit functions and consider only the entry
and location decision of each firm.

We then develop the principle of differentiation, accord- 
ing to which firms generally do not want to locate at the 
same place in the product space. The reason is simply the
Bertrand paradox: Two firms producing perfect substitutes
face unbridled price competition (at least in a static frame-


m other firms. However, market conditions 
ain restrictions on this differentiation. For exam- 


demand for most of these products would never be 
sufficient to make them profitable. Therefore, fixed costs 
limit the spectrum. 

Next, we make the assumption that there are many 
potential firms (more than the number of firms that will 
eventually be operative, and perhaps an infinite number). 
Since in later chapters we will consider technological dif- 
ferences, we also assume here that all potential firms have 
the same technology. These two properties combine to 
yield an assumption of “free entry.” What is the effect of
this assumption on equilibrium profit? We already know 
that an entering firm must make a non-negative profit. 
Under the assumption of free entry, the profit of existing 
firms (assuming that the profit is the same for all firms) 
cannot be too large; otherwise, entry would occur until 
profit had decreased sufficiently to make further entry 
unprofitable. Therefore, the assumption of free entry nat- 
urally leads to that of approximately zero profit (actually, 
this intuition is valid only if the market is sufficiently 
large). To simplify our calculations, we will often assume 
that profits are zero. This assumption may lead to the
number of firms being calculated as a noninteger. In such 
a case the actual solution, which must be an integer, is 
the integer closest to but not exceeding the real number 
calculated. 

In section 7.1 we develop two standard models of 
spatial differentiation: “on the line” and “on the circle.” 
We use these examples to demonstrate the nature of 
Bertrand competition with differentiated products, to 
enunciate the principle of differentiation, and to study free- 
entry equilibria. We also discuss product diversity and the 
number of goods in both a market economy and a social 
optimum. We conclude that section by discussing some 
factors that reduce the incentive to differentiate from 
other firms. In the supplementary section we consider 
another important model of product differentiation—one 
that uses the vertical-differentiation model introduced in 
chapter 2—to formalize quality competition. Because the 
analysis here closely parallels that of horizontal differ- 
entiation, the points of divergence between the quality 
model and the location model will be indicated.²

In section 7.2 we consider the concept of “monopolistic
competition,” introduced by Chamberlin (1933).


1. See chapter 2 for a description of horizontal differentiation.




2. For interesting applications of these models to the beer industry and the 
automobile industry, see Baker and Bresnahan 1985 and Bresnahan 1987. 
Monopolistic competition refers to an “industry” with a
large number of firms, each facing a downward-sloping
demand (for differentiated products) but making no profit
because of fixed costs, and such that strategic interaction
is absent (i.e., each firm can ignore its impact on other
firms). As we will see, this latter property distinguishes
the monopolistically competitive situations from the
free-entry equilibrium studied in the spatial-differentiation
model. The supplementary section applies the familiar
model of Dixit and Stiglitz (1977) and Spence (1976) to
an analysis of product diversity in a monopolistically
competitive economy.

In section 7.3, ideas developed in sections 7.1 and 7.2
are used to study informational differentiation, another
type of product differentiation that results from
consumers’ uneven information about the characteristics
(existence, price, quality, etc.) of various products. We focus
on the link between advertising and differentiation. After
reviewing the conventional wisdom on advertising, we
will see how informative advertising can increase the
elasticity of demand for a product and foster competition.
We will also see that competition may yield (from a social
standpoint) too much or too little informative advertising.
7.1 Spatial Competition


7.1.1 The Linear City


We first consider a model (originally due to Hotelling
[1929]) in which a “linear city” of length 1 lies on the
abscissa of a line and consumers are uniformly distributed
with density 1 along this interval. There are two
firms or stores, which sell the same physical good. For
simplicity, and as a first step, these two stores are located
at the extremes of the city; store 1 is at $x = 0$ and store 2
at $x = 1$. The unit cost of the good for each store is $c$.
Consumers incur a transportation cost $t$ per unit of length
(this cost may include the value of time spent in travel).
Thus, a consumer living at $x$ incurs a cost of $tx$ to go to
store 1 and a cost of $t(1 - x)$ to go to store 2. The
consumers have unit demands; i.e., each consumes one or
zero unit of the good. Each consumer derives a surplus
from consumption (gross of price and transportation costs)
equal to $\bar{s}$.

We will also consider a variant of this model in which
the transportation costs are quadratic instead of linear. In
this case, a consumer at $x$ incurs a cost of $tx^2$ to go to
store 1 and a cost of $t(1 - x)^2$ to go to store 2. In this
version, the marginal transportation cost increases with
the distance to the store. As we will see, the quadratic
model is sometimes more tractable than the linear one.


7.1.1.1 Price Competition 


In this subsection we take the firms’ locations as given
and look for the Nash equilibrium in prices. Assuming
that firms choose their prices $p_1$ and $p_2$ simultaneously,³
we derive the demand function for quadratic transportation
costs. Let us assume that the prices of the two firms
do not differ so much that one firm faces no demand,
and that the prices are not too high relative to $\bar{s}$ (so that
all consumers buy; i.e., the market is covered). The first
condition must clearly be satisfied in equilibrium, because
a firm with no demand makes no profit and therefore has
an incentive to lower its price to gain market share. The
second condition is satisfied in equilibrium if the
consumers’ surplus from the good ($\bar{s}$) is sufficiently large.

A consumer who is indifferent between the two firms
is located at $x = D_1(p_1, p_2)$, where $x$ is given by equating
generalized costs; i.e.,

$$p_1 + tx^2 = p_2 + t(1 - x)^2.$$

The firms’ respective demands are

$$D_1(p_1, p_2) = x = \frac{p_2 - p_1 + t}{2t}$$

and

$$D_2(p_1, p_2) = 1 - x = \frac{p_1 - p_2 + t}{2t}.$$

When the firms are located at the two extremes of the
city, the demand functions are the same for linear cost as
for quadratic cost. (This is not robust. It does not
hold if the market is not covered, and, as we will see
shortly, it is contingent on the locations’ being the two
extremities of the city.) In both cases, firm $i$’s profit is

$$\Pi^i(p_i, p_j) = (p_i - c)(p_j - p_i + t)/2t.$$


3. For the derivation of the demand functions for linear transportation costs,
see section 2.1.


The goods produced by the two firms are strategic
complements in prices ($\Pi^i_{ij} \equiv \partial^2 \Pi^i / \partial p_i \partial p_j > 0$). This important property
will hold for all the models in this chapter except that of
monopolistic competition, in which interaction is absent.
Its role will be clarified in the next chapter.

For either linear or quadratic transportation costs, firm
$i$ chooses $p_i$ so as to maximize its profit given the price $p_j$
charged by its rival; i.e.,

$$\max_{p_i} \; \Pi^i(p_i, p_j).$$


The first-order condition for firm $i$ is

$$p_j + c + t - 2p_i = 0,$$

and the second-order condition is satisfied. Using the
symmetry of the problem, we obtain the competitive
prices and profits under product differentiation:

$$p_1^c = p_2^c = c + t \qquad (7.1)$$

and

$$\Pi^1 = \Pi^2 = t/2. \qquad (7.2)$$


We speak of differentiated products even though they are
physically identical. The products are differentiated more
for the consumers when the transportation cost is higher.
When $t$ increases, both stores compete less strenuously
for “the same consumers”; indeed, the neighboring clientele
of a store becomes more captive, giving the store
“monopoly power” (which, in turn, allows it to increase
its price). On the other hand, when $t = 0$ all the
consumers can go to either store for the same cost (0). The
absence of product differentiation leads to the Bertrand
result.
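As a numerical illustration of equations 7.1 and 7.2, the following minimal sketch iterates the two best-response functions implied by the first-order condition until prices settle. The function names and the parameter values ($c = 1$, $t = 0.5$) are assumptions chosen for the example, and the market is assumed to be covered.

```python
def best_response(p_rival, c, t):
    """Price solving the first-order condition p_j + c + t - 2*p_i = 0."""
    return (p_rival + c + t) / 2.0


def demand(p_own, p_rival, t):
    """Demand of a store located at one end of the city (covered market)."""
    return (p_rival - p_own + t) / (2.0 * t)


def profit(p_own, p_rival, c, t):
    """Profit (p_i - c) * D_i for the store charging p_own."""
    return (p_own - c) * demand(p_own, p_rival, t)


def price_equilibrium(c, t, rounds=200):
    """Iterate simultaneous best responses, starting from marginal-cost pricing."""
    p1 = p2 = c
    for _ in range(rounds):
        p1, p2 = best_response(p2, c, t), best_response(p1, c, t)
    return p1, p2


c, t = 1.0, 0.5  # illustrative values only (assumed, not from the text)
p1, p2 = price_equilibrium(c, t)
print(p1, p2, profit(p1, p2, c, t))
```

Because each best response has slope 1/2 in the rival's price, the iteration contracts toward the unique fixed point, which is the price $c + t$ of equation 7.1; setting $t$ close to zero pushes the prices toward marginal cost.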

Because we are also interested in the firms’ choice of
product differentiation, we would like to know how the
equilibrium prices vary with the firms’ locations. We have
looked at one polar case, the one in which firms are
located as far as possible from each other (maximal
differentiation). The other polar case is that in which they
produce the same product; i.e., they are located at the
same point (say $x_0$) and their goods are perfect
substitutes. Comparing the generalized costs $p_i + t|x - x_0|$ (or,
in the quadratic case, $p_i + t(x - x_0)^2$) for a consumer
located at any point $x$ amounts to just comparing prices
$p_1$ and $p_2$. Hence, the Bertrand result holds for identical
locations:

$$p_1^c = p_2^c = c \qquad (7.3)$$

and

$$\Pi^1 = \Pi^2 = 0. \qquad (7.4)$$


More generally, let us assume that firm 1 is located at
point $a \geq 0$ and firm 2 at point $1 - b$, where $b \geq 0$ and,
without loss of generality, $1 - a - b \geq 0$ (firm 1 is to the
“left” of firm 2; $a = b = 0$ corresponds to maximal
differentiation and $a + b = 1$ corresponds to minimal
differentiation, i.e., perfect substitutes). The linear-cost model is
not very tractable if firms are located inside the interval,
because when a firm lowers its price to the point that it
just attracts the consumers located between the two firms
it also attracts all consumers located on the other side of
the rival.⁴ The firms’ demand functions are discontinuous.
Their profit functions are discontinuous and nonconcave.
Consequently, the price-competition problem is not well
behaved. Indeed, d’Aspremont, Gabszewicz, and Thisse
(1979) show that if the firms are located close to the
center of the segment (but not at the same location), no
pure-strategy price equilibrium exists.⁵

The quadratic-cost model allows us to sidestep these
technical issues. The demand and profit functions are well
behaved (continuous and concave). We obtain

$$D_1(p_1, p_2) = x = a + \frac{1 - a - b}{2} + \frac{p_2 - p_1}{2t(1 - a - b)} \qquad (7.5)$$


4. Suppose that a consumer is located at $x > 1 - b > a$. This consumer
belongs to firm 2’s “turf” or “back yard.” His choice between the two firms is
determined by the comparison between

$$p_1 + t(x - a)$$

and

$$p_2 + t(x - [1 - b]),$$

i.e., between $p_1$ and $p_2 - t(1 - a - b)$. Thus, all consumers located to the right
of firm 2 always make the same brand choice as the consumer located at firm
2’s location. This means that at $p_1 = p_2 - t(1 - a - b)$ the demand functions
are discontinuous; all consumers on firm 2’s turf switch to firm 1 for a small
reduction in $p_1$.


5. A mixed-strategy price equilibrium does exist. See Dasgupta and Maskin
1986.
and

$$D_2(p_1, p_2) = 1 - x = b + \frac{1 - a - b}{2} + \frac{p_1 - p_2}{2t(1 - a - b)} \qquad (7.6)$$

(as long as these are non-negative and do not exceed 1
and as long as $\bar{s}$ is sufficiently large that the market is
covered).

To interpret equation 7.5, notice that for equal prices
firm 1 controls its own turf (of size $a$) and receives the half
of the consumers located between the two firms who are
closer to firm 1 (numerically, $(1 - b - a)/2$). The third
term of equation 7.5 expresses the sensitivity of demand
to the price differential.
The Nash equilibrium in prices, which always exists, is

$$p_1^c(a, b) = c + t(1 - a - b)\left(1 + \frac{a - b}{3}\right), \qquad (7.7)$$

$$p_2^c(a, b) = c + t(1 - a - b)\left(1 + \frac{b - a}{3}\right). \qquad (7.8)$$


Exercise 7.1* Check equations 7.5 through 7.8.
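Equations 7.5 through 7.8 can also be checked numerically. The sketch below is one possible check, not a substitute for the algebra: it evaluates the demands at the candidate equilibrium prices, confirms that the marginal consumer is indifferent under quadratic costs, and verifies that a small unilateral price change does not raise either firm's profit. The locations and parameters ($a = 0.1$, $b = 0.2$, $c = 1$, $t = 2$) and all function names are assumptions made for the illustration.

```python
def demand_1(p1, p2, a, b, t):
    """Equation 7.5: demand of firm 1, located at a."""
    gap = 1.0 - a - b
    return a + gap / 2.0 + (p2 - p1) / (2.0 * t * gap)


def demand_2(p1, p2, a, b, t):
    """Equation 7.6: demand of firm 2, located at 1 - b."""
    gap = 1.0 - a - b
    return b + gap / 2.0 + (p1 - p2) / (2.0 * t * gap)


def equilibrium_prices(a, b, c, t):
    """Equations 7.7 and 7.8."""
    gap = 1.0 - a - b
    p1 = c + t * gap * (1.0 + (a - b) / 3.0)
    p2 = c + t * gap * (1.0 + (b - a) / 3.0)
    return p1, p2


def indifference_gap(p1, p2, a, b, t):
    """Difference in generalized (quadratic) cost for the consumer at x = D1."""
    x = demand_1(p1, p2, a, b, t)
    return (p1 + t * (x - a) ** 2) - (p2 + t * (1.0 - b - x) ** 2)


def best_deviation_gain(a, b, c, t, step=1e-3):
    """Largest profit gain from moving one's own price by +/- step."""
    p1, p2 = equilibrium_prices(a, b, c, t)
    base_1 = (p1 - c) * demand_1(p1, p2, a, b, t)
    base_2 = (p2 - c) * demand_2(p1, p2, a, b, t)
    gains = []
    for d in (-step, step):
        gains.append((p1 + d - c) * demand_1(p1 + d, p2, a, b, t) - base_1)
        gains.append((p2 + d - c) * demand_2(p1, p2 + d, a, b, t) - base_2)
    return max(gains)


a, b, c, t = 0.1, 0.2, 1.0, 2.0  # illustrative values only (assumed)
p1, p2 = equilibrium_prices(a, b, c, t)
print(p1, p2, indifference_gap(p1, p2, a, b, t), best_deviation_gain(a, b, c, t))
```

Since each profit function is a concave quadratic in the firm's own price, ruling out small profitable deviations is enough to confirm a best response, provided the demands stay between 0 and 1.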


7.1.1.2 Product Choice


Suppose now that there are two firms and that each firm
is allowed to choose only one product (that is, only one
location). This defines a two-stage game in which (1) the
firms choose their locations simultaneously and (2) given
the locations, they choose prices simultaneously. As
was mentioned earlier, each firm must anticipate how its
choice of location affects not only its demand function
but also the intensity of price competition. Therefore, to
study location (product) competition, we use the reduced-form
profit functions, e.g.,

$$\Pi^1(a, b) = [\,p_1^c(a, b) - c\,]\, D_1(a, b, p_1^c(a, b), p_2^c(a, b)), \qquad (7.9)$$

where $D_1$ is given by equation 7.5. An equilibrium in
location is such that firm 1 maximizes $\Pi^1(a, b)$ with
respect to $a$, taking $b$ as given, and similarly for firm 2. (This
procedure is similar to the two-stage, capacity-and-then-price
competition studied in chapter 5.)

D’Aspremont et al. (1979) show that for quadratic
transportation costs, the equilibrium has the two firms
located at the two extremes of the city (maximal
differentiation). Each firm locates far from its rival in order not
to trigger a low price from the rival, and thus price
competition is softened. To show this, we could compute
the reduced-form profit functions $\Pi^i(a, b)$ explicitly using
equations 7.5 through 7.8 and solve for a Nash
equilibrium; however, it is more elegant and instructive to
proceed otherwise. Suppose, without loss of generality,
that in equilibrium

$$0 \leq a \leq 1 - b \leq 1.$$


We know that to maximize $\Pi^1(a, b)$ (given by equation
7.9) with respect to $a$ we need not take the derivative

$$\frac{\partial \Pi^1}{\partial p_1} \frac{\partial p_1^c}{\partial a}$$

into account. This is due to the envelope theorem: Firm 1 maximizes
with respect to price in the second period, so $\partial \Pi^1 / \partial p_1 = 0$.
Thus, we need only look at the direct effect of $a$ on
$\Pi^1$ (the demand effect) and the indirect effect through the
change in firm 2’s price (the strategic effect). That is,

$$\frac{d\Pi^1}{da} = (p_1^c - c)\left(\frac{\partial D_1}{\partial a} + \frac{\partial D_1}{\partial p_2} \frac{dp_2^c}{da}\right).$$

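Using equations 7.5, 7.7, and 7.8 we get

$$\frac{\partial D_1}{\partial a} = \frac{1}{2} + \frac{p_2^c - p_1^c}{2t(1 - a - b)^2} = \frac{3 - 5a - b}{6(1 - a - b)}, \qquad (7.10)$$

and using equations 7.5 and 7.8 we get

$$\frac{\partial D_1}{\partial p_2} \frac{dp_2^c}{da} = \left(\frac{1}{2t(1 - a - b)}\right) t\left(-\frac{4}{3} + \frac{2a}{3}\right) = \frac{-2 + a}{3(1 - a - b)}. \qquad (7.11)$$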


Adding equations 7.10 and 7.11 and using the fact that
the mark-up ($p_1^c - c$) is positive, we can easily show that
$d\Pi^1/da < 0$. Hence, firm 1 always wants to move
leftward if it is to the left of firm 2, and similarly for firm 2.
Therefore, the equilibrium in locations exhibits maximal
differentiation.
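The sign of $d\Pi^1/da$ can be illustrated numerically. The sketch below computes firm 1's reduced-form profit from equations 7.5, 7.7, and 7.8 and splits the marginal effect of $a$ into the demand effect (7.10) and the strategic effect (7.11). The function names, the grid of locations, and the values $t = 1$, $b = 0.2$ are assumptions made for the example; the marginal cost $c$ cancels out of the mark-up and is therefore omitted.

```python
def reduced_profit_1(a, b, t):
    """Firm 1's second-stage equilibrium profit, (p1 - c) * D1."""
    gap = 1.0 - a - b
    markup = t * gap * (1.0 + (a - b) / 3.0)        # p1 - c, from (7.7)
    price_diff = t * gap * 2.0 * (b - a) / 3.0      # p2 - p1, from (7.7)-(7.8)
    share = a + gap / 2.0 + price_diff / (2.0 * t * gap)  # D1, from (7.5)
    return markup * share


def demand_effect(a, b):
    """Equation 7.10: effect of a on D1 at given prices."""
    return (3.0 - 5.0 * a - b) / (6.0 * (1.0 - a - b))


def strategic_effect(a, b):
    """Equation 7.11: effect of a on D1 through the rival's price."""
    return (a - 2.0) / (3.0 * (1.0 - a - b))


t, b = 1.0, 0.2  # illustrative values only (assumed)
for a in (0.0, 0.1, 0.2, 0.3):
    total = demand_effect(a, b) + strategic_effect(a, b)
    print(a, reduced_profit_1(a, b, t), demand_effect(a, b),
          strategic_effect(a, b), total)
```

On this grid the demand effect is positive while the strategic effect is negative and larger in absolute value, so the reduced-form profit falls as firm 1 moves toward its rival.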

Use of the envelope theorem (which will be reiterated
in the next chapter) is also instructive. It exhibits the
conflict between two effects. First, equation 7.10 shows
that if $a$ is not too big (in particular, if it does not exceed
1/2, using $1 - b \geq a$), firm 1 will want to move toward the
center to increase its market share given the price
structure. This is part of a more general result that, for given
prices, the two firms want to locate at or near the center
(see subsection 7.1.3). However, firm 1 also acknowledges
that the associated decrease in product differentiation
forces firm 2 to lower its price. The computations show
that this strategic effect dominates the market-share
effect.

It is interesting to compare the market-determined
locations to the socially optimal ones. Suppose that the
social planner chooses locations for the two firms.
Because consumption is fixed, the social planner minimizes
the consumers’ average transportation cost (this holds
whether the firms exercise their market power as above or
are forced to price at marginal cost; for given locations,
and as long as the market is covered, the pricing structure
does not affect the sum of consumer surplus and profits in
this inelastic-demand model). By symmetry of the
problem, the social planner chooses to locate the two firms
equidistantly on either side of the middle of the segment,
so that for equal prices a firm serves the left or the right
half of the market. The location that minimizes the
average transportation cost on a market segment is the
middle of this market segment when the density of
consumers is uniform. Thus, the socially optimal locations
are 1/4 and 3/4. In this example, the market outcome yields
socially too much product differentiation.
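The planner's problem can be illustrated with a short numerical sketch. It assumes symmetric locations $a$ and $1 - a$, equal prices (so that each store serves its own half of the city), and a covered market; total transportation cost is approximated by a midpoint rule and minimized over a grid. The function names, the grid sizes, and the value $t = 1$ are assumptions made for the example.

```python
def transport_cost(a, t, quadratic=True, cells=2000):
    """Total transportation cost when stores sit at a and 1 - a and each
    serves its own half of the city (equal prices, covered market)."""
    power = 2 if quadratic else 1
    width = 0.5 / cells
    total = 0.0
    for k in range(cells):
        x = (k + 0.5) * width  # midpoint of the k-th cell of [0, 1/2]
        total += t * abs(x - a) ** power * width
    return 2.0 * total  # the right half of the city is the mirror image


def best_symmetric_location(t, quadratic=True, grid=100):
    """Grid search over a in [0, 1/2] for the cost-minimizing location."""
    candidates = [k / (2.0 * grid) for k in range(grid + 1)]
    return min(candidates, key=lambda a: transport_cost(a, t, quadratic))


t = 1.0  # illustrative value only (assumed)
a_star = best_symmetric_location(t)
print(a_star, transport_cost(a_star, t), transport_cost(0.0, t))
```

With quadratic costs, the exact total cost is $2t[(1/2 - a)^3 + a^3]/3$, which equals $t/48$ at $a = 1/4$ and $t/12$ at $a = 0$; under these assumptions the maximal-differentiation outcome therefore quadruples total transportation cost relative to the optimum.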


Exercise 7.2** Consider the model of differentiation on
the line. The two firms’ locations are fixed, and they are
the two extremities of the segment. Transportation costs
are linear in distance, and the distribution of consumers
is uniform along the segment. The firms have constant
marginal costs, $c_1$ and $c_2$, which are not necessarily equal
(but, for simplicity, assume that they do not differ too
much, so that each firm has a positive market share in
equilibrium).

(i) Compute the reaction functions $p_i = R_i(p_j)$. Infer
the Nash-equilibrium prices $p_i(c_i, c_j)$ and the reduced-form
profits $\Pi^i(c_i, c_j)$ as functions of the two marginal costs.

(ii) Show that $\partial^2 \Pi^i / \partial c_i \partial c_j < 0$.

(iii) Suppose that, before competing in price, the firms
play a first-period game in which they simultaneously
choose their marginal cost. (Think of an investment cost
$\phi(c)$ of choosing marginal cost $c$, with $\phi' < 0$ and $\phi'' > 0$.)
Show that, as in the previous choice-of-location game,
this investment game gives rise to a direct effect and a
strategic effect.
7.1.2 The Circular City


7.1.2.1 The Model 


The above consideration of a linear city allowed us to 
examine price competition with differentiated products, 
as well as the choice of product in duopoly. Now let us 
study entry and location when there are no “barriers to 
entry” other than fixed costs or entry costs. Assuming 
that there exist a large number of identical potential firms, 
we will look at the number of firms entering the market. 
To do so, it is actually more convenient to consider a 
circular city with a uniform distribution of consumers. In 
this case, the product space is completely homogeneous 
(no location is a priori better than another), which makes 
the study of the issue at hand more tractable. 

The following model is due to Salop (1979). Consumers
are located uniformly on a circle with a perimeter
equal to 1. Density is unitary around this circle. Firms are 
also located around the circle, and all travel occurs along 
the circle (like the linear city, this is a bit contrived in 
order to simplify the analysis, but one may think of a city 
around a lake, with boats being an inefficient transporta- 
tion technology; or of supermarkets in a circular suburbia 
with a costly-to-cross city at its center; or else of aircraft 
departure times).

As before, consumers wish to buy one unit of the
good, have a unit transport cost $t$ (for simplicity, we will
consider only linear transportation costs), and are willing
to buy at the smallest generalized cost so long as the
latter does not exceed the gross surplus they obtain from
the good ($\bar{s}$). Each firm is allowed to locate in only one
location (we will discuss this assumption below, and
especially in the next chapter, where we will examine the
possibility of entry deterrence through brand
proliferation). In order to address the issue of the number of firms,
we introduce a fixed cost of entry, $f$. Once a firm is in
and is located at a point on the product space, it faces
a marginal cost $c$ (smaller than $\bar{s}$). Thus, firm $i$’s profit
is $(p_i - c)D_i - f$ if it enters (where $D_i$ is the demand it
faces), and 0 otherwise.

Salop considers the following two-stage game: In 
the first stage, potential entrants simultaneously choose 
whether or not to enter. Let n denote the number of 
entering firms. Those firms do not choose their location, 
but rather are automatically located equidistant from one 
another on the circle (figure 7.1). Thus, maximal
differentiation is exogenously imposed. In the second stage,
firms compete in prices given these locations. 

For realism, one would want firms to choose their
locations either at the same time or after their entry
decision, rather than having an auctioneer pick the particular
location pattern. However, the point of Salop’s model is
not to look at the particular product choice but rather to
study the extent of entry (in this respect, this approach is
similar to that outlined in sections 7.2 and 7.5.2).
Omitting the choice of location allows us to study the entry
issue in a simple and tractable way. We will come back to
the issue of location choice later.

We have assumed free entry (a large number of
identical firms). Consequently, the equilibrium profit of
entering firms is zero (up to the integer problem). As has
already been indicated, we must (1) determine the Nash
equilibrium in prices for any number of firms and calculate
the reduced-form profit functions and (2) determine the
Nash equilibrium in the entry game.

Assume that $n$ firms have entered the market. Because
they are located symmetrically, it makes sense to look for
an equilibrium in which they all charge the same price $p$
(see figure 7.1). For the moment, we will consider only the
case in which there are enough firms in the market (which
will correspond to $f$ not too high) so that the firms are
indeed competing among themselves. In practice, firm $i$
has only two real competitors, namely the two
surrounding it.⁶ Suppose that it chooses price $p_i$ (see figure 7.2). A
consumer located at the distance $x \in (0, 1/n)$ from firm $i$ is
indifferent between purchasing from firm $i$ and purchasing
from $i$’s closest neighbor if

$$p_i + tx = p + t(1/n - x).$$

Then firm $i$ faces a demand of

$$D_i(p_i, p) = 2x = \frac{p + t/n - p_i}{t}.$$

Therefore, firm $i$ solves

$$\max_{p_i} \left\{ (p_i - c)\left(\frac{p + t/n - p_i}{t}\right) - f \right\}.$$


[Figure 7.2]


Differentiating with respect to $p_i$ and then setting $p_i = p$
yields

$$p = c + \frac{t}{n}.$$

This result is analogous to the one found for the linear
city. The profit margin ($p - c$) decreases with $n$.
However, the number of firms is endogenous; it is determined
from the zero-profit condition for the existing firms:

$$(p - c)\frac{1}{n} - f = \frac{t}{n^2} - f = 0.$$

Therefore, the number of firms and the market price in a
situation of imperfect competition with free entry are,
respectively,

$$n^c = \sqrt{t/f}$$

and

$$p^c = c + \sqrt{tf}.$$
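The free-entry outcome can be sketched in a few lines. The code below evaluates the symmetric price $c + t/n$, the per-firm profit $t/n^2 - f$, and the zero-profit values of $n$ and $p$; it also applies the integer convention of taking the largest whole number of firms that does not exceed the computed value. The function names and the parameter values are assumptions made for the example.

```python
import math


def symmetric_price(n, c, t):
    """Symmetric equilibrium price with n equally spaced firms."""
    return c + t / n


def profit_per_firm(n, c, t, f):
    """Each firm sells to 1/n of the consumers at margin t/n and pays f."""
    return (symmetric_price(n, c, t) - c) / n - f


def free_entry(c, t, f):
    """Zero-profit number of firms (possibly a noninteger) and price."""
    return math.sqrt(t / f), c + math.sqrt(t * f)


def integer_number_of_firms(t, f):
    """Largest integer not exceeding the zero-profit number of firms."""
    return math.floor(math.sqrt(t / f))


c, t, f = 1.0, 4.0, 0.25  # illustrative values only (assumed)
n_c, p_c = free_entry(c, t, f)
print(n_c, p_c, profit_per_firm(n_c, c, t, f))
print(integer_number_of_firms(1.0, 0.03))
```

When the zero-profit number is not an integer, the firms that do enter earn a small positive profit, while one more entrant would push every firm's profit below zero.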


6. This feature is due to the particularities of this location model. For instance,
Archibald and Rosenbluth (1975) show that in a Lancasterian world (see
chapter 2) with four characteristics, an average brand may have $n/2$ direct
competitors, where $n$ is the number of products.
A trivial but important point of models of this type is
that firms price above marginal cost and yet do not make
profits. Thus, an empirical finding that firms do not make
supranormal profits in an industry should not lead one to
conclude that firms do not have market power, where
“market power” is defined as pricing above marginal cost.
(An economist’s definition of market power differs from a
policy-maker’s definition. Policy-makers generally mean
pricing above average cost. For this second meaning, the
firms in our free-entry model have no market power.)

The preceding equations show that an increase in fixed
costs causes a decrease in the number of firms and an
increase in the profit margin ($p^c - c$). An increase in the
transportation cost increases the profit margin and
therefore increases the number of firms (firms see that there
is an increased possibility of differentiation). Finally, we
note that the consumer’s average transportation cost is

$$\frac{t}{4n^c} = \frac{\sqrt{tf}}{4},$$

and that it does not increase as rapidly as $t$.

When the entry cost or fixed production cost $f$
converges to zero, the number of entering firms tends to
infinity and the market price tends toward marginal cost.
(Similarly, an increase in consumer density, with the fixed
cost kept constant, would increase the number of entering
firms and push prices toward marginal cost.) Thus, with
very low entry costs, each consumer purchases a product
very close to his preferred product, and the market is
approximately competitive. (As we will see in the
supplementary section, this property may not hold in
vertical-differentiation models.)

We continue with our previous locational-choice
model by considering the equilibrium from a normative
viewpoint. To this end, we compare the free-entry
equilibrium with the allocation selected by a social planner.
We already know that the price charged by the firms
is greater than the marginal cost. However, in this
case, where consumers all receive the same utility from
the good and each consumes only one unit, this price
introduces no distortion. The quantity consumed (one
unit) is not affected by the mark-ups. Thus, the profit
margin is only a monetary transfer from the consumers to
the firms. The question is whether there are too many
or too few firms in relation to the social optimum. In
deriving the social optimum, we will not be concerned
with gross consumer surplus ($\bar{s}$), since it is the same
as in the case of imperfect competition. An omniscient
planner would choose $n = n^*$ in order to minimize the
sum of the fixed costs and the consumers’ transportation
costs:


$$\min_n \left[ nf + t\left(2n \int_0^{1/2n} x\,dx\right) \right]$$

or, equivalently,

$$\min_n \,(nf + t/4n).$$

Therefore, we have

$$n^* = \tfrac{1}{2}\sqrt{t/f} = \tfrac{1}{2}\,n^c.$$

From this, we conclude that the market generates too 
many firms. (Because there is no distortion of consump- 
tion through pricing, whether or not the social planner 
is able to force firms to price at marginal cost in the 
regulated-entry context does not affect this conclusion.) 
The following exercise shows that similar results hold for 
quadratic transportation costs. 
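The comparison between the market outcome and the planner's choice under linear costs can be made concrete with a small sketch. It evaluates the planner's objective $nf + t/4n$ at the free-entry number of firms and at the cost-minimizing number. The function names and the values $t = 4$, $f = 0.25$ are assumptions made for the example, and the integer constraint is ignored.

```python
import math


def social_cost(n, t, f):
    """Fixed costs plus total transportation cost with n equally spaced firms."""
    return n * f + t / (4.0 * n)


def planner_number_of_firms(t, f):
    """Minimizer of social_cost, from f - t / (4 n^2) = 0."""
    return 0.5 * math.sqrt(t / f)


def market_number_of_firms(t, f):
    """Free-entry (zero-profit) number of firms, from t / n^2 = f."""
    return math.sqrt(t / f)


t, f = 4.0, 0.25  # illustrative values only (assumed)
n_star = planner_number_of_firms(t, f)
n_c = market_number_of_firms(t, f)
print(n_star, social_cost(n_star, t, f))
print(n_c, social_cost(n_c, t, f))
```

In this example free entry doubles the number of firms relative to the optimum, and the extra fixed costs outweigh the savings in transportation costs.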


Exercise 7.3* Show that if transportation costs are $td^2$,
where $d$ is the consumer’s distance to his selected shop,
Salop’s model yields

$$p = c + t/n^2,$$

and that, with free entry,

$$n^c = (t/f)^{1/3} > n^* = (t/6f)^{1/3}.$$


Thus, under linear or quadratic costs, we obtain too
many products.⁷ Firms have too much of an incentive to
enter. Clearly, the private and the social incentive to enter
have no reason to coincide. Entry is socially justified here
by the savings in transportation costs (or, more generally,
by the greater product diversity offered to the
consumers). In contrast, the private incentive to enter is
linked with “stealing the business” of other firms while
still being able to impose a mark-up. We will come back
7. More generally, the market offers too many products for transportation
costs x" for any > 0 (R. Costrell, private communication). 
later to this effect, which is sometimes called the frade-
Read effect. 


Remark In the preceding model, we assumed implicitly
that the market equilibrium was such that

$$p^c + \frac{t}{2n^c} < \bar{s} \quad \left(\text{i.e., } \tfrac{3}{2}\sqrt{tf} < \bar{s} - c, \text{ or } f < \bar{f} \equiv \frac{4}{9}\,\frac{(\bar{s} - c)^2}{t}\right).$$

In other words, the consumer furthest from a store
(having to travel a distance of $1/2n^c$) receives a strictly
positive net surplus. This assumption does not create any
problem for small fixed costs. When fixed costs increase,
however, the number of firms decreases and both the
distance between the firms and the prices increase. When
$f$ exceeds $\bar{f}$, the equilibrium can no longer be described as
above, since consumers located halfway between two
firms will no longer purchase anything. See Salop 1979
for a study of this case (see also the following exercise).
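The condition in this remark is easy to check numerically. The sketch below computes the net surplus of the consumer located halfway between two stores at the free-entry outcome, together with the threshold fixed cost at which that surplus reaches zero. The threshold formula used here, $4(\bar{s} - c)^2/9t$, follows from setting $\tfrac{3}{2}\sqrt{tf} = \bar{s} - c$; the function names and the values $\bar{s} = 2$, $c = 1$, $t = 1$ are assumptions made for the example.

```python
import math


def threshold_fixed_cost(s_bar, c, t):
    """Fixed cost at which the farthest consumer's net surplus is zero."""
    return 4.0 * (s_bar - c) ** 2 / (9.0 * t)


def farthest_consumer_surplus(s_bar, c, t, f):
    """Net surplus of the consumer halfway between two stores at free entry."""
    n_c = math.sqrt(t / f)
    p_c = c + math.sqrt(t * f)
    return s_bar - p_c - t / (2.0 * n_c)


s_bar, c, t = 2.0, 1.0, 1.0  # illustrative values only (assumed)
f_bar = threshold_fixed_cost(s_bar, c, t)
for f in (0.25, f_bar, 0.5):
    print(f, farthest_consumer_surplus(s_bar, c, t, f))
```

For fixed costs below the threshold the surplus is positive and the competitive description of the equilibrium applies; above it, the formulas for $n^c$ and $p^c$ no longer describe the outcome.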


Exercise 7.4** Show that if $f > \bar{f}$ but $f$ is small enough so
that the market remains “covered,” there exists an
equilibrium in which the firms charge $p = \bar{s} - t/2n$. To do
this, construct the “residual demand curve” (that is, the
demand curve $D_i(p_i \mid p_{-i})$ for firm $i$ when the other firms
charge a price equal to $\bar{s} - t/2n$). Demonstrate that this
demand curve has a kink at $\bar{s} - t/2n$, and draw the
isoprofit curve in the same space ($D_i$, $p_i$). Derive $n^c$ and $p^c$ as
functions of $f$ and $t$. Argue that the results obtained are
a priori “counterintuitive,” and explain them.


7.1.2.2 Discussion

The above model was built to address the question of
the number of firms. There are three natural extensions
that would make it more realistic: the introduction of a
location choice, the possibility that firms do not enter
simultaneously, and the possibility that a firm locates at
several points in the product space.


Location Choice 


The model assumes that firms locate equidistantly. This
assumption is appealing given the maximal-differentiation
result of d’Aspremont et al. for quadratic transportation
costs in a linear city. It has been justified in the context of
the circular city only in the case of quadratic
transportation costs. Economides (1984) considers a three-stage
game in which firms choose whether to enter, then
choose locations on the circle, and then compete in prices.
He vindicates the equal-spacing assumption by showing
that for quadratic costs there exists a free-entry
symmetric equilibrium (in locations and prices).


Sequential Entry 


The assumption that all firms enter simultaneously is
convenient because it eliminates the strategic aspects of
product positioning (a product choice may deter entry or
affect one’s rivals’ subsequent choice of product). However,
one generally thinks of the firms possibly (but not
necessarily) entering in sequence.⁸ There are two issues here.
First, positing that firms move sequentially, what is the
equilibrium pattern of location, given that firms take into
account the fact that their location decision in the product
space affects future locations? (Prescott and Visscher [1977]
did some pioneering work in this area, studying sequential
location decisions by firms in a linear city.) Second, what
is the optimal timing of entry?⁹ Giving a satisfactory
answer to this second question requires the introduction
of further elements into the model (for instance, in a
spatial model, the rate of growth of demand or
population, or the rate at which the production cost decreases).


Brand Proliferation 


A crucial assumption in the previous analysis is that each 
firm is allowed only one brand. But a firm may produce 
several brands and crowd the product space, leaving no 
room for entry by another firm (this possibility is all the 
more likely if preemption is feasible). Indeed, it will be 
argued in subsequent chapters that because a monopoly 
makes more profit than an oligopoly carrying the same 
brands and facing the same technology (because competition 
lowers profits), an incumbent firm has a greater incentive 
to introduce new products than an entrant has. As 
we will see, this efficiency effect tends to bias the market 
structure toward multibrand monopoly.¹⁰ 


8. The real world is often better approximated by a continuous-time entry 
model than by a one-period entry model. The one-period set-up can be 
obtained only in special cases, such as the case where long information lags 
about one’s rivals’ entry decisions cause firms to (de facto) move simultaneously. 


9. This question raises the important issue of preemption, which will be 
examined in various models in chapters 8 and 10. 


10. See chapters 8 and 10 for more detail, and for a description of offsetting 
effects that encourage an oligopoly structure. 


Product Differentiation 


7.1.3 Maximal or Minimal Differentiation? 


Clearly, the spatial models, and others like them (e.g., 
the vertical-differentiation (quality) model in the 
supplementary section), are only abstractions. Nevertheless, 
they are quite useful, for they shed light on the nature of 
price rivalry. Furthermore, they make important predictions 
about business strategies. One such prediction is the 
principle of differentiation: Firms want to differentiate 
to soften price competition. This corresponds to the 
recommendation found in most marketing texts concerning 
market segmentation, and to the observation that firms 
do usually differentiate in the real world.¹¹ In some 
cases, firms look for maximal product differentiation. 
While these recommendations are economically very 
appealing, there exist forces that oppose maximal product 
differentiation, and even forces that oppose any product 
differentiation. These can be put into three categories: 


Be where the demand is. It is clear that, although firms like 
to differentiate for strategic purposes, they also all want 
to locate where the demand is (e.g., near the center of the 
linear city). In the example of d’Aspremont et al., these 
two forces conflict, and we did not derive any intuition 
as to why the strategic effect dominates. Thus, it is 
not surprising that one can construct examples in which 
firms differentiate, but not fully.¹² This is even clearer 
in markets where demand is concentrated around a few 
poles,¹³ which may explain the abundance of ice cream 
parlors and bookstores near universities. (In other product 
spaces, differentiation in laundry detergents or gasoline is 


11. For instance, in the personal computer industry, Apple's products are easy 
to use and are directed toward home use whereas IBM's are intended mainly for 
office or professional use. (Apple is, however, trying to crack the office market. 
This relates to the need, discussed below, to be where the demand is.) A 
vertical-differentiation example is that of Mercedes and Rolls-Royce, which 
have found profitable niches in the automobile product space. See also the 
discussion on advertising and information in section 7.3. 


12. A trivial example is obtained by fictitiously extending the city in the 
example of d’Aspremont et al. (e.g., to cover the segment located between 
$x = -1$ and $x = 2$). However, the subsegments between $x = -1$ and $x = 0$ 
and between $x = 1$ and $x = 2$ are uninhabited; the consumers are uniformly 
distributed between 0 and 1. Clearly, the analysis of d’Aspremont et al. is not 
affected; however, the equilibrium locations ($x = 0$ and $x = 1$) are no longer 
at the extremes of the city’s territory. 

In a more interesting example, Economides (1986) takes up the linear-city 
model with transportation cost $t d^{\alpha}$, where $d$ is the distance to the shop and $\alpha$ 
belongs to $[1, 2]$. He shows that a pure-strategy price equilibrium exists for 
$\alpha > 1.26$, and that differentiation is not maximal for $\alpha$ in $[1.26, 1.67]$, whereas it 
is for $\alpha$ in $[1.67, 2]$. 
not of much interest to most consumers.) Of course, if 
firms are to locate at the same place, they must be able to 
relax price competition somehow. From chapters 5 and 6, 
we know that this may be implemented either by restraining 
capacity or by tacitly colluding. Another possibility is 
that firms may be differentiated by attributes other than 
location. For instance, not all consumers may have the 
same taste for various brands of ice cream independent of 
their generalized costs,¹⁴ or they may not all have the 
same information about products (see section 7.3). 


Positive externalities between firms. There may be 
externalities that induce firms to locate near one another. On 
the cost side, one can think of common installations and 
trade centers; for instance, fishermen may converge to the 
same harbor to sell their fish, even if this means more 
intense competition. Another possibility is that many 
firms may locate near a source of raw materials. On the 
demand side, search by consumers may encourage firms 
to gather. Doing so lowers search costs and increases 
aggregate demand for these firms. If the increase in demand 
is not offset by a considerable increase in price competition, 
such a strategy may be worthwhile. This may 
explain the scores of furniture stores along the Faubourg 
Saint Antoine in Paris¹⁵; the point here is that in a world 
with at least some amount of product differentiation other 
than location (e.g., different furniture designs), consumers 
find it convenient to search next door if they do not find 
their preferred item in a given shop.¹⁶ 


Absence of price competition. Product differentiation is 
meant to relax price competition. In some instances, there 


13. Here we take poles as exogenous. They may in fact result not only from 
the consumers’ location pattern but also from the decisions by firms selling 
goods that are complements (or even imperfect substitutes) to locate near one 
another (think of shopping malls). See Stahl 1982a and Eaton 1982. 


14. De Palma et al. (1985) introduce this possibility in the linear-city model 
with linear transportation costs. Instead of assuming that the consumers all 
derive the same surplus $\bar{s}$ from any product, they consider a product-specific 
surplus: $s_i = \bar{s} + \mu \varepsilon_i$, where $\mu$ is a parameter and $\varepsilon_i$ is a consumer-and-firm-
specific random variable. They find that if $\mu$ becomes large enough, there exists 
an equilibrium in which both firms locate at the center. When $\mu$ is large, 
products are very differentiated even if they have the same spatial location, so 
the strategic effect is weak. The direct, demand effect then commands firms to 
locate where the demand is. 


15. Other Parisian examples include the Au Printemps and the Galeries 
LaFayette department stores and the seafood restaurants on Place Clichy. It 
would be interesting to study whether the disappearance of some districts in 
New York is due to a decrease in search costs and information costs. 


16. The literature on the interaction between consumer search and firm 
location is scant. Stahl (1982b) has made a start. 
may be legal or technical reasons why the scope of 
price competition is limited. For instance, the prices of 
airline tickets in the United States (before deregulation) 
were determined exogenously, as the prices of gas and 
books in France once were.¹⁷ (Similarly, consumer prices 
may be determined by resale-price-maintenance agreements 
imposed by manufacturers.) It is, thus, clear that 
the incentive to differentiate products decreases when 
firms do not compete in prices. Indeed, Hotelling (1929) 
enunciated the principle of minimal differentiation in such 
circumstances. To see why minimal differentiation may 
arise in some location models, consider his model of a 
linear city with two firms. Suppose that the price ($p > c$) 
is exogenously fixed, and that each firm chooses a location 
on a segment of length 1 with a uniform distribution 
of consumers. Suppose further that the firms share demand 
equally if they are located identically. Because the 
prices and the profit margins are fixed, the firms choose 
their locations so as to maximize demand. Let firm 1 
locate at point $a$ and firm 2 at point $1 - b$, where, without 
loss of generality, $0 \le a \le 1 - b \le 1$. Suppose that 
these locations differ: $a < 1 - b$. Let us show that firm 1, 
for instance, would like to move toward firm 2. Its demand is 

$$a + \frac{1 - b - a}{2} = \frac{1 + a - b}{2};$$


it therefore increases with a. This is natural, as the firms 
are competing for the consumers located between them. 
Thus, an equilibrium must involve identical locations: $a = 1 - b$. 
Suppose now that $a = 1 - b < 1/2$. Each firm's 
demand is $1/2$. But by moving to the right by $\varepsilon > 0$, firm 
2, for instance, would have demand 


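$$(b - \varepsilon) + \frac{(1 - b + \varepsilon) - (1 - b)}{2} = b - \frac{\varepsilon}{2} > \frac{1}{2}$$

(for $\varepsilon$ small enough, since $b > 1/2$). 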


Thus, the firms would want to move toward the center. 
By the same reasoning, for $a = 1 - b = 1/2$ neither firm 
would want to move. Thus, the only equilibrium has both 
firms located at the center of the city.¹⁸ In this example, 
the products are socially too close to each other. Trans- 
portation costs could be reduced by having the firms 
move away from the center. (As was noted in subsection 
7.1.1, the locations midway between the center and the 
extremities minimize transportation costs.) Such a loca- 
tion model without price competition may explain why 
political platforms tend to cluster around the center, and 
why similar TV shows (news, movies, etc.) compete in the 
same time slots of the major networks in many countries. 
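
The fixed-price location argument above can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes a discretized unit segment, and the function names (`demands`, `best_response`, `iterate_to_equilibrium`) and the grid size are illustrative choices. Each firm in turn picks the grid location that maximizes its market share, given the rival's location.

```python
def demands(x1: float, x2: float) -> tuple[float, float]:
    """Market shares of firms located at x1 and x2 on [0, 1].

    Prices are fixed and equal, consumers are uniform on [0, 1],
    and each consumer buys from the nearest firm (ties split equally).
    """
    if x1 == x2:
        return 0.5, 0.5
    lo, hi = min(x1, x2), max(x1, x2)
    indifferent = (lo + hi) / 2.0        # consumer midway between the firms
    d_lo, d_hi = indifferent, 1.0 - indifferent
    return (d_lo, d_hi) if x1 < x2 else (d_hi, d_lo)


def best_response(rival: float, grid: list[float]) -> float:
    """Grid location that maximizes own demand against a rival's location."""
    return max(grid, key=lambda x: demands(x, rival)[0])


def iterate_to_equilibrium(x1: float, x2: float,
                           steps: int = 101, rounds: int = 200):
    """Alternate best responses on a grid of `steps` points (an assumption)."""
    grid = [i / (steps - 1) for i in range(steps)]
    for _ in range(rounds):
        x1 = best_response(x2, grid)
        x2 = best_response(x1, grid)
    return x1, x2


if __name__ == "__main__":
    print(iterate_to_equilibrium(0.1, 0.9))
```

Under these assumptions each firm leapfrogs just inside its rival until both sit at the center, which is consistent with the minimal-differentiation result derived in the text.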


7.2 Monopolistic Competition 


Monopolistic competition was introduced by Chamberlin 
(1933) to formalize the following industry configuration 
(here we continue to assume that each firm produces at 
most one product, in order to abstract from the issue of 
brand proliferation by a given firm): 


(i) Each firm faces a downward-sloping demand. 
(ii) Each firm makes no profit. 


(iii) A price change by one firm has only a negligible 
effect on the demand of any other firm.¹⁹ 


Properties i and ii are satisfied by the zero-profit equi- 
librium of the Salop model of section 7.1. Property iii 
distinguishes monopolistic competition from the previous 
oligopolistic competition with free entry. It says that each 
firm or product has no direct neighbor in the product 
space. This absence of cross-effects has been widely criti- 
cized. Except in a few cases (see the monopolistic com- 
petition model of informational product differentiation in 


17. To be sure, some forms of “non-price decisions” are meant to substitute for 
price cuts. For instance, some big book retailers in France have offered to buy 
used books back at advantageous prices and have introduced benefits for 
faithful consumers, and airlines once supplied lavish on-board services. Firms 
will always find ingenious loopholes in the regulatory system; however, such 
non-price competition is not necessarily a perfect substitute for price competition, 
and therefore it may yield some undesirable welfare losses besides 
the benefits intended by the regulators. Consumers may prefer a direct price 
reduction. 


18. It can be shown, for instance, that with three firms no pure-strategy 
location equilibrium with exogenously fixed price exists. (For some nonexistence 
results, see Eaton and Lipsey 1975. For more references and for mixed-strategy 
equilibrium, see Dasgupta and Maskin 1986.) However, an equilibrium does 
exist if firms enter sequentially rather than simultaneously (Prescott and Visscher 
1977). The first and second firms locate at $1/4$ and $3/4$, and the third firm locates 
between them. 


19. Other (nonequivalent) properties sometimes replace property iii to yield 
alternative definitions of monopolistic competition. One such property (weaker 
than iii) is the perfect-symmetry axiom (also called the absence-of-localization 
axiom): A change in the demand for good $i$ (due to a change in its price, 
advertising level, etc.) does not affect the ratio of good $j$’s sales to good $k$'s 
sales for all $j, k \neq i$. Schmalensee (1985) has developed an econometric test for 
the presence of localized competition. The application of this test to data on 
the U.S. ready-to-eat-breakfast-cereal industry signals a misspecification of the 
perfect-symmetry model. 




section 7.3), existing products compete directly with a 
few products (or when they do not, as in the monopoly 
case, property ii is likely to be violated). The point of 
monopolistic competition is thus not to study strategic 
aspects between products (such as product positioning 
and price competition), but rather to abstract from them 
to simplify the analysis and study other issues, such as the 
number of products offered by a market economy. 

It has long been a “conventional wisdom” that 
monopolistic competition yields too many firms from a 
social viewpoint, or that the existing firms produce too 
little to exhaust returns to scale (“excess capacity”). The 
reasoning runs as follows: Suppose that firms have 
U-shaped average-cost curves. Let $D_i(p_i, p_{-i})$ be the residual 
demand curve of firm $i$; that is, its demand curve given 
the vector of prices $p_{-i}$ charged by the other firms. A 
free-entry equilibrium requires that each firm make zero 
profit or, more simply, that firm $i$ produce at a point $(p_i^c, q_i^c)$ 
such that the residual demand curve is tangent to the 
average-cost curve at this point (see figure 7.3, where 
$AC(q_i)$ is the average cost at $q_i$). The quantity produced is 
less than the quantity ($q_i^*$) that minimizes average cost. 
Therefore, the fixed cost is spread over too few units, and 
waste occurs. 

Actually, this reasoning is flawed. If no other firm 
produces the same good (the other goods are different 
from that of firm $i$), the introduction of this good can 
be justified even if its production does not exhaust 
economies of scale. (This is even more obvious in the preceding 
location example, where the average cost is $c + f/q$, 
and the quantity that minimizes average cost, the most 
efficient scale, is therefore $q_i^* = +\infty$.) Therefore, the 
reasoning must require firm $i$ to produce a good that is 
already being produced by another firm. But then its 
demand curve is horizontal at the price ($p_j$) charged by 
the other firm (or at the minimum of such prices if there 
are many other firms that produce this good), and 
decreasing thereafter; see figure 7.4. 

Geometrically, it is easy to see that the only output for 
firm $i$ that can satisfy both the condition that firm $i$ makes 
zero profit and the condition that firm $j$ sells a positive 
quantity (otherwise, its own profit must be negative 
because of the fixed cost) is $q_i^*$ (with $p_i = p_j$). Therefore, we 
cannot conclude that there is “excess capacity” or that 
there are too many firms. 

More recent research (Dixit and Stiglitz 1977, Spence 
1976) has shown that this argument cannot be patched 
Figure 7.4 
Demand curve for perfectly substitutable goods 


up. In general, there are two effects operating in opposite 
directions: 
Nonappropriability of social surplus. This effect, discussed 
in the monopoly case in chapter 2, says that a firm cannot 
generally capture the whole consumer surplus associated 
with the introduction of a good. The positive externality 
on consumers implies that firms tend to introduce socially 
too few products. 
Business stealing (trade diversion). By introducing a product, 
a firm steals consumers from other firms. The rivals, 
who have a positive profit margin, lose income from 
these diverted consumers. This negative externality on 
other firms implies that firms tend to introduce too many 
products. 

In general, the free-entry outcome (like the monopoly 
outcome; see chapter 2) can involve socially too few or 
too many products. More definite conclusions can be 
obtained only for specific models. 


Chapter 7 


For more on the Dixit-Stiglitz-Spence model, see 
subsection 7.5.2. 


7.3 Advertising and Informational Product Differentiation 


Let us now apply some of the above ideas to informational 
differentiation. We will focus on the effect of advertising 
on consumer demand and product differentiation. 

Advertising has many media, including television, 
radio, newspapers, magazines, and direct mail.²⁰ It 
constitutes a non-negligible industry in itself (2 to 3 percent 
of the GNP in the United States in 1984, somewhat less 
in other developed countries).²¹ 

Competition in advertising is one of the main dimensions 
of non-price competition. As was mentioned in 
chapter 2, the study of advertising strains the rational-
consumer model perhaps more than other topics. Though 
it will be argued that advertising may foster competition 
by increasing the elasticity of demand (reducing 
“differentiation”), it is easy to find cases in which the reverse is 
true. For instance, ads seem to succeed in differentiating 
products that are physically almost identical (laundry 
detergents, beers, etc.). Subsection 7.3.1 reviews the 
conventional wisdom about advertising. 

Subsection 7.3.2 presents the best case for advertising. 
We adopt the view that advertising brings useful information 
to the consumer (this view stems from Telser’s 
(1964) response to economists’ traditional distaste for 
advertising; see, e.g., Kaldor 1950). We formalize advertising 
as yielding information about the very existence of 
the product and its price (information about location 
or other characteristics would follow a somewhat 
similar pattern). The assumption in this subsection is that 
informational issues can be solved, at a cost, through 
advertising (in the terminology of chapter 2, the good is 
a search good). We investigate whether advertising 
reduces product differentiation, and whether the market 
supplies too much or too little advertising. 


Subsection 7.3.3 briefly mentions another link between 
consumer information and product differentiation. There 
we consider experience goods rather than search goods. We 
assume that manufacturers cannot directly inform con- 
sumers about product quality through advertising, and 
that consumer learning occurs through consumption. Dif- 
ferent consumers at a given point in time have different 
information about the various products’ qualities, which 
stems from their earlier patterns of consumption. Thus, a 
consumer may fail to perceive two goods that are equal 
in quality (and not differentiated on other characteristics) 
as perfect substitutes because he has tried and knows the 
quality of only one of the goods. 


7.3.1 Views on Advertising²² 


We can distinguish two polar views on advertising. 

The partial view sees advertising as providing informa- 
tion to the consumers and thus enabling them to make 
rational choices. Advertising announces the existence 
of a product, quotes its price, informs consumers about 
retail locations, and describes the product's quality. It re- 
duces the consumers’ search costs and helps them choose 
between brands. Thus, advertising reduces product dif- 
ferentiation associated with a lack of information about 
some products and fosters competition. Similarly, it facil- 
itates the entry of new firms, who can capture the demand 
of established firms. It also encourages the production of 
high-quality goods. High-quality firms have an incentive 
to reveal their quality through advertising, which puts 
low-quality firms at a disadvantage. 

Newspapers are the favorite medium of the proponents 
of this view. Newspaper advertising in the United States 
is, for the most part, local. It often contains information 
about price, attributes, and retail location. The favorite 
products of these proponents are eyeglasses, prescription 
drugs, and food. Benham (1972) argues that the average 
price of eyeglasses is significantly higher in states where 
advertising is prohibited. The explanation seems to be 
that advertising informs consumers and fosters price 
competition²³ (see also Cady [1976] on prescription drugs and 
Steiner [1973] on how the availability of network television 
advertising changed the nature of competition 
in the toy industry). These proponents also point to 
efforts of some professionals (e.g., pharmacists, medical 
professionals, and lawyers) to impose legal restrictions 
on advertising. 


20. To give orders of magnitude: The shares of total advertising expenditures 
in 1984 in the United States were 23% for television (network, spot, local, and 
cable), 7% for radio, 27% for newspapers, 6% for magazines, and 16% for direct 
mail. There are also non-advertising promotional expenses, e.g., salespersons, 
lobbyists, and public-relations staffs. 


21. Advertising expenditures vary greatly according to the product. For 
instance, according to 1975 data from the Federal Trade Commission, drugs, 
perfumes, and breakfast cereals had media-advertising expense-to-sales ratios 
between 10% and 20% (between 20% and 35% if one includes all selling 
expenses). Of course, beet sugar, railroad equipment, and guided missiles are 
hardly advertised. 


22. This section was inspired by lectures given by Paul Joskow and Richard 
Schmalensee. 

The adverse view claims that advertising is meant to 
persuade and fool consumers. It creates differentiation 
that is not real, rather than reducing real informational 
differentiation (see, e.g., Galbraith 1967 and Solow 1967 
on the view that consumers are manipulated by Madison 
Avenue). Thus it reduces product competition; it also 
increases barriers to entry. 

The favorite medium of the proponents of this view is 
network television. Network TV advertising is mainly 
national. It is very image-oriented, and it conveys little 
information beyond the existence of the product. Their 
favorite products are cigarettes, Clorox (which commands 
a high price premium although it is basically identical to 
rival products), and beer. They point to Nichols’ (1951) 
famous study of the cigarette industry, which shows that, 
from the 1920s on, cigarette manufacturers competed 
through advertising and brand proliferation rather than 
through price cuts (prices seem to have been fairly 
collusive) or improvements in quality. Hence, competition in 
this industry seems to have been socially wasteful. 

Certainly, each of these views has some merit. The 
relevance of each seems to depend on the product, the 
nature of consumer demand, and the advertising medium. 
In the rest of this section, we will investigate the role of 
advertising assuming full consumer rationality. 


7.3.2 Search Goods and Informative Advertising 


In this subsection, we assume that advertising conveys 
information on existence and price.²⁴ We consider two 
such models. The first, due to Butters (1977), is one of 
monopolistic competition (each of a large number of firms 
faces a downward-sloping demand, but their effects on 

the rivals are negligible). Differentiation between pro- 
ducts stems entirely from consumers’ knowing about only 
a restricted number of brands (possibly one or none). The 
second model, due to Grossman and Shapiro (1984), looks 
at oligopolistic interaction in a Butters-type model; it also 
allows product differentiation along another dimension 
(location). Our main focus is on whether monopolistic or 
oligopolistic interaction yields socially too much or too 
little advertising. As will become clear, our discussion 
rejoins the previous study of product diversity. 


7.3.2.1 Monopolistic Competition 


In Butters’ model, all firms offer the same product (there 
is no horizontal or vertical differentiation). Production of 
the good exhibits constant returns to scale, at unit cost c. 
Each consumer has unit demand for this product. A 
consumer's utility is $U = \bar{s} - p$ if he buys one unit at price $p$ 
and 0 if he does not buy. If the consumers were perfectly 
informed about all firms, Bertrand competition would 
drive the price to the marginal cost, and each consumer 
would buy one unit and enjoy utility $\bar{s} - c$. However, we 
assume that informing the consumers about the existence 
and the price of a brand is costly. More specifically, let us 
assume that the only way to reach consumers is to send 
advertisements at random. An advertisement also quotes 
the price charged by the firm that sends it. (Because the 
consumers cannot search in this model, the term search 
good is unfortunate here.) If there are N consumers, each 
consumer has a probability 1/N of receiving a given ad. 
A consumer thus can receive 0, 1, 2, ... ads. If he receives 
none, he buys nothing; if he receives one, he buys from 
the corresponding firm as long as the price does not 
exceed $\bar{s}$; if he receives several, he chooses the lowest 
price if it does not exceed $\bar{s}$. In the case of a tie, he 
chooses randomly between low-price brands. 

As in section 2.2, let s denote the total number of ads 
sent to consumers by all firms. A consumer's probability 
of not getting an ad at all is 


23. There might be a change in the quality of services. Benham mentions that 
in 1963, 83.4% of consumers were served by physicians and optometrists in 
states where advertising was prohibited, versus 53.2% in states where it was 
permitted. However, Kwoka (1984) presents empirical evidence that advertising 
leads to no erosion of quality in this industry. 


24. The field owes much to the early analyses of Dehez and Jacquemin (1975), 
Dorfman and Steiner (1954), Friedman (1983), Nerlove and Arrow (1962), 
Schmalensee (1972, 1976, 1978), and Spence (1980), all of whom assumed that 
the demand for a product depends on its price and on the amount of advertising. 
Their approach is useful for positive analysis (prediction of price and 
advertising expenditures). It is less amenable to normative analysis, because it 
does not detail the channel through which advertising enters the consumers’ 
preferences. To perform welfare analysis, we choose to be more explicit (and 
also less general) about the effect of advertising on demand. For a more general 
overview of advertising and market structure, see Schmalensee 1986. 
$$1 - \Phi = \left(1 - \frac{1}{N}\right)^{s} \simeq e^{-s/N}$$

for $N$ large. Conversely, where $c'$ denotes the unit cost of 
sending an ad (again, we assume constant returns), the 
social cost of making sure that the fraction $\Phi$ in $(0, 1)$ of 
consumers receives at least one ad is 

$$A(\Phi) = c's = c' N \ln\left(\frac{1}{1 - \Phi}\right).$$

The advertising cost per consumer is thus 

$$c' \ln\left(\frac{1}{1 - \Phi}\right).$$

Assume that $\bar{s} > c + c'$ (otherwise, no advertising and 
production would ever take place). 
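
The random-advertising technology can be illustrated numerically. The sketch below is an illustration under stated assumptions, not part of Butters' analysis: the function and parameter names (`reach_exact`, `reach_approx`, `ads_needed`, `cost_per_consumer`, `c_ad` for the unit cost of an ad) are hypothetical. It compares the exact probability of being reached with its large-$N$ approximation and inverts the approximation to obtain the number of ads needed for a target reach.

```python
import math


def reach_exact(s: int, n: int) -> float:
    """Probability that a given consumer receives at least one of s ads,
    each ad landing on any one of n consumers with probability 1/n."""
    return 1.0 - (1.0 - 1.0 / n) ** s


def reach_approx(s: float, n: int) -> float:
    """Large-n approximation: 1 - exp(-s/n)."""
    return 1.0 - math.exp(-s / n)


def ads_needed(phi: float, n: int) -> float:
    """Number of ads s such that a fraction phi in (0, 1) is reached
    (inverting the large-n approximation): s = n * ln(1 / (1 - phi))."""
    return n * math.log(1.0 / (1.0 - phi))


def cost_per_consumer(phi: float, c_ad: float) -> float:
    """Advertising cost per consumer, c_ad * ln(1 / (1 - phi))."""
    return c_ad * math.log(1.0 / (1.0 - phi))


if __name__ == "__main__":
    n = 100_000
    for s in (50_000, 100_000, 300_000):
        print(s, reach_exact(s, n), reach_approx(s, n))
    print(ads_needed(0.75, n), cost_per_consumer(0.75, c_ad=0.5))
```

The per-consumer cost is increasing and convex in the reach: as more consumers have already been reached, a growing share of additional ads lands on them and is wasted.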

Let us first consider a free-entry equilibrium. Firms here 
do not face entry costs or fixed costs. Each firm is negligible 
with respect to the market. Clearly, ads specifying 
prices below $c + c'$ or above $\bar{s}$ will not be sent (pricing 
below $c + c'$ yields negative profit; pricing above $\bar{s}$ 
automatically induces no demand). Butters shows that 
any price between $c + c'$ and $\bar{s}$ is advertised in equilibrium 
by some firm. The trade-off between two prices is the 
following: A higher price yields a higher profit margin, 
but has a lower probability of being accepted, as the 
probability increases that the consumer receives another 
ad with a lower price. More formally, let $x(p)$ denote the 
probability that an ad at price $p$ is accepted by the consumer 
who receives it, i.e., the probability that this consumer 
does not receive another ad specifying a lower 
price. $x(p)$ can be thought of as a (downward-sloping) 
demand function. In equilibrium, an ad must make a zero 
expected profit whatever its price (if it made a positive 
profit, new firms would enter the market and send ads at 
the same price until the probability of acceptance fell 
enough to reestablish the zero-profit condition). That is, 
for all $p$ in $[c + c', \bar{s}]$, 

$$(p - c)\,x(p) - c' = 0. \qquad (7.12)$$

Note that $x(c + c') = 1$; otherwise, an ad at $c + c'$ would 
lose money. Conversely, one can recover the distribution 
of ads from the probability $x(p)$. 
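
As a small worked illustration of equation 7.12, the acceptance probability implied by the zero-profit condition is $x(p) = c'/(p - c)$. The sketch below assumes hypothetical names (`acceptance_probability`, `expected_profit`, `c_ad`, `s_bar`) and arbitrary parameter values; it only restates the condition in code.

```python
def acceptance_probability(p: float, c: float, c_ad: float, s_bar: float) -> float:
    """Acceptance probability x(p) = c_ad / (p - c) implied by zero profit,
    defined for advertised prices p in [c + c_ad, s_bar]."""
    if not (c + c_ad <= p <= s_bar):
        raise ValueError("price outside the advertised range [c + c_ad, s_bar]")
    return c_ad / (p - c)


def expected_profit(p: float, c: float, c_ad: float, s_bar: float) -> float:
    """Expected profit of one ad: margin times acceptance probability, minus ad cost."""
    return (p - c) * acceptance_probability(p, c, c_ad, s_bar) - c_ad


if __name__ == "__main__":
    c, c_ad, s_bar = 1.0, 0.5, 3.0       # illustrative values only
    for p in (1.5, 2.0, 2.5, 3.0):
        print(p, acceptance_probability(p, c, c_ad, s_bar),
              expected_profit(p, c, c_ad, s_bar))
```

With these illustrative values the lowest advertised price is always accepted, the acceptance probability falls as the price rises, and every advertised price yields zero expected profit.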

As we will see shortly, the important datum for as- 
sessing welfare in this model is the level of advertising. 
This level is obtained from equation 7.12. The probability 
$1 - \Phi^c$ that a consumer does not receive an ad is equal to 
the probability $x(\bar{s})$ that an ad at the highest possible 
price of the good triggers a sale (the only chance that 
such an ad will trigger a sale is if it is the only ad received 
by the consumer). From equation 7.12, 

$$1 - \Phi^c = x(\bar{s}) = \frac{c'}{\bar{s} - c}.$$


Let us now consider social welfare. Because of the 
unit-demand assumption and the homogeneity of con- 
sumers, the market does not introduce any consumption 
distortion. A consumer who receives at least one ad con- 
sumes one unit of the good, which is socially optimal. 
The only potential distortion in this economy is related 
to the number of ads, or equivalently, to the fraction 
$(1 - \Phi)$ of consumers who are not reached by any ad. 
Because consumption by a consumer yields social surplus 
$\bar{s} - c$, a social planner would choose a fraction $\Phi^*$ so as 
to maximize as follows: 

$$\max_{\Phi} \left\{ \Phi(\bar{s} - c) - c' \ln\left(\frac{1}{1 - \Phi}\right) \right\}.$$

The first-order condition is 

$$\bar{s} - c = \frac{c'}{1 - \Phi^*}, \quad \text{or} \quad \Phi^* = \Phi^c.$$


Thus, the monopolistically competitive level of advertising 
is socially optimal. To understand this result, note that, 
from the zero-profit condition, the firm's incentive to 
send an ad, $(p - c)x(p)$, is independent of price $p$ (in other 
words, all ads at prices between $c + c'$ and $\bar{s}$ yield the 
same profit). This incentive is equal to the gain from 
sending an ad at price $\bar{s}$. Such an ad yields $\bar{s} - c$ with a 
probability equal to the probability that the consumer 
receives no ad. But this is exactly the social planner's 
incentive. An ad increases the social surplus (by $\bar{s} - c$) 
only if the consumer receives no other ad.²⁵ 
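
The coincidence of the free-entry and socially optimal advertising levels can be verified numerically. The sketch below is only an illustration under assumed parameter values: the names (`welfare_per_consumer`, `equilibrium_reach`, `planner_reach`, `c_ad`, `s_bar`) are hypothetical, and a brute-force grid search stands in for the first-order condition.

```python
import math


def welfare_per_consumer(phi: float, s_bar: float, c: float, c_ad: float) -> float:
    """Planner's objective per consumer: phi*(s_bar - c) - c_ad*ln(1/(1 - phi))."""
    return phi * (s_bar - c) - c_ad * math.log(1.0 / (1.0 - phi))


def equilibrium_reach(s_bar: float, c: float, c_ad: float) -> float:
    """Free-entry reach implied by equation 7.12 at the highest price:
    1 - phi = x(s_bar) = c_ad / (s_bar - c)."""
    return 1.0 - c_ad / (s_bar - c)


def planner_reach(s_bar: float, c: float, c_ad: float, steps: int = 100_000) -> float:
    """Grid search for the welfare-maximizing reach over phi in [0, 1)."""
    grid = (k / steps for k in range(steps))
    return max(grid, key=lambda phi: welfare_per_consumer(phi, s_bar, c, c_ad))


if __name__ == "__main__":
    s_bar, c, c_ad = 3.0, 1.0, 0.5       # illustrative values only
    print(equilibrium_reach(s_bar, c, c_ad), planner_reach(s_bar, c, c_ad))
```

For any parameters with $\bar{s} > c + c'$, the two numbers should agree up to the grid resolution, which is the optimality result stated in the text.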


25. For a price $p < \bar{s}$, the private incentive can be decomposed into two effects. 
The firm’s profit margin, $p - c$, is lower than the social surplus, $\bar{s} - c$; the firm 
does not appropriate the whole surplus. However, the ad generates demand 
with probability $x(p)$, which exceeds the socially relevant probability that the 
consumer receives no other ad ($1 - \Phi$). The difference between the two 
probabilities corresponds to the probability that the firm diverts the consumer from 
another firm (a business-stealing effect). We will come back to these two effects 
when we discuss oligopoly. Note that the optimality result means that the two 
effects offset each other exactly here. 




This optimality result is quite striking. Even though it 
does not hold for more general models (see below), it 
shows that informative advertising need not be distorted 
in market equilibrium. It relies on specific assumptions 
(unit demands, consumer homogeneity, etc.). An interesting 
variant of the model, considered by Butters, 
allows consumer search. Thus, consumers can be informed 
through two channels: advertising and (if they receive no 
ad) search. Butters shows that the monopolistically 
competitive equilibrium involves too much advertising 
and not enough search. 

Another striking result reported by Butters is the 
equilibrium price dispersion. There is a vast literature on 
price dispersion when consumers get informed through 
search rather than advertising. See Bénabou 1986a,b 
for a general model of search and for references to earlier 
work.²⁶ 


7.3.2.4 Oligopoly 


Grossman and Shapiro (1984) build on Butters to analyze
informative advertising in oligopolistic interaction. They
combine the preceding model with the circular-city one.
Thus, firms are differentiated along two dimensions:
information and location. We shall employ a model slightly
simpler than theirs and shall abstract from the entry
question they consider to focus on the effect of advertising on
the elasticity of individual demands and on the
appropriability and business-stealing effects.

Let us use the linear-city model of section 7.1. (There
are two firms located at the two extremes of a segment of
length 1. Consumers are distributed uniformly along the
segment with density 1; they derive gross surplus $\bar{s}$ from
consuming the good, and they incur transportation cost $t$
per unit of distance.) As in Butters' case, a consumer can
consume a product if and only if he receives an ad from
the corresponding firm. Let $\Phi_i$ ($i = 1, 2$) denote the
fraction of consumers who receive an ad from firm $i$. Like
Grossman and Shapiro, we assume that advertising is not
localized. Thus, consumers located along the segment
have equal chances of receiving a given ad. (Extending
the model to allow the targeting of ads would be
worthwhile.) The cost of reaching fraction $\Phi_i$ of consumers
is $A(\Phi_i)$, where $A' > 0$ and $A'' > 0$. (With the Butters
technology,

$$A(\Phi) = c' \ln\left(\frac{1}{1 - \Phi}\right).$$

However, the technology can be more general. To
simplify the computations, we will assume that
$A(\Phi_i) = a\Phi_i^2/2$, with a maximum advertising expenditure of
$a/2$.)

The potential demand for firm 1, say, has size $\Phi_1$. It can
be decomposed into two parts. A fraction $1 - \Phi_2$ of this
potential demand does not receive an ad from firm 2. It
can thus be considered firm 1's turf. Each consumer in this
market segment is willing to pay a price up to $\bar{s}$ minus the
transportation cost to firm 1. A fraction $\Phi_2$ also receives
at least one ad from firm 2, and therefore constitutes
a more elastic or competitive fragment of demand. We
assume that firms cannot price-discriminate. They choose
prices $p_1$ and $p_2$. In the sequel, we consider only the
case in which the two firms compete for this “common
demand.” In particular, we look at equilibria with overlapping
market areas for firms among the fully informed
consumers. Thus, we focus on the competitive case, which
will allow useful comparisons with the no-advertising
model of section 7.1.²⁷ Recall from section 7.1 that the
demand for firm 1 under full information is


26. Bénabou (1986b) obtains price dispersion in a monopolistic competition
model in which nominal prices are costly to adjust (because of “menu costs”)
and inflation creates a need for nominal price adjustments. He shows that
inflation generates price dispersion, which makes search potentially more
profitable and thereby increases price competition. Inflation thus yields lower real
prices and fewer firms in equilibrium.

Gertner (1987) considers an oligopoly model in which a higher variance of
inflation increases the firms' market power for high search costs. The idea is that
for a deterministic inflation rate, the consumers interpret a firm's high price as
stemming from a high idiosyncratic cost and may keep on searching for a
lower-cost firm (depending on their search cost). Under a random inflation rate,
the high price is due either to a high idiosyncratic cost or to a high inflation,
which then affects the firm's rivals as well. The consumers' incentive to keep
searching is thus reduced, and the firms' market power is increased. It is
interesting to note that Bénabou finds a negative correlation between the level
of inflation and market power, whereas Gertner finds a positive correlation
between the variance of inflation and market power. (Of course, the two effects
may offset each other somewhat, as the inflation rate and its variance are
generally positively correlated.)

Bénabou (1986a) also looks at price reputation and repeat purchases with
search.


27. Roughly, the competitive case arises if the advertising costs are not too
high, so that a large fraction of the market is covered by both firms. In this case,
charging a high price and focusing on one's own turf does not yield enough
demand, and firms indeed try to compete for the common demand. In the
quadratic advertising example, this is guaranteed if $a$ does not exceed
$t/2$ too much. (Values $a < t/2$ are not considered here, because they yield
$\Phi_1 = \Phi_2 = 1$ in equilibrium, which is the full-information case considered in
section 7.1.)

$$D_1 = \frac{p_2 - p_1 + t}{2t}.$$

Thus,

$$D_1 = \Phi_1\left[(1 - \Phi_2) + \Phi_2\,\frac{p_2 - p_1 + t}{2t}\right].$$

Note that the elasticity of demand at $p_1 = p_2 = p$ and
$\Phi_1 = \Phi_2 = \Phi$ is

$$\varepsilon_1 = -\frac{\partial D_1/\partial p_1}{D_1/p_1} = \frac{\Phi p}{(2 - \Phi)t}.$$

The elasticity of demand is an increasing function of $\Phi$
and thus is increasing with advertising.
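The elasticity formula can be checked numerically. The following is a minimal sketch, not part of the original analysis: the function names are illustrative, and the demand expression is only meaningful in the competitive range, where the two firms' market areas overlap among fully informed consumers.

```python
def demand_firm1(p1, p2, phi1, phi2, t):
    """Firm 1's demand: its turf plus its share of the common demand."""
    turf = 1.0 - phi2                          # consumers reached by firm 1 only
    common = phi2 * (p2 - p1 + t) / (2.0 * t)  # informed of both firms
    return phi1 * (turf + common)


def elasticity_numeric(p, phi, t, h=1e-6):
    """Own-price elasticity at p1 = p2 = p and phi1 = phi2 = phi,
    computed with a central finite difference."""
    up = demand_firm1(p + h, p, phi, phi, t)
    down = demand_firm1(p - h, p, phi, phi, t)
    slope = (up - down) / (2.0 * h)
    return -slope * p / demand_firm1(p, p, phi, phi, t)


def elasticity_closed_form(p, phi, t):
    """The expression in the text: phi * p / ((2 - phi) * t)."""
    return phi * p / ((2.0 - phi) * t)
```

Evaluating `elasticity_closed_form` at a fixed price for increasing values of `phi` shows the elasticity rising with the reach of advertising.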

Consider the game in which the two firms simultaneously
choose prices and advertising levels. For instance,
firm 1's behavior is described by

$$\max_{(p_1, \Phi_1)} \left\{ (p_1 - c)\,\Phi_1\left[(1 - \Phi_2) + \Phi_2\,\frac{p_2 - p_1 + t}{2t}\right] - A(\Phi_1) \right\},$$

and firm 2's behavior similarly. Now specialize the model
to quadratic advertising costs $A(\Phi_i) = a\Phi_i^2/2$. The two
first-order conditions are (note that a firm's pricing
policy is independent of its advertising level)

$$p_1 = \frac{p_2 + c + t}{2} + \frac{1 - \Phi_2}{\Phi_2}\,t \tag{7.13}$$

and

$$\Phi_1 = \frac{1}{a}\,(p_1 - c)\left[1 - \Phi_2 + \Phi_2\,\frac{p_2 - p_1 + t}{2t}\right]. \tag{7.14}$$


These equations have straightforward interpretations.
The first term on the right-hand side of 7.13 is the
reaction function under full consumer information. The
second term is an extra mark-up associated with the
existence of a turf; it reflects the fact that the elasticity of
demand is lower than under full information. Equation
7.14 states the equality between the marginal cost of
advertising, $a\Phi_1$, and the marginal benefit, which is equal
to the profit margin times the probability of sale.
Because the game is symmetric, let us look for a symmetric
equilibrium ($p_1^c = p_2^c = p^c$ and $\Phi_1^c = \Phi_2^c = \Phi^c$). Solving
equations 7.13 and 7.14, assuming $a > t/2$, yields

$$p^c = c + t\,\frac{2 - \Phi^c}{\Phi^c} = c + \sqrt{2at}, \tag{7.15}$$

$$\Phi^c = \frac{2}{1 + \sqrt{2a/t}}, \tag{7.16}$$

and

$$\Pi^c = \frac{2a}{\left(1 + \sqrt{2a/t}\right)^2}. \tag{7.17}$$
Several interesting conclusions can be derived from this
simple model (these conclusions are, of course, valid only
in the competitive range):

• The price, $p^c$, exceeds that under full information ($c + t$).
This is due to the lower elasticity of demand associated
with informational differentiation. As in section 7.1, the
price increases with the parameter of horizontal
differentiation, $t$ (but not as rapidly, because more horizontal
differentiation yields a higher margin and therefore
encourages more advertising and creates a bigger common
market). The price also increases with the cost of
advertising, $a$.

• The lower the advertising cost and the higher the
horizontal differentiation, the more the firms advertise.

• Profits increase with the transportation cost, as in
section 7.1. What is more remarkable, they increase with the
cost of advertising. The direct effect of an increase in $a$
(for $p$ and $\Phi$ given) is to reduce the firms' profits. However,
there is a strategic effect: An increase in advertising costs
reduces advertising and thus increases informational
product differentiation. This allows firms to raise the price. In
this example, they gain more from costlier advertising
than they lose. This result is not general, but it strongly
exemplifies the role of advertising in reducing product
differentiation. It may also shed some light on why some
professions do not resist (and sometimes encourage)
legal restrictions on advertising.²⁸
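These comparative statics are easy to verify numerically. The sketch below is only an illustration under the quadratic advertising cost $a\Phi^2/2$; the function names are assumptions of this example, not notation from the text. It evaluates the closed forms 7.15 to 7.17 and checks that they satisfy the first-order conditions 7.13 and 7.14 at a symmetric point.

```python
import math


def symmetric_equilibrium(a, t, c):
    """Closed forms (7.15)-(7.17); meaningful only when a >= t/2."""
    if a < t / 2.0:
        raise ValueError("the competitive range requires a >= t/2")
    root = math.sqrt(2.0 * a / t)
    reach = 2.0 / (1.0 + root)               # (7.16)
    price = c + math.sqrt(2.0 * a * t)       # (7.15)
    profit = 2.0 * a / (1.0 + root) ** 2     # (7.17)
    return price, reach, profit


def foc_residuals(price, reach, a, t, c):
    """Residuals of (7.13) and (7.14) at a symmetric candidate
    (p1 = p2 = price, phi1 = phi2 = reach); both should be zero."""
    r_price = price - ((price + c + t) / 2.0 + (1.0 - reach) / reach * t)
    r_reach = a * reach - (price - c) * (1.0 - reach + reach / 2.0)
    return r_price, r_reach
```

For instance, with `t = 1` and `c = 0`, raising `a` from 1 to 2 lowers the reach returned by `symmetric_equilibrium` and raises both the price and the profit, as the list above states.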


28. In this model, a small limitation on advertising (such as a small tax on it)
benefits the firms. An outright prohibition would hurt them, because they would
then be unable to reach consumers at all. Of course, in reality, in the absence of
advertising, consumers would substitute for advertising by searching (including
using word of mouth). Thus, a reasonable description of such a prohibition
would require enriching the model.

Peters 1984 constructs a model of a homogeneous-good industry in which
price advertising is costless. In an advertising equilibrium, all consumers are



Compare the market level of advertising to the socially
optimal level of advertising, $\Phi^*$.²⁹ It is easy to see that $\Phi^c$
can be either greater or less than $\Phi^*$: there can be either
excessive or suboptimal advertising in equilibrium. This
result should not surprise us very much. Advertising, as
described in this model, is very much like creating a good,
in that it offers a new good to the consumer who receives
the ad. The following effects can now be distinguished:

Nonappropriability of social surplus  Because of the
competitive pressure, the firms' profit margin is lower than
the social surplus associated with the consumption of the
good. Thus, each firm has too low an incentive to create
the good (i.e., to advertise).

Business stealing  Here a firm creates little social surplus if
its ad is received by a consumer who also receives an ad
from its rival; it saves at most some transportation cost $t$, and
usually less (because it captures the segment located next
to it, it saves an average of $3t/4 - t/4 = t/2$ on such a
consumer). But the markup exceeds $t$. The loser is not the
consumer, who saves on transportation costs, but the
rival, who loses the profit margin $p^c - c$ on the diverted
consumer. Thus, there may be excessive advertising.
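The two cases can be illustrated with a short numerical sketch. It assumes the quadratic advertising cost and takes the planner's objective to be the one stated in footnote 29; the function names and the single argument `surplus`, standing for $\bar{s} - c$, are conveniences of this example.

```python
import math


def market_reach(a, t):
    """Equilibrium advertising reach, equation (7.16)."""
    return 2.0 / (1.0 + math.sqrt(2.0 * a / t))


def welfare(phi, a, t, surplus):
    """Planner's objective from footnote 29, with surplus = s_bar - c."""
    two_ads = phi ** 2 * (surplus - t / 4.0)
    one_ad = 2.0 * phi * (1.0 - phi) * (surplus - t / 2.0)
    return two_ads + one_ad - a * phi ** 2


def planner_reach(a, t, surplus):
    """Maximizer of welfare(), obtained from its first-order condition."""
    return (2.0 * surplus - t) / (2.0 * surplus - 1.5 * t + 2.0 * a)
```

With `surplus = 3`, the pair `a = 0.5, t = 1` gives a market reach above the planner's (excessive advertising), whereas scaling both parameters down to `a = 0.1, t = 0.1` leaves the market reach unchanged at its `a = t` value and pushes the planner's reach above it (insufficient advertising).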


This type of model is too crude to allow definite
conclusions (beyond that there may be overadvertising or
underadvertising). However, it constitutes the first step
toward a richer analysis, and it also supplies a framework
with which empirical findings can be confronted. For
instance, consider the following stylized fact: “In broad
samples of manufacturing industries, especially those
producing consumer goods, advertising intensity is
positively related to industry-average profitability.”
(Schmalensee 1986b, stylized fact 3-10)³⁰ As Schmalensee
warns us, such correlations should not be interpreted as
causal relationships. Advertising does not increase profits,
and profits do not generate advertising. Both variables
are jointly determined (i.e., endogenous), as is shown, for
example, by the reduced-form equations 7.16 and 7.17. In
the previous model, the exogenous variables are $t$ and
$a$. If horizontal differentiation increases, both advertising
and profit increase, which is consistent with the stylized
fact. However, if the cost of advertising increases,
advertising goes down and profits go up, implying a negative
relationship between the two. Thus, it all depends
on which exogenous variable varies significantly in the
sample distribution. If we take this crude model as a
paradigm, we attribute the positive correlation to the
possibility that product differentiation varies more than
advertising costs across manufacturing industries.³¹
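The point that the sign of the correlation depends on which exogenous variable varies can be illustrated with the reduced forms 7.16 and 7.17. The sketch below is hypothetical throughout: the two samples of "industries" are invented parameter grids, and advertising intensity is proxied by the reach $\Phi^c$, which is an assumption of the example rather than a claim about the empirical measure.

```python
import math


def advertising_and_profit(a, t):
    """Equilibrium reach (7.16) and profit (7.17) for one industry."""
    root = math.sqrt(2.0 * a / t)
    return 2.0 / (1.0 + root), 2.0 * a / (1.0 + root) ** 2


def pearson(xs, ys):
    """Plain Pearson correlation coefficient."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def sample_correlation(industries):
    """Correlation between reach and profit across (a, t) pairs."""
    reach, profit = zip(*(advertising_and_profit(a, t) for a, t in industries))
    return pearson(reach, profit)


# Hypothetical samples, both inside the competitive range a >= t/2.
vary_t = [(2.0, 1.0 + 0.5 * k) for k in range(7)]   # a = 2, t from 1 to 4
vary_a = [(0.5 + 0.5 * k, 1.0) for k in range(8)]   # t = 1, a from 0.5 to 4
```

In this sketch `sample_correlation(vary_t)` is positive and `sample_correlation(vary_a)` is negative, which mirrors the argument in the text.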


7.3.3 Experience Goods: Informational 
Differentiation and Goodwill 


The previous analysis dealt with search goods. Advertising
could bring the consumers information about the
good’s existence, retail location, price, characteristics, 
and so forth. Goods were differentiated, because costly 
(and therefore incomplete) advertising campaigns did not 
inform all consumers about all goods. For experience 
goods, informational differentiation may also arise from 
the consumers’ imperfect knowledge of the products’ 
quality or fit with their preferences (see chapter 2 for 
quality and matching models in monopoly). Consumers, 
who learn their utility of consuming a good from experi- 
ence, generally know about only one brand or only a few, 


informed about all prices. When price advertising is prohibited, consumers must
search. There are two classes of consumers: zero-search-cost consumers, who
get costlessly informed about all prices, and high-search-cost buyers, who visit
only one seller. Peters shows that high-production-cost producers are made
better off by advertising restrictions and that those with low production costs
are made worse off. (His other conclusions, such as that advertising restrictions
may cause average prices in an industry to fall, are more ambiguous.)


29. The latter is obtained by maximizing

$$\Phi^2(\bar{s} - c - t/4) + 2\Phi(1 - \Phi)(\bar{s} - c - t/2) - 2(a\Phi^2/2).$$

(An average consumer has transportation cost $t/4$ when receiving two ads,
which occurs with probability $\Phi^2$, and transportation cost $t/2$ when receiving
one ad, which occurs with probability $2\Phi(1 - \Phi)$.) This maximization yields

$$\Phi^* = \frac{2(\bar{s} - c) - t}{2(\bar{s} - c) - 3t/2 + 2a}.$$

Because $a > t/2$, $\Phi^* < 1$. The market-determined advertising level exceeds the
socially optimal level for $a$ close to $t/2$. In contrast, there is socially too little
advertising when $a$ and $t$ are small. (To see this, fix $a$ and $t$ such that $a > t/2$,
and consider the parameters $\{\lambda a, \lambda t\}$. $\Phi^c$ is independent of $\lambda$, whereas $\Phi^*$ tends
to 1 when $\lambda$ tends to 0.)


30. For more on empirical analyses of advertising, see Telser 1964, Schmalensee
1972, Comanor and Wilson 1974, Comanor 1979, Lambin 1976, and Porter
1976.


31. Schmalensee also states two stylized facts related to entry and
concentration: “Among consumer goods industries, advertising intensity increases with
concentration at low levels of concentration; the relation may vanish or change
sign at high levels of concentration” (stylized fact 5.7), and “Advertising
intensity is negatively related to entry in manufacturing industries” (stylized fact
5.9). Grossman and Shapiro (1984) look at free-entry equilibria in their
circular-city model.


because experimenting is costly. Thus, consumers do not


7.4 Concluding Remarks


The idea that product differentiation softens price
competition fits well with the observation that firms often
search for market niches when positioning their products.
However, as we have seen, there are limits to
differentiation. Fixed prices, discrete concentration of demand in the
product space, and cost and demand gains from the
agglomeration of firms all may foster product homogeneity.

Another lesson of this chapter is that free-entry
equilibria yield too many or too few firms. Although
economists generally feel that entry is socially desirable, this
opinion may be based on considerations other than those
discussed in this chapter, such as the existence of barriers
to entry (see chapters 8 and 9).

Through most of this chapter we have assumed that
firms are differentiated along a single dimension
(horizontal, vertical, or informational). Actual differentiation is
multidimensional. An interesting and largely unexplored
issue is that of the demand complementarity and
substitutability of various product characteristics (e.g., quality
and advertising) and of the optimal strategic mix of these
characteristics.

Two other limitations of the models considered in this 
chapter are the firms’ simultaneous entry decision and 
their choice of a single product. Market preemption 
and firm-level brand proliferation are two important real- 
world phenomena that will be explored in the next 
chapters. 


32. See also fo 1984. On price and advertising signaling in a monopoly
situation, see chapter 2 above.

33. The formalization of these important issues is complex. See the original
papers (cited in the text) for more detail.


7.5 Supplementary Section:
Vertical Differentiation and Monopolistic
Competition


7.5.1 Vertical Differentiation


Vertical (quality) differentiation was introduced in chapter
2. Chapter 3 described how a monopolist manipulates the
spectrum of products he puts on the market to better
price-discriminate. Here we shall consider oligopolistic
interaction under quality differentiation. As with
horizontal differentiation, we will first analyze price
competition with given qualities (one per firm) and then look at
the ex ante choice of qualities. The analysis presented
here was developed by Gabszewicz and Thisse (1979,
1980) and Shaked and Sutton (1982, 1983).³⁴ Because the
study of vertical differentiation so closely resembles that
of horizontal differentiation, we will concentrate more on
the points of divergence.

Let the consumers' preferences be described by
$U = \theta s - p$ if the consumer consumes one unit (of quality $s$)
and pays price $p$, and by 0 otherwise. The parameter $\theta$ of
taste for quality is uniformly distributed across the
population of consumers between $\underline{\theta} \geq 0$ and $\bar{\theta} = \underline{\theta} + 1$.
The density is 1.

There are two firms. Firm $i$ produces a good of quality
$s_i$, where $s_2 > s_1$. The unit cost of production is $c$. The
cost is the same for both qualities (we will return to this
assumption later). We make the following assumption for
the moment:

Assumption 1  $\bar{\theta} \geq 2\underline{\theta}$.

This assumption says, roughly, that the amount of
consumer heterogeneity is sufficient for what follows. We
also make a second assumption,

Assumption 2  $c + \dfrac{\bar{\theta} - 2\underline{\theta}}{3}(s_2 - s_1) \leq \underline{\theta} s_1$,

which ensures that in the price equilibrium the market is
“covered” (that is, each consumer buys one of the two
brands).


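Let $\Delta s \equiv s_2 - s_1$ denote the quality differential, and
let $\bar{\Delta} \equiv \bar{\theta}\Delta s$ and $\underline{\Delta} \equiv \underline{\theta}\Delta s$ be the monetary values of
this quality differential for the highest- and the
lowest-demand-for-quality consumers.

We first consider price competition. We look for an
equilibrium in which the market is covered and both firms
compete for consumers. High-$\theta$ consumers buy the
high-quality good; low-$\theta$ consumers buy the low-quality good
(which must be priced lower to attract any consumer). A
consumer with parameter $\theta$ is indifferent between the
two brands if and only if $\theta s_1 - p_1 = \theta s_2 - p_2$. This
yields the following demand functions (see chapter 2):

$$D_1(p_1, p_2) = \frac{p_2 - p_1}{\Delta s} - \underline{\theta},$$

$$D_2(p_1, p_2) = \bar{\theta} - \frac{p_2 - p_1}{\Delta s}.$$

In Nash equilibrium, each firm $i$ maximizes

$$(p_i - c)D_i(p_i, p_j)$$

with respect to $p_i$.
The reaction functions are

$$p_2 = R_2(p_1) = (p_1 + c + \bar{\Delta})/2$$

and

$$p_1 = R_1(p_2) = (p_2 + c - \underline{\Delta})/2.$$

The Nash equilibrium satisfies $p_i^c = R_i(p_j^c)$, which implies

$$p_1^c = c + \frac{\bar{\Delta} - 2\underline{\Delta}}{3} = c + \frac{\bar{\theta} - 2\underline{\theta}}{3}\,\Delta s$$

and

$$p_2^c = c + \frac{2\bar{\Delta} - \underline{\Delta}}{3} = c + \frac{2\bar{\theta} - \underline{\theta}}{3}\,\Delta s > p_1^c.$$

These yield demands

$$D_1^c = (\bar{\theta} - 2\underline{\theta})/3$$

and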

34. Cf. Gabszewicz, Shaked, Sutton, and Thisse 1981. This section follows
Shaked and Sutton 1982 particularly closely. We will use a different
representation of consumer preferences. For consistency, we will stick to the preferences
used in chapters 2 and 3. Bonanno (1986) and Gal-Or (1983) consider vertical
product differentiation under quantity competition rather than price
competition and report that, in general, Cournot competition does not lead to less
product differentiation than price competition.

$$D_2^c = (2\bar{\theta} - \underline{\theta})/3$$

and profits

$$\Pi^1(s_1, s_2) = (\bar{\theta} - 2\underline{\theta})^2\Delta s/9$$

and

$$\Pi^2(s_1, s_2) = (2\bar{\theta} - \underline{\theta})^2\Delta s/9.$$

The high-quality firm charges a higher price than the
low-quality producer. It also makes a higher profit.

As with the horizontal model, undifferentiated firms
($\Delta s = 0$) charge their marginal cost and make no profit.
When we look at the choice of quality, we will thus
obtain the principle of differentiation.

Consider now a two-stage game in which firms first
compete in quality (one per firm) and then compete in
price. Assume for the moment that the choice of quality
is costless. Suppose further that $s_i$ must belong to $[\underline{s}, \bar{s}]$,
where $\underline{s}$ and $\bar{s}$ satisfy assumption 2.³⁵ In the first stage,
firm 1 maximizes $\Pi^1(s_1, s_2)$ over $s_1$, and similarly for firm
2. Let us look for pure strategies. Because undifferentiated
firms make no profit, $s_1$ and $s_2$ will differ in equilibrium.
Suppose, e.g., that $s_1 < s_2$. Because both firms make more
profit when they are more differentiated, firm 1 gains
from reducing its quality toward $\underline{s}$ and firm 2 from
increasing its quality toward $\bar{s}$. Hence, there are two pure
Nash equilibria in location: $\{s_1^c = \underline{s}, s_2^c = \bar{s}\}$ and the one
obtained by reversing the firms' indices. Both equilibria
exhibit maximal differentiation. The intuition is the same
as for the spatial model: Firms try to relax price
competition through product differentiation. Of course, if one of
the firms entered first (sequential choice of quality), that
firm would choose the high quality $\bar{s}$ and the other the
low quality $\underline{s}$, so the equilibrium would be unique. This
suggests the possibility, in real time, of both firms trying
to be first. To describe such a preemption game, one
would introduce a (possibly time-decreasing) cost of
introducing each quality, and possibly a rate of growth of
demand (formalized, for instance, by the rate of growth of
the density of consumers). The firms then would face a
trade-off between obtaining a leadership position (i.e.,
occupying the most profitable niche) and introducing the
good too early (see chapters 8 and 10 for such
preemption games).
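The price equilibrium and the incentive to differentiate can be reproduced in a few lines. The following is a sketch under assumptions 1 and 2 of the text only; the function name and the explicit error checks are additions of this example.

```python
def price_equilibrium(theta_low, s1, s2, c):
    """Equilibrium prices and profits for qualities s1 < s2, with tastes
    uniform on [theta_low, theta_low + 1] and a common unit cost c."""
    theta_high = theta_low + 1.0
    if not s2 > s1:
        raise ValueError("firm 2 must be the high-quality firm")
    if theta_high < 2.0 * theta_low:
        raise ValueError("assumption 1 is violated")
    ds = s2 - s1
    p1 = c + (theta_high - 2.0 * theta_low) / 3.0 * ds
    p2 = c + (2.0 * theta_high - theta_low) / 3.0 * ds
    if p1 > theta_low * s1:
        raise ValueError("assumption 2 is violated: market not covered")
    # Demands follow from the indifferent consumer (p2 - p1) / ds.
    d1 = (p2 - p1) / ds - theta_low
    d2 = theta_high - (p2 - p1) / ds
    return p1, p2, (p1 - c) * d1, (p2 - c) * d2
```

For example, with `theta_low = 0.5` and `c = 0`, moving from `(s1, s2) = (2, 3)` to `(1.5, 3)` raises the profits of both firms, which is the force behind maximal differentiation.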

The maximal-differentiation result is interesting be- 
cause it formalizes the effect of strategic behavior in an 
extreme way. Even though quality is costless to produce, 
the low-quality firm gains from reducing its quality to the 
minimum because this softens price competition (lower 
quality otherwise reduces demand). This result is, however,
not very robust. In particular, if the lowest quality is
fairly low (i.e., if assumption 2 is invalid), the low-quality
firm ends up facing no demand; this effect prevents 
maximal differentiation (it is reinforced if marginal pro- 
duction costs differ, so that price competition is less in- 
tense). But the principle of differentiation is more robust. 

An interesting phenomenon arises when assumption 1
is violated. Suppose that $\bar{\theta} < 2\underline{\theta}$ (low consumer
heterogeneity). Then, in the price equilibrium, firm 1 faces no
demand. It charges $c$, and firm 2 charges $c + (\bar{\Delta}/2)$. Firm
1 makes no profit, and firm 2 makes profit $\bar{\Delta}/2$. Thus,
even though we assumed costless entry and constant
returns to scale, there is only one firm making positive
profits in the markets, and the other firm does not gain
any share of the market. This property contrasts with the
location model. In that model, under costless entry, a firm
always enters and has a positive market share. By
charging just above the marginal cost, it always makes a
positive profit from consumers located nearby (given that its
rivals never charge under the marginal cost). With low
consumer heterogeneity, intense price competition drives
the low-quality firm out. The intuition is that if the
lower quality is “low” it cannot compete with the higher
quality, whereas if it is “high” (close to the higher quality)
it triggers rough price competition, which swamps the
increase in demand associated with the increase in quality.

More generally, Shaked and Sutton (1983) show the
following “finiteness result”³⁶: Suppose that quality $s$
costs $c(s)$ per unit ($c$ was constant in our example).
Suppose further that, if all qualities were produced and sold
at marginal cost $p(s) = c(s)$, all consumers would buy the
highest quality.³⁷ Then there can be at most a finite
number of firms with positive market share in the industry
(independent of the relative sizes of demand and entry
costs). Again, price competition among the high-quality
firms drives prices down to a level at which there is no
room for low-quality products. This result must be
contrasted with that obtained in the location model of section
7.1, in which, when the entry cost tends to zero or when
the density of consumers tends to infinity, the equilibrium
number of firms tends to infinity (and the prices converge
to marginal cost).

In a sense, the finiteness property requires that the
marginal cost of quality not increase too quickly with
quality. Shaked and Sutton thus conclude that this
finiteness property is more likely to hold if the main burden of
quality improvement falls on research-and-development
costs or fixed costs rather than on more variable costs,
such as those of labor and raw materials.


35. That is, $c + [(\bar{\theta} - 2\underline{\theta})/3](\bar{s} - \underline{s}) \leq \underline{\theta}\,\underline{s}$. This implies that assumption 2 is
satisfied for any quality choices.

36. The Shaked-Sutton model actually differs from this one in two respects.
First, they use a different family of consumer preferences; second, they study
a three-stage game, with sequential entry, quality, and price decisions.

37. Consumer $\theta$ would maximize $\theta s - c(s)$. Assuming $c' > 0$ and $c'' > 0$, the
condition is equivalent to $\underline{\theta} > c'(\bar{s})$; that is, the lowest-valuation-for-quality
consumer would choose the highest quality $\bar{s}$. This is satisfied in particular in
our model, in which $c' = 0$. More generally, the assumption is that potential
qualities are ranked in the same way by all consumers if sold at marginal cost.


7.5.2 A Symmetric Model of Monopolistic
Competition


In the horizontal- and the vertical-differentiation model, a
product competes more with some products (its close
neighbors in the product space) than with others. The
purpose of this subsection is to introduce a model (due to
Dixit and Stiglitz [1977] and Spence [1976]) in which
there is no such asymmetry in the substitutability of
various products in the industry. Other differences with the
horizontal- and the vertical-differentiation model are that
there exists a single, representative consumer (i.e., there is
no heterogeneity in taste) and that this consumer
consumes a little bit of every available good instead of
consuming only his most preferred product.

The model has two sectors. The utility function of the
“representative” consumer has as its two arguments $q_0$
(the quantity consumed of the unique good produced by
the first sector³⁸) and a “subutility function” that depends
on the consumptions $q_i$ of all goods $i$ in the second sector
(which is called “the sector of differentiated products”).
More precisely,

$$U = U\left(q_0, \left(\sum_{i=1}^{n} q_i^{\rho}\right)^{1/\rho}\right).$$

Thus, the subutility function for the differentiated goods
has the constant-elasticity-of-substitution (CES) form. We
assume $U$ to be concave, which, in particular, requires that
$\rho \leq 1$. If $p_i$ is the price of the differentiated product $i$, the
representative consumer maximizes $U$ subject to the
budget constraint

$$q_0 + \sum_{i=1}^{n} p_i q_i \leq I,$$

where $I$ is the (exogenous) income of the representative
consumer.³⁹

The number of potential producers in the differentiated 
sector is infinite. Each producer i is identified by a good i. 
The production of the differentiated good i involves a 
fixed cost f and a marginal cost c, both denominated in 
terms of the numéraire. 

Because of the fixed cost, only a finite number n of 
differentiated goods will be produced (of course, nf < I). 
To simplify calculations, we will assume that n is large. 
Finally, we will make the free-entry assumption, so that 
the profit of entering firms is zero. For this reason, 
profit is not relevant to the income of the representative 
consumer.

As was mentioned earlier, the utility function chosen is 
very specific, as it treats all the differentiated products in 
a symmetric way. When a firm introduces a product, it 
does not choose its degree of differentiation relative to 
other products. The utility function used here is thus
somewhat abstract, but it allows us to concentrate on the 
entry decision (“0 or 1”) without complicating this by a 
simultaneous choice of “location.” 

Maximization of consumer utility with respect to $q_i$
yields (after substitution of the budget constraint into $U$)

$$p_i U_1 = U_2\left(\sum_{j=1}^{n} q_j^{\rho}\right)^{(1/\rho) - 1} q_i^{\rho - 1}, \tag{7.18}$$

where $U_h$ is the partial derivative of $U$ with respect to its
$h$th argument. Since $n$ is large, a change of $q_i$ has little
effect on

$$\sum_{j=1}^{n} q_j^{\rho}$$


38. This unique good is taken as the numéraire.

39. For example, we may assume that the consumer generates $I$ units of the
nondifferentiated good by his labor.

and, therefore, little effect on $U_1$ and $U_2$. The resulting
demand function for product $i$ can be approximated as

$$q_i = k\,p_i^{-1/(1 - \rho)} \quad (k > 0).$$

Therefore, the demand elasticity for product $i$ is
approximately

$$\varepsilon_i = -\frac{\partial q_i/\partial p_i}{q_i/p_i} = \frac{1}{1 - \rho}.$$

The limiting case where $\rho = 1$ corresponds to the
situation where products are perfect substitutes for one
another.

The producer of good $i$, if he decides to enter, chooses
$p_i$ in order to maximize his profit:

$$\max_{p_i}\,[(p_i - c)q_i - f].$$

It follows that

$$p_i(1 - 1/\varepsilon_i) = c$$

(see chapter 1), or

$$p_i = c/\rho. \tag{7.19}$$

The less substitutable are the differentiated products,
the higher the price. Now, we determine the number of
firms $n$ by imposing the zero-profit condition. Given
the symmetry of the problem, all the firms of the
differentiated sector produce the same quantity: $q_i = q$. Hence,
the zero-profit condition can be written as

$$(c/\rho - c)q = f. \tag{7.20}$$

Using equation 7.18, we derive

$$\frac{c}{\rho}\,U_1 = U_2\,q^{\rho - 1}(nq^{\rho})^{(1/\rho) - 1}$$

or

$$c\,U_1\left(I - \frac{ncq}{\rho},\, n^{1/\rho}q\right) = \rho\,n^{(1 - \rho)/\rho}\,U_2\left(I - \frac{ncq}{\rho},\, n^{1/\rho}q\right). \tag{7.21}$$

The problem is now solved: $q$ is given by equation
7.20; and after the substitution of $q$, equation 7.21
determines the number of firms, $n$. Let $(q^c, n^c)$ be these two
numbers.
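To see the two steps at work, one needs a specific utility function. The sketch below assumes, purely for illustration, the Cobb-Douglas form $U(q_0, Q) = q_0^{1-\alpha}Q^{\alpha}$, which is not taken from the text; it also treats $n$ as a continuous variable and solves equation 7.21 by bisection. All names are hypothetical.

```python
def free_entry_outcome(c, f, rho, income, alpha):
    """Price, output per firm, and number of firms under free entry,
    assuming U(q0, Q) = q0**(1 - alpha) * Q**alpha (an illustrative choice)."""
    price = c / rho                      # equation (7.19)
    q = f * rho / (c * (1.0 - rho))      # equation (7.20): (c/rho - c) q = f

    def gap(n):
        """Left-hand side minus right-hand side of equation (7.21)."""
        q0 = income - n * c * q / rho
        big_q = q * n ** (1.0 / rho)
        u1 = (1.0 - alpha) * q0 ** (-alpha) * big_q ** alpha
        u2 = alpha * q0 ** (1.0 - alpha) * big_q ** (alpha - 1.0)
        return c * u1 - rho * n ** ((1.0 - rho) / rho) * u2

    # n is bounded by the budget: spending n * (c/rho) * q cannot exceed income.
    n_max = income * rho / (c * q)
    lo, hi = 1e-9 * n_max, (1.0 - 1e-9) * n_max
    for _ in range(200):                 # gap() is negative below the root
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return price, q, 0.5 * (lo + hi)
```

Under this particular utility the consumer spends the share $\alpha$ of income on differentiated goods, so the numerical solution can be compared with $n = \alpha I(1 - \rho)/f$; for `c = 1, f = 1, rho = 0.5, income = 100, alpha = 0.3` both give 15 firms.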

We now compare this free-market outcome with that
chosen by a social planner. There are several assumptions
that can be made concerning what the social planner can
do. One possibility (considered in different models
by von Weizsäcker [1980] and Mankiw and Whinston
[1986]) is that he can control only entry (i.e., $n$). Another
is that he can also regulate prices (i.e., $n$ and $q$). One
cannot, in the abstract, determine the relevant benchmark,
which depends on the social planner's feasible set of
interventions. We here consider the “first-best” benchmark, in
which the social planner chooses both the level of entry
($n^*$) and the output of entering firms ($q^*$). To be sure, the
planner would price at marginal cost $c$. He would finance
the fixed costs ($n^*f$) by a lump-sum tax on the
consumer's income. Then the consumer would choose the
quantity $q$ of each differentiated product (identical, from
the fact of the symmetry of the problem) in order to
maximize as follows:

$$\max_{q}\; U\left(I - nf - ncq,\; q\,n^{1/\rho}\right).$$

Consequently, the planner would choose $n$ in order
to maximize the corresponding indirect utility function.
Therefore, it suffices to maximize $U$ with respect to $q$ and
$n$ (for which we can use the “envelope theorem”). Then
we would obtain two equations in two unknowns, from
which we would solve for $q^*$ and $n^*$.

After these calculations, it is possible to compare $q^c$
with $q^*$, and $n^c$ with $n^*$. For practical purposes, taking a
simple functional form, as Dixit and Stiglitz have done,
facilitates these calculations. Those authors show that the
comparison between $q^c$ and $q^*$ depends crucially on the
derivative of the “rate of appropriability of the surplus by
the firm” in their example. This rate, $\gamma(q)$, is defined as
the ratio of the firm's total revenue to the gross
consumer surplus generated by the introduction of the
differentiated product:

$$\gamma(q) \equiv \frac{pq}{S(q)}.$$

In their example, $q^c$ is larger or smaller than $q^*$ depending
on whether $\gamma$ increases or decreases with $q$.⁴⁰ Quite


40. Dixit and Stiglitz's example is slightly more general with regard to the
second argument of $U$. They use $\sum_{i=1}^{n} v(q_i)$ for this argument. For the utility
function used above, we can show that $\gamma(q) = \rho$. In this case $\gamma'(q) = 0$ and
$q^c = q^*$.




Exercise 7.2 

(i) The demand functions are given by 

$$D_i(p_i, p_j) = (p_j - p_i + t)/2t.$$

(See chapter 2.) Firm i maximizes 

$$(p_i - c_i)\,D_i(p_i, p_j),$$

which yields 

$$p_i = R_i(p_j) = (p_j + t + c_i)/2.$$

The Bertrand-Nash equilibrium satisfies 

$$p_i = R_i(R_j(p_i)),$$

or 

$$p_i(c_i, c_j) = t + \frac{2c_i + c_j}{3}.$$

The reduced-form profits are 

$$\Pi^i(c_i, c_j) = \Bigl(t + \frac{c_j - c_i}{3}\Bigr)^2 \Big/ 2t.$$

(ii) This is straightforward. 

(iii) In the first stage, firm i maximizes 

$$[p_i(c_i, c_j) - c_i]\,[p_j(c_i, c_j) - p_i(c_i, c_j) + t]/2t - \phi(c_i).$$

Using the envelope theorem, we can ignore the derivative 
with respect to $p_i$. We are thus left with two terms 
(besides the marginal cost of investment, $\phi'(c_i)$): $-D_i$ (the 
direct effect: a unit cost reduction operates on $D_i$ units of 
demand) and 

$$(p_i - c_i)\,\frac{\partial p_j}{\partial c_i} \Big/ 2t > 0$$

(the strategic effect: firm i's cost reductions imply a decrease 
in $p_i$, and therefore a decrease in $p_j$ because the 
reaction curves are upward sloping). 
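
The closed forms in part (i) can be checked numerically. The following is a minimal Python sketch, assuming an interior equilibrium (a cost gap smaller than 3t, so that both firms sell) and using illustrative function names that do not come from the text. It iterates the two reaction functions and lets one compare the limit with the closed-form equilibrium prices and reduced-form profits.

```python
def best_response(p_other, own_cost, t):
    """Reaction function R_i(p_j) = (p_j + t + c_i) / 2."""
    return (p_other + t + own_cost) / 2.0


def equilibrium_price(own_cost, rival_cost, t):
    """Closed-form Bertrand-Nash price p_i = t + (2 c_i + c_j) / 3."""
    return t + (2.0 * own_cost + rival_cost) / 3.0


def reduced_form_profit(own_cost, rival_cost, t):
    """Reduced-form profit (t + (c_j - c_i) / 3)^2 / (2t)."""
    return (t + (rival_cost - own_cost) / 3.0) ** 2 / (2.0 * t)


def iterate_best_responses(c1, c2, t, rounds=200):
    """Apply both reaction functions repeatedly, starting from prices
    equal to costs. The reaction curves have slope 1/2, so the
    iteration converges to the Bertrand-Nash prices."""
    p1, p2 = c1, c2
    for _ in range(rounds):
        p1, p2 = best_response(p2, c1, t), best_response(p1, c2, t)
    return p1, p2
```

For instance, `iterate_best_responses(1.0, 1.3, 2.0)` can be compared with `equilibrium_price(1.0, 1.3, 2.0)` and `equilibrium_price(1.3, 1.0, 2.0)`; the parameter values are arbitrary.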


Exercise 7.3 


Assume that there are n firms. Consider firm i's choice 
of $p_i$, given that the other firms charge p. A consumer 
located at a distance of x < 1/n from firm i is indifferent 
between firm i and the nearest competitor if 

$$p_i + tx^2 = p + t(1/n - x)^2.$$

This yields the following demand function: 

$$D_i(p_i, p) = 2x = \frac{1}{n} + \frac{n(p - p_i)}{t}.$$

Maximizing $(p_i - c)D_i$ and using symmetry yields 

$$p = c + t/n^2.$$

Profits per firm are 

$$\Pi = \Bigl(\frac{t}{n^2}\Bigr)\frac{1}{n} - f.$$

The zero-profit condition implies $n^c = (t/f)^{1/3}$ and 
$p^c = c + t^{1/3} f^{2/3}$. To get the social optimum, minimize total 
costs (because of unit demands and covered market, there 
is no distortion of consumption due to market power): 

$$\min_n \Bigl(nf + 2nt\int_0^{1/2n} x^2\,dx\Bigr) = \min_n \Bigl(nf + \frac{t}{12n^2}\Bigr).$$

This yields $n^* = [t/6f]^{1/3}$. 
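
The comparison between free entry and the social optimum can be illustrated with a short Python sketch. It treats n as a continuous variable (ignoring the integer problem), and the function names are illustrative rather than taken from the text. Under the formulas above, the ratio of the free-entry number of firms to the optimal number is the cube root of 6 regardless of t and f, so free entry yields too many firms in this example.

```python
def free_entry_firms(t, f):
    """Zero-profit number of firms: n^c = (t / f)^(1/3)."""
    return (t / f) ** (1.0 / 3.0)


def optimal_firms(t, f):
    """Cost-minimizing number of firms: n* = (t / (6 f))^(1/3)."""
    return (t / (6.0 * f)) ** (1.0 / 3.0)


def profit_per_firm(n, t, f):
    """Symmetric-equilibrium profit t / n^3 - f."""
    return t / n ** 3 - f


def total_cost(n, t, f):
    """Fixed costs plus transportation costs: n f + t / (12 n^2)."""
    return n * f + t / (12.0 * n ** 2)
```

With the arbitrary values t = 48 and f = 1, `optimal_firms` gives 2 firms, while `free_entry_firms` gives roughly 3.6.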

Exercise 7.4 


See Salop 1979. 


Entry, Accommodation, 
and Exit 


In the preceding chapter we saw how fixed costs (or, 
more generally, increasing returns) generate an imperfectly 
competitive market structure by limiting entry. 
However, even when fixed costs do restrict entry, positive 
(supranormal) profits are not ensured. Indeed, in the 
free-entry equilibrium, the firms make zero profit (up to 
the integer problem). In order to explain why the profit 
rate is systematically greater in certain industries than in 
others, some type of restriction to entry must exist in 
these industries to prevent other firms from taking advantage 
of the profitable market situations. Along these lines, 
Bain (1956) defined as a barrier to entry anything that 
allows incumbent firms to earn supranormal profits without 
threat of entry.1 

Occasionally government restricts entry, for example 
by introducing permits, licenses, patents, and taxi 
medallions. These restrictions may generate above-normal 
profits.2 Other examples include the use of certain government 
purchasing policies or the granting of import licences 
(in situations that are not already domestically competitive, 
perhaps because of significant fixed costs) to form 
domestic monopolies.3 In this chapter we consider barriers 
to entry not created by government. 

Bain (1956) informally identified four elements of market 
structure that affect the ability of established firms to 
prevent supranormal profits (rents) from being eroded by 
entry: 


1. Stigler (1968) offered an alternative definition based on cost asymmetries 
between incumbents and entrants. Von Weizsäcker's definition (1980a, p. 400) 
that “a barrier to entry is a cost of producing that must be borne by a firm 
which seeks to enter an industry but is not borne by firms already in the 
industry and that implies a distortion in the allocation of resources from the 
social point of view” is related to Stigler’s. For comprehensive treatments of 
barriers to entry, see Encaoua et al. 1986 and von Weizsäcker 1980b. 


2. In New York, a taxi medallion sells for $100,000. That can be interpreted 
as the present discounted value of the positive profits to be earned in the 
market, entry into which is legally restricted. 


3. Another institutional barrier to entry may well be the lags and costs 
imposed by regulatory processes. For instance, MCI spent $10 million in 


regulatory and legal costs and waited seven years to gain permission to 
construct a microwave system, which cost $2 million and took seven months to 
complete. The established regulated firm, AT&T, which had a staff of lawyers 
and economists expert in regulatory matters, skillfully argued there was no 
need for the new service and that MCI only intended to enter the profitable 
part of the market, which, AT&T claimed, was used to subsidize some less 
profitable services (“cream skimming”). For criticisms of the Noerr-Pennington 
doctrine, which (particularly in the AT&T case) shields businesses from liability 
for their participation in governmental proceedings, see Brock and Evans 1983, 
and Brock 1983b. Those authors argue that business’ interference in the 
regulatory process may be pure waste and that, because abuses are unlikely to 
be caught, this interference (called “regulatory-process predation”) should be 
dealt with severely. 


Economies of scale (e.g., fixed costs) Bain argued that if the 
minimum efficient scale is a significant proportion of the 
industry demand, the market can sustain only a small 
number of firms that make supranormal profits without 
inviting entry. This argument is examined in section 8.1, 
where we examine natural monopoly or oligopoly situations 
and the theory of contestability. See also section 
8.6.1. 


Absolute cost advantages The established firms may own 
superior production techniques, learned through experience 
(learning by doing) or through research and development 
(patented or secret innovations). They may 
have accumulated capital that reduces their cost of production. 
They may also have foreclosed the entrants’ access 
to crucial inputs through contracts with suppliers. In 
sections 8.2 and 8.6.1 we consider the accumulation 
of capital by incumbents. Section 4.6.2 examined the 
market-foreclosure doctrine. R&D activity is studied in 
chapter 10. 

Product-differentiation advantages Incumbents may have 
patented product innovations (which, of course, can be 
seen as a cost advantage relative to the product), or they 
may have cornered the right niches in the product space, 
or they may enjoy consumer loyalty. (The niche argument 
is examined in section 8.6.2.) 

Capital requirements According to this controversial element 
of entry barriers, entrants may have trouble finding 
financing for their investments because of the risk to the 
creditors. One argument is that banks are less eager to 
lend to entrants because they are less well known than 
incumbents; another (which will be examined in section 
9.7) is that entrants may be prevented from growing as 
incumbents inflict losses on them in the product market in 
order to reduce their ability to find financing for new 
investments. 


Bain also suggested three kinds of behavior by incumbents 
in the face of an entry threat: 


Blockaded entry The incumbents compete as if there were 
no threat of entry. Even so, the market is not attractive 
enough to entrants. 


Deterred entry Entry cannot be blockaded, but the in- 
cumbents modify their behavior to successfully thwart 
entry. 

Accommodated entry The incumbents find it (individually) 
more profitable to let the entrant(s) enter than to erect 
costly barriers to entry. 


Bain’s suggestions obviously begged for further analy- 
sis. The most famous model of barriers to entry is the “limit 
pricing model” (Bain 1956; Sylos-Labini 1962; Modigliani 
1958), the basic idea of which is that, under some circum- 
stances, incumbent firms may sustain a price so low that 
it discourages entry. This story remained controversial 
until Spence (1977), Dixit (1979, 1980), and Milgrom and 
Roberts (1982) clarified its underlying aspects.4 Very 
roughly, the Spence-Dixit reconsideration (section 8.2 
below) offers to regard the Stackelberg model of sequen- 
tial quantity competition as one of sequential capacity 
choices. That is, although product-market competition (if 
any) determines the market price in the short run, in the 
longer run firms compete through the accumulation of 
capacity. (See chapter 5 for the reinterpretation of quantities 
as capacities.) An incumbency advantage (the possibility 
of early capital accumulation) leads the incumbent 
firm to accumulate a large capacity (and therefore to 
charge a low price) in order to deter or limit entry. 
The Milgrom-Roberts reconsideration of the limit-pricing 
story (studied in chapter 9) is based on the asymmetry of 
information between the incumbent and the entrant. In 
their model, the incumbent charges a low price not be- 
cause he has a large productive capacity (capacity con- 
straints play no role there) but because he tries to convey 
the information that either the demand or his own mar- 
ginal cost is low, thus signaling a low profitability of 
entry to the potential entrant(s). These two models have 
fairly distinct positive and normative implications. 

Erecting barriers to entry is only one aspect of strategic 
competition. Inducing exit of rivals is another. And even 
if neither entry nor exit is at stake (the “accommodation” 
case), firms battle for market shares. Chapter 6 examined 
examples of such battles, in which firms repeatedly com- 
pete in price. Firms also compete in non-price aspects 


4. The controversy is due to the fact that the timing of the underlying 
strategic instruments was not completely described (for instance, 
the “Sylos-Labini postulate” holds that potential entrants expect established 
firms to maintain the same output if entry occurs, yet the story is named 
“limit pricing”), nor was the commitment value of either quantity or price 
carefully examined. 


(capacities, technology, R&D, advertising, product differentiation, 
etc.). Chapters 5 and 7 offered examples of 
non-price competition, but there we focused on once-and-for-all 
(static) situations in which firms choose their non-price 
variables simultaneously; the important possibility 
of influencing rivals’ subsequent non-price behavior was 
ignored. This chapter examines strategic interaction in a 
dynamic context. 

There are a variety of business strategies available to a 
firm, depending on whether it wants to deter entry, to 
induce exit, or (if those goals are too costly) to do battle 
with its rivals. As we will see, optimal strategies also 
depend on whether reaction curves are sloping upward 
(strategic complements) or downward (strategic substitutes). 
Section 8.3 offers a taxonomy of relevant business 
strategies, all of them meant to soften the rivals’ behavior. 
Section 8.4 applies these strategies to a number of strategic 
situations. 

The excellent surveys of Gilbert (1986, 1987), Kreps 
and Spence (1984), Shapiro (1986), and Wilson (1984) 
address some of the points raised in this chapter. Much 
of the material of this chapter and the following one 
is derived from Fudenberg and Tirole 1986 (see also 
Fudenberg and Tirole 1984). Section 8.1 draws from 
Fudenberg and Tirole 1987. 


8.1 Fixed Costs: Natural Monopoly and 
Contestability 


This section addresses the role of fixed costs as a barrier 
to entry. Recall Bain’s argument that under increasing 
returns to scale, only a finite number of firms are viable, 
and these firms make positive (supranormal) profits without 
triggering entry; for instance, if potential entrants 
know that a duopoly yields negative profits, an established 
firm can quietly enjoy a monopoly profit without 
worrying about the threat of entry. This conclusion was 
challenged by Baumol, Panzar, and Willig (1982), who 
argued that having one or a limited number of firms does 
not mean there is no competition and that potential competition 
(the threat of entry) may serve to discipline 
established firms.5 


8.1.1 Fixed Costs versus Sunk Costs 


In a one-period (i.e., timeless) view of the world, a fixed 
cost is easily defined as a cost that a firm must incur in 
order to produce and that is independent of the number 
of units of output. For instance, a firm may incur cost 
C(q) = f + cq for q>0 and cost C(q) =0 for q=0. 
(Fixed costs are instances of increasing returns to scale. 
See the chapter on the theory of the firm for the notions 
of subadditivity and natural monopoly.) The timeless 
model of production is, of course, an abstraction. Once 
time is introduced, one must carefully define the notion of 
production period. To see this, suppose (with Weitzman 
[1983]) that a firm produces output q > 0 per period in 
two consecutive periods at cost 2(f + cq), where f is the 
per-period fixed cost. Absent entry and exit costs, it 
would be cheaper to produce output 2q in the first period 
and 0 in the second. This would cost f + 2cq and save f. 
(We ignore interest and storage costs, assuming that the 
lag between the periods is short; we also ignore uncer- 
tainty about future demand, which may lead firms to wait 
to produce future supply.) More generally, dividing the 
production period by 2 and doubling the production in- 
tensity saves on fixed costs, so that all production should 
take place over a very short interval of time and fixed 
costs should be negligible relative to variable costs. To 
avoid this extreme conclusion, it is important to realize 
that fixed costs are always sunk to some extent. The 
presence of market imperfections prevents instantaneous 
rental of capital or hiring of labor. Or the firm may need 
to buy up front specific investment that has no intrinsic 
value to other firms (and therefore has no value on a 
second-hand market) and cannot be allocated to another 
use within the firm. 
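
The halving argument can be made concrete with a small Python sketch. It assumes, purely for illustration, that the fixed cost accrues at a constant rate per unit of active production time, so that a period of unit length carries a fixed cost f equal to that rate; the function and parameter names are hypothetical. Each halving of the active time then cuts the fixed-cost bill in half while the variable cost of the same total output is unchanged.

```python
def total_cost(total_output, active_time, fixed_cost_rate, unit_cost):
    """Cost of producing total_output while active for active_time.

    fixed_cost_rate is an assumed fixed cost per unit of active time,
    so a period of length 1 carries a fixed cost f = fixed_cost_rate.
    """
    return fixed_cost_rate * active_time + unit_cost * total_output


def costs_under_repeated_halving(total_output, fixed_cost_rate, unit_cost, halvings):
    """Total cost after halving the active time 0, 1, ..., halvings times,
    starting from two periods of unit length."""
    return [
        total_cost(total_output, 2.0 / 2 ** k, fixed_cost_rate, unit_cost)
        for k in range(halvings + 1)
    ]
```

With f = 10, c = 1, and q = 5 per period (total output 10), the first entry of the list is 2(f + cq) and the second is f + 2cq, and later entries approach the pure variable cost. This is the extreme conclusion that the notion of commitment is meant to rule out.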

We will define fixed costs as costs that are independent 
of the scale of production and are locked in (committed, 
sunk) for some short length of time, which defines the 
“period.” For example, suppose that deciding to produce 
a positive quantity requires a firm to immobilize machines, 
capital, land, legal, public relations, and advertising services, 
and general staff for one month. The firm cannot 
get away with incurring half of the relevant fixed costs 
by doubling its production rate during fifteen days, then 
stopping production and saving the remaining half during the second 
fortnight (and possibly resuming production thereafter). 
Thus, one can envision a discrete-time model in which a 
firm incurs a cost of f + cq in each period if it produces 
at that date and zero otherwise. The real time length of 
each period is the length of time over which the 
cost is incurred.6 


5. See Baumol et al. 1982 for further references. See also Brock 1983a, Spence 
1983, Baumol et al. 1986, and Schwartz 1986. 

The distinction between “fixed costs” and “sunk costs” 
is one of degree, not one of nature. Fixed costs are sunk 
only in the short run. (Of course, there is the question 
of how short the short run is, and how the length of 
the commitment to investments compares with the time scale 
of product competition, e.g., of price changes. We will 
come back to this issue when discussing the contestability 
theory.) Sunk costs are those investment costs that produce 
a stream of benefits over a long horizon but can 
never be recouped. A machine will be labeled a fixed cost 
if the firm rents it for a month (or can sell it without 
capital loss a month after its purchase) and a sunk cost if 
the firm is stuck with it. 

The notions of fixed and sunk costs are idealizations 
for several reasons. First, there is clearly a continuum 
of degrees of commitment between these two polar cases 
of short and eternal commitment. Second, both notions 
assume that the investment cost cannot be recouped at 
all during the commitment period (whatever it is). In 
practice, a machine would have some value lower than 
its original value on the second-hand market. Also, leasing 
and labor contracts can be breached at some penalty 
cost. Thus, commitment is not quite an all-or-nothing 
notion. What we really mean by period of commitment is 
a period of time over which the cost of being freed from 
the commitment within the period is sufficiently high that 
it does not pay to be freed. For simplicity, we will content 
ourselves with assuming that investment costs are completely 
sunk for the whole period. Third, and a related 
point, our notion of commitment is largely a purely technological 
one (though filtered through the existing set of 
input-market institutions). In practice, the date at which a 
firm resells its assets or modifies its rental or labor contracts 
may also depend on how well the firm is doing in 
the product market and on strategic considerations in this 
market. 


8.1.2 Contestability 


Following Baumol et al. 1982, let us consider a homogeneous-good 
industry with n firms. All firms have the 
same technology, and producing output q costs C(q) with 
C(0) = 0. We split the set of firms into two groups: m 
“incumbents” (without loss of generality, we can assume 
that the incumbents are firms i = 1, ..., m) and n − m 
> 0 “potential entrants.” 

An industry configuration is a set of outputs $\{q_1, \ldots, q_m\}$ 
for the incumbents and a price p charged by all 
incumbents (the potential entrants stay out of the market). 

The industry configuration is feasible if the market clears 
(i.e., if total output is equal to total demand at price p: 
$\sum_{i=1}^{m} q_i = D(p)$) and if firms make non-negative profits 
(for any incumbent firm, $p q_i \geq C(q_i)$). It is sustainable if no 
entrant can make a profit taking the incumbents’ price as 
given (there do not exist a price $p^e \leq p$ and an output 
$q^e \leq D(p^e)$ such that $p^e q^e > C(q^e)$). 

A perfectly contestable market is one in which any equi- 
librium industry configuration must be sustainable. 
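
The two definitions can be turned into a rough numerical check. The Python sketch below is only an illustration under stated assumptions: the demand and cost functions are supplied by the user as callables, the search for a profitable entry plan (an entrant price no higher than the incumbents' price and an entrant output no larger than demand at the entrant's price) is done on a finite grid rather than exactly, and all names are hypothetical.

```python
def is_feasible(price, outputs, demand, cost, tol=1e-9):
    """Market clears at `price` and every incumbent earns non-negative profit."""
    clears = abs(sum(outputs) - demand(price)) <= tol
    viable = all(price * q >= cost(q) - tol for q in outputs)
    return clears and viable


def is_sustainable(price, demand, cost, price_steps=2000, output_steps=50, tol=1e-9):
    """Grid search for a profitable entry plan.

    Returns False as soon as some entrant price p_e <= price and output
    q_e <= demand(p_e) yield strictly positive profit; returns True if
    no such plan is found on the grid (an approximation, not a proof).
    """
    for i in range(1, price_steps + 1):
        p_e = price * i / price_steps
        d = demand(p_e)
        for j in range(1, output_steps + 1):
            q_e = d * j / output_steps
            if q_e > 0 and p_e * q_e - cost(q_e) > tol:
                return False
    return True
```

As a usage example, with demand D(p) = 10 − p and cost C(q) = 4 + 2q for q > 0 (numbers chosen arbitrarily), a single incumbent charging 4 and selling 6 units is feasible but not sustainable on this grid, since an entrant matching or slightly undercutting that price would earn a positive profit.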

These definitions extend straightforwardly to multi- 
product technologies; it suffices to allow outputs and 
prices to be multidimensional vectors. Indeed, the theory 
of contestability has been partly motivated by multi- 
product technologies, and some of its interesting devel- 
opments are related to the issue of “cross-subsidization.” 
(See footnote 7 below.) 

Here we will content ourselves with an exposition of 
the single-product case. 

To illustrate the concept of sustainability, let us consider 
our standard example of increasing-returns technology: 

$$C(q) = f + cq.$$

Let 

$$\tilde{\Pi}^m \equiv \max_q \{[P(q) - c]\,q\}$$

denote the monopoly profit gross of the fixed cost. Assume 
that a monopoly is viable: $\tilde{\Pi}^m > f$. Figure 8.1 depicts 
the unique sustainable configuration in this industry. 
There exists only one incumbent in the industry, charging 
price $p^c$ and supplying output $q^c$. The other firms stay 
out. The contestable price-output pair $\{p^c, q^c\}$ is obtained 
from the intersection of the average-cost curve and the 
demand curve: 

$$(p^c - c)\,D(p^c) = f.$$

A firm that charges $p < p^c$ and produces a positive 
quantity loses money, because its price is below the average 
cost. (This also shows that the contestable price is 
smaller than the monopoly price $p^m$.) Conversely, a price 
above $p^c$ is not sustainable, because an entrant can undercut 
this price and still make a strictly positive profit. 


Figure 8.1 


6. See page 363 of Baumol et al. 1986 for a more complete discussion of this 
point. 
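
For a concrete case, the intersection of the average-cost curve and the demand curve can be computed in closed form. The sketch below assumes a linear demand D(p) = a − p, which is an illustrative choice and not part of the text; the function name is likewise hypothetical. The contestable price is then the smaller root of (p − c)(a − p) = f.

```python
import math


def contestable_price(a, c, f):
    """Contestable price and output for linear demand D(p) = a - p
    and cost C(q) = f + c q.

    The zero-profit condition (p - c)(a - p) = f has two roots that are
    symmetric around the monopoly price (a + c) / 2; the smaller one is
    the contestable price.
    """
    gross_monopoly_profit = (a - c) ** 2 / 4.0
    if gross_monopoly_profit < f:
        raise ValueError("monopoly is not viable: gross monopoly profit < f")
    p_c = (a + c) / 2.0 - math.sqrt(gross_monopoly_profit - f)
    return p_c, a - p_c
```

With a = 10, c = 2, and f = 4, the contestable price is about 2.54, well below the monopoly price of 6 and above the marginal cost of 2.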

In this example, the theory of contestability predicts 
the following conclusions: 


(1) There is a unique operating firm in the industry (technological 
efficiency). 


(2) This firm makes zero profit. 


(3) Average-cost pricing prevails. Furthermore, the allocation 
is constrained efficient, in the sense that it is socially 
efficient, given the constraint that a social planner does 
not use subsidies.7 


Thus, the mere “threat of entry” has an effect on the 
market behavior of the incumbent firm (conclusion 2 and 
first part of conclusion 3). The second part of conclusion 
3 is not surprising. The fixed cost is not duplicated in the 
sustainable outcome. Thus, only the market price matters 
in the assessment of efficiency. Clearly, the first-best out- 
come is obtained when the incumbent charges the mar- 
ginal cost; however, in the absence of a subsidy, the firm 
would lose f and would not be willing to operate. Short 
of the first-best outcome, a social planner prefers the 


lowest price that allows the firm to make a non-negative 
profit.8 


This set of conclusions is striking. It has long been 
argued that an industry subject to non-negligible increasing 
returns could not behave competitively and therefore 
should be nationalized, or at least carefully regulated. 
If, however, such an industry behaves like a perfectly 
contestable market, it comes as close to marginal-cost 
pricing as is consistent with viable firms (if subsidies are 
prohibited). In the absence of actual competition, poten- 
tial competition is very effective in disciplining the 
incumbent firms. Hence, the unregulated organization of 
industries with increasing returns to scale should be less 
of a problem than would appear at first glance. Clearly, 


7. In the multiproduct case, Baumol et al. (1982) show that a sustainable 
it exists, satisfies the following conditions: (a) Industry cost 
holds (a generalization of conclusion 1). (b) Firms make no profit 
(c) The revenue made by a firm on a subset of products is at least 


ginal production cost for any firm that produces it. They are 
than one firm supplies the good. (e) Under some assumptions (see 
1977), Ramsey prices and outputs, i.e., those that are welfare 
to the constraint that the firm earns a profit equal to the 


for condition c (the no-cross subsidization result) is that if a set 
re not viable, an entrant could come in with the same production 
t except that it would drop these products and thus make 


. we must check that the social planner could not do better by 
to randomize between different prices. To see that price 
lowers welfare. it suffices to show that the aggregate welfare 


Ep), and welfare is higher under the deterministic price Ep than 
lom price p (where E denotes expectation over the price). If, 
furthermore, the profit function $\Pi(p)$ is concave in p, the firm makes 
non-negative profits under the deterministic price Ep if it makes non-negative 
profits under the random price p (since $\Pi(Ep) \geq E\,\Pi(p) \geq 0$), so the 
firm's non-negative profit constraint is harder to satisfy with a random price 
than with a deterministic one. For our purpose, let us assume that 

$$D'(p) + (p - c)\,D''(p) < 0.$$

Then 

$$\Pi''(p) = 2D'(p) + (p - c)\,D''(p) < 0.$$

Also, 

$$W''(p) = [S(p) + \Pi(p)]'' = [-D(p) + D(p) + (p - c)\,D'(p)]' = D'(p) + (p - c)\,D''(p) < 0$$

(where S denotes the net consumer surplus). Thus, both $\Pi$ and W are concave. 


For a much more general result on the undesirability of random prices, see 
Samuelson 1972. 

such a theory, if applicable, has strong implications for the 
deregulation of the airlines and similar industries. 

Baumol et al. (1982) show that, for different demand 
and cost functions, natural monopolies may not be sustainable. 
That is, there may not exist a price-output pair 
$\{p^c, q^c\}$ such that firms make non-negative profits, the 
market clears, and the allocation cannot be upset by profitable 
entry at price-output pair $\{p^e, q^e\}$ such that 
$p^e \leq p^c$ and $q^e \leq D(p^e)$. That is, constrained efficient market 
structures may not be sustainable against entry. 


Exercise 8.1** In a one-good industry, consider a U-shaped 
average-cost curve. Suppose that the demand 
curve intersects the average-cost curve slightly to the 
right of the most efficient scale (i.e., the average-cost-minimizing 
output). Using a diagram, show that there 
exists no sustainable allocation. 


The natural question is this: Which situation is depicted 
by the contestability axioms, in particular the sustainability 
axiom? One would want to describe (at least in a 
stylized way) competition in a natural-monopoly industry, 
and to compare its outcome with the contestable one. 

One game that yields the contestable outcome is the 
following: Suppose that firms first choose prices simultaneously 
and then choose outputs. (Picking an output 
involves deciding whether to enter, i.e., whether to 
choose a strictly positive output.) This two-stage game 
is the reverse of the two-stage game described in chapter 
5, in which firms chose quantities before prices. Suppose 
that all potential firms choose price $p^c$. Then one of 
them chooses output $q^c$ and the others stay out (produce 
nothing). This is clearly an equilibrium. All firms make 
zero profit. If a firm were to undercut $p^c$, it could not 
supply the market profitably.9 As Baumol et al. rightly 
note, the theory of perfectly contestable markets can thus 
be seen as a generalization of Bertrand competition to 
markets with increasing returns to scale.10 

The preceding game portrays the vision of an industry 
in which prices adjust more slowly than decisions about 
quantities or entry. Prices are considered rigid at the 
time firms choose their quantities. As prices are generally 
thought of as amenable to relatively quick adjustment, the 
technology thus involves a fixed cost in the sense of 
subsection 8.1.1. This vision is implicit in the slightly 
more sophisticated “hit-and-run entry" story offered by 
the proponents of contestability. Suppose that the incumbent's 
price is rigid for a length of time t, and that entry 
and exit are costless. If the incumbent's price exceeds $p^c$, 
an entrant can enter, undercut this price slightly (thereby 
conquering the incumbent's entire market share), and exit 
the industry before t units of time have elapsed, i.e., 
before the incumbent can respond by lowering his price. 
The entrant (who, by assumption, incurs no entry or exit 
cost) thus makes a positive profit. Thus, only price $p^c$ is 
“sustainable.” 

This interpretation of contestability has come under 
attack on the grounds that prices seem to adjust more 
rapidly than decisions about quantities or entry. Price 
adjustment does seem faster in the railroad industry, 
where entry and expansion entail a long process of buying 
up parcels of land (generally requiring powers of eminent 
domain), engineering and building the railroad, and so on. 
It may be even faster in the airline industry, where opening 
a new route is a relatively fast process.11,12 


9. To prove that this is the unique equilibrium, consider the highest price 
$\bar p > p^c$ charged in equilibrium by any firm. Show that this price has probability 
1 of being strictly higher than the lowest price charged by the other firms. 
Conclude that this firm makes zero profit, which in turn implies that the lowest 
price charged by the other firms is $p^c$ with probability 1. 


10. See Grossman 1981 for an alternative approach to contestability in a 
one-good industry. Grossman assumes that firms announce supply curves 
rather than fi 


11. However, Bailey and Panzar (1981) argue that the theory of contestable 
markets is relevant to city-pair airline markets. There are returns to scale in this 
industry, but fixed costs are not sunk. (The aircraft can be recovered at little 
cost. Sunk costs, such as those for runways, towers, and ground facilities, are 
incurred by municipalities.) Bailey and Panzar offer some evidence that 
monopolists (almost 70% of routes are served by a single carrier) behave more 
or less competitively on their long-haul routes immediately after deregulation. 
In contrast, Bailey et al. (1985) and others find that fares are higher when 
concentration is higher when all else is equal (but the relation, although statistically 
strong, is not economically large). 


12. Brock and Scheinkman (1983) study “quantity sustainability.” They say 
that a price-quantity allocation $(\bar p, \bar q)$ is quantity sustainable if any production 
plan by an entrant $q^e$ makes negative profit at the market-clearing price for 
quantity $\bar q + q^e$. That is, the entrant assumes that the established firm's output 
remains fixed after entry. Brock and Scheinkman show that under some 
assumptions price sustainability implies quantity sustainability, and that in the 
single-product case the allocation $(p^c, q^c)$ at which the demand curve intersects 
the average-cost curve is quantity sustainable (it is not necessarily price 
sustainable; see exercise 8.1). 

Perry (1984) considers price strategies but departs from the uniform-pricing 
assumption made by Baumol et al. (1982). The incumbent announces a price-quantity 
schedule: He stands ready to supply $q_1$ units of the good at price $p_1$, 
then $q_2$ more units at price $p_2 > p_1$ (so that his total supply at price $p_2$ is 
$q_1 + q_2$), and so on. The entrant reacts by announcing a price-quantity schedule 
himself. Sustainability is easier to obtain than under uniform pricing, because 

ion facilities. That is, entry ought to induce the 


idities. In this view, prices adjust “instantane- 
his, of course, is not realistic; it is a metaphor for 
that prices adjust quickly relative to the time 
scale of the capacity game.) That is, at any point in time, 
each firm chooses its price so as to maximize its profit, 
given the current vector of capacities. 

An old intuition in industrial organization states that if 
the incumbent is committed to his capacity only in the 
short run, he and the potential entrant are almost on equal 
footing, so that barriers to entry (and the incumbent's 
profit) are low. Indeed, in a model where firms are stuck 
with their capacity choices for a short period of time, it 
can be shown that there exists an equilibrium in which 
only the incumbent produces; this apparent monopolist 
accumulates and constantly renews (approximately) 
capacity $q^c$ and makes (almost) no profit. If the incumbent’s 
equilibrium capacity were lower (allowing positive 
profit), an entrant could come in and, because the 
incumbent's capacity commitment is short, would incur 
duopoly losses for a short time before the incumbent 
would exit. The entrant would then take over the market 
and become the incumbent. Thus, the prospect of high steady-state 
profits together with the brevity of the fight to kick 
out the incumbent would encourage entry. This approach 
to contestability is developed in more detail in the supplementary 
section. 


8.1.3 War of Attrition 


Another popular approach to natural monopoly is the 
war of attrition. Like the short-run capacity-commitment 
approach sketched in the preceding paragraph, it assumes 
that price adjustments take place more quickly than quantity 
adjustments. 

The war of attrition was introduced in theoretical biol- 
ogy, by Maynard Smith (1974), to explain animals’ fights 
for prey. Two animals fighting for prey may resemble 
two firms fighting for control of an increasing-returns 
industry. Fighting is costly to the animals; at the very 
least, they forgo the opportunity of other activities and 
become exhausted. Similarly, duopoly competition may 
be costly because it generates negative profits. In both 
cases, the object of the fight is to induce the rival to give 
up. The winning animal keeps the prey; the winning firm 
obtains monopoly power. The loser is left wishing it had 
never entered the fight. (For such a fight to take place, its 
outcome cannot be deterministic. Each player must have 
at least some chance of winning in order to be willing to 
participate.) In a war of attrition, each player waits and 
suffers for a while. If at some point in time his rival has 
not yet quit, a player gives up. 

moved to the left of his average-cost curve: in a sense, the incumbent is able 
a certain output through low prices on these units but can still 
through high prices on the marginal units. Perry shows that the 
incumbent generally makes a strictly positive profit, and that the existence of a 
sustainable price-quantity strategy may not even require the natural-monopoly 
assumption (which assumption is necessary but not sufficient for the existence 
of a sustainable allocation under uniform pricing). 


The simplest example of a war of attrition is the following: 
Suppose time is continuous from 0 to $+\infty$. The rate 
of interest is r. There are two firms, with identical cost 
functions C(q) = f + cq if q > 0 and C(0) = 0, per unit of 
time. Price adjustments are instantaneous. If the two firms 
are in the market at time t, price equals marginal cost c 
(Bertrand competition) and each firm loses f per unit of 
time. If only one firm is in the market, the price is equal 
to the monopoly price, $p^m$, and the firm makes instantaneous 
profit $\tilde{\Pi}^m - f > 0$; the other firm makes zero profit. 
Both firms are in the market at date 0. At each instant, 
each firm decides whether to exit (conditional on the 
other firm’s still being in the market at that date). Exit is 
costless. For simplicity, assume that a firm that drops out 
never returns (however, the equilibrium we describe below 
is still an equilibrium if costless reentry is allowed). 
Because the market is profitable for a monopoly, the 
remaining firm stays in forever after its rival has dropped 
out. 

We construct a symmetric equilibrium in which, at
any instant, each firm is indifferent between dropping out
and staying. For a firm to be indifferent, the expected
profits from the two actions must be the same. Because
dropping out at $t$ means zero profits from that date on,
each firm's expected present discounted value of profits
from any date on must equal zero. If both firms are still in
the market at date $t$, each firm drops out with probability
$x\,dt$ between $t$ and $t + dt$, where $x = rf/(\Pi^m - f)$. To see
that these strategies form an equilibrium, suppose that at
date $t$ both firms are still in the market. If firm 1 drops out,
it gets 0 from $t$ on. If firm 1 stays in until time $t + dt$,
it incurs duopoly loss $f\,dt$. However, with probability $x\,dt$,
firm 2 drops out during this short interval of time. Firm 1
then becomes a monopoly and, from then on, earns total
(discounted) profits $(\Pi^m - f)/r$. If firm 2 is still in at date
$t + dt$, firm 1 is willing to drop out and thus make 0 from
that date on. Firm 1 is indifferent between dropping out
at date $t$ and staying until $t + dt$ if

$$0 = -f\,dt + (x\,dt)\,[(\Pi^m - f)/r] + 0.$$

The industry outcome is stochastic. Each firm drops out
according to a Poisson process with parameter $x$.¹³
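
The indifference condition can be checked numerically. The following is a minimal Python sketch, not part of the original text: the function names and the parameter values ($r = 0.05$, $f = 1$, $\Pi^m = 3$) are illustrative assumptions. It computes the exit rate $x = rf/(\Pi^m - f)$ and the expected discounted payoff of a firm that never exits voluntarily while its rival exits at Poisson rate $x$, using $E[e^{-rT}] = x/(x + r)$ for an exponentially distributed exit date $T$.

```python
def exit_hazard(r, f, pi_m):
    """Symmetric-equilibrium exit rate x = r*f / (pi_m - f).

    r: interest rate, f: flow fixed cost, pi_m: gross flow monopoly profit.
    (Names are illustrative; the text writes Pi^m for pi_m.)
    """
    if pi_m <= f:
        raise ValueError("the monopoly must be profitable: pi_m > f")
    return r * f / (pi_m - f)


def value_of_staying(r, f, pi_m, rival_rate):
    """Expected discounted payoff of never exiting voluntarily when the
    rival exits at Poisson rate rival_rate.

    Until the rival's exit date T the firm loses f per unit of time;
    afterwards it earns pi_m - f forever. E[exp(-r*T)] = rate / (rate + r).
    """
    discount = rival_rate / (rival_rate + r)
    duopoly_losses = f * (1.0 - discount) / r
    monopoly_value = discount * (pi_m - f) / r
    return monopoly_value - duopoly_losses


if __name__ == "__main__":
    r, f, pi_m = 0.05, 1.0, 3.0  # assumed values, for illustration only
    x = exit_hazard(r, f, pi_m)
    print("exit rate x:", x)
    print("value of staying at rate x:", value_of_staying(r, f, pi_m, x))
    print("expected duopoly duration 1/(2x):", 1.0 / (2.0 * x))
```

At the equilibrium rate the value of staying is zero, the same as the value of exiting, which is the indifference property used above. A rival that exits faster than $x$ would make staying strictly profitable.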

This equilibrium is consistent with free reentry because
the value of being in is 0, so there is no reason to reenter
once one has left. The equilibrium is not unique;¹⁴ however,
if we depart from our perfect-information assumption
and allow uncertainty about the rival's fixed (opportunity)
costs (see chapter 9), and if the support of this
uncertainty is sufficiently large, then the symmetric
equilibrium is also the unique equilibrium.

The war of attrition yields the following conclusions:


(1') There are two firms in the industry for a (random)
length of time (technological inefficiency); then one exits.


(2') Firms earn no ex ante rents, but may have ex post
profits.


(3') The price is first competitive and then equal to the 
monopoly price. The allocation is not constrained effi- 
cient, and welfare is lower than under contestability. 


The second part of conclusion 3’ results from the fact that 
the contestable allocation is optimal, subject to the no- 
subsidy constraint. The following exercise checks this 
result for a simple specification of demand. 


Exercise 8.2** All the firms in an industry have the same
production cost: C(q) = f = #. (The marginal cost is 0.) 
The demand is D(p) = 1 — p. 

(i) Is this a “natural monopoly”? 

(ii) Compute the contestable allocation. Calculate the 
welfare level. 

(iii) Derive the symmetric equilibrium of the infinite- 
horizon, continuous-time war of attrition between two 
firms. Calculate the expected intertemporal welfare, and 
compare it with the welfare level from question ii. (Hint: 
For a Poisson process with parameter y, the probability 
that no arrival has occurred by time $t$ is $e^{-yt}$.)


Figure 8.2 illustrates the difference in price dynamics in 
the contestability and war-of-attrition theories. 

It may be instructive to look at the natural monopoly
issue from the viewpoint of the literature on rent seeking.
As was noted in chapter 1, Posner argued that the
prospect of monopoly profits creates a contest to appropriate
those profits. All monopoly profits must be added
to the usual dead-weight-loss triangle if two postulates
hold: the rent-dissipation (or zero-profit) postulate, which
asserts that the firms' total expenditure on obtaining the
monopoly profit is equal to the monopoly profit, and the
wastefulness postulate, which asserts that this expenditure
has no socially valuable by-products.


Figure 8.2
Price dynamics in natural monopoly (price against time under the
war of attrition and under contestability).


13. That is, the probability that a firm has dropped out by date $t$,
conditional on the other firm's not dropping out, is $1 - e^{-xt}$ (an exponential
distribution).

14. There also exist asymmetric equilibria. For example, at each date firm 1
stays in and firm 2 exits (another equilibrium is obtained by switching the
names of the firms).

Both the contestable allocation and the war-of-attrition
equilibrium satisfy the rent-dissipation postulate. Competition
for the monopoly position drives industry profits
down to zero.¹⁵ The contestable allocation yields an
interesting reversal of the wastefulness postulate. Because
rent dissipation occurs through low prices, it benefits the
consumers and is socially useful. The war-of-attrition
equilibrium comes closer to satisfying the wastefulness
postulate than the contestable allocation. Some of the profits
are dissipated wastefully (for a while, the fixed production
cost is duplicated). But the consumers also enjoy
marginal-cost pricing for some time before facing the monopoly
price (Posner's allocation would correspond to monopoly
pricing at each instant). Thus, welfare is higher than that
predicted by the rent-seeking literature and lower than
that associated with the contestable allocation.

Another interesting analogy relates to the free-entry
biases discussed in chapter 7. As Whinston (1986) notes,
one can view the exit decision as a reverse entry decision.
Therefore, this decision is subject to the same biases
(imperfect appropriability of the consumer surplus and
business-stealing effect) as the entry decision. Let $w(p)$
denote the social welfare per unit of time, gross of fixed
cost. To illustrate the two biases, assume that there are
two consumers, with unit demands, and that $c = 0$. First,
to focus on the business-stealing effect, suppose that the
two consumers have the same valuation $v$ for the good.
The flow monopoly profit is then $\Pi^m = 2v$. A monopolist
captures the full consumer surplus and introduces no
distortion in consumption. Thus,

$$w(c) - w(p^m) = 2v - 2v = 0 < f.$$

The social gain from competition per unit of time is lower
than the flow fixed cost of production. It is socially optimal
to have a single firm at any point of time even if its
pricing behavior cannot be regulated. Thus, there is too
little exit. Second, suppose that the two consumers have
different valuations $v_1 < v_2$ and that $v_2 > 2v_1$, so that a
monopolist charges price $v_2$. By charging $v_1$ (which would
induce a socially optimal consumption), it would capture
only part of the total consumer surplus. Now, if $f$ is lower
than

$$w(c) - w(p^m) = (v_1 + v_2) - v_2 = v_1,$$

competition is valuable.¹⁶ That is, when a firm chooses to
exit (because its private incentive to stay is zero), a social
planner would like it to stay: there is socially too much
exit, because the firms do not appropriate the gain in
consumer surplus due to competition. Thus, in a second-best
world in which pricing cannot be regulated, a social
planner would want to prevent any exit.

The preceding analysis relies on strong price competition
between the two firms. Suppose they succeed in
tacitly colluding in prices while both are still in the market
(see chapter 6 for a discussion of tacit collusion). The
market price is then equal to $p^m$, independent of the
number of remaining firms. Thus, a social planner would
want one of the firms to exit at date 0 in order to avoid
wasteful duplication of the fixed cost. However, suppose
that the firms wage a war of attrition, and they lose
$(f - \Pi^m/2) > 0$ per unit of time while competing. In the
symmetric equilibrium, each firm exits with probability
$x'\,dt$ between $t$ and $t + dt$, where $x'$ is given by

$$(f - \Pi^m/2)\,dt = [x'(\Pi^m - f)/r]\,dt,$$

which yields $x' < x$. Because fighting for a monopoly
position is less costly under tacit collusion, firms exit at a
slower rate precisely when a social planner would prefer
a single firm. Here we have an example of the business-stealing
effect. Staying in has no social value; all profits
are derived from diverting half the monopoly profit from
one's rival (and the full monopoly profit if this rival exits).
Under tacit collusion, there is socially too little exit.¹⁷
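
The comparison $x' < x$ can be illustrated with a short Python sketch. The function names and the numbers ($r = 0.05$, $f = 2$, $\Pi^m = 3$, chosen so that $\Pi^m/2 < f < \Pi^m$) are assumptions made for illustration only.

```python
def hazard_bertrand(r, f, pi_m):
    """Exit rate when duopolists price at marginal cost: x = r*f/(pi_m - f)."""
    return r * f / (pi_m - f)


def hazard_collusion(r, f, pi_m):
    """Exit rate when duopolists share the monopoly profit while both stay:
    x' = r*(f - pi_m/2)/(pi_m - f). Requires a positive flow loss."""
    flow_loss = f - pi_m / 2.0
    if flow_loss <= 0:
        raise ValueError("no war of attrition: colluding duopolists do not lose money")
    return r * flow_loss / (pi_m - f)


if __name__ == "__main__":
    r, f, pi_m = 0.05, 2.0, 3.0  # assumed values
    print("x  (Bertrand):", hazard_bertrand(r, f, pi_m))
    print("x' (collusion):", hazard_collusion(r, f, pi_m))
```

Because the flow loss $f - \Pi^m/2$ is smaller than $f$, the collusive exit rate is lower for any admissible parameters, so the duplication of the fixed cost lasts longer on average.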

The war-of-attrition paradigm has been used to try to
predict whether big firms or small firms are more likely to
exit first in a declining industry with increasing returns to
scale. Ghemawat and Nalebuff (1985) argue that big firms
will exit earlier, leaving the industry to the small firms.
The intuition is that if demand declines, a big firm loses
viability more quickly (it is too big relative to the market).
Thus, in a monopoly situation, a big firm would exit
earlier than a small firm. In a competitive duopoly, the
small firm's anticipation that the big firm will eventually
leave is an incentive for the small firm to stay in the
market. As Ghemawat and Nalebuff show, this forces the
big firm to exit as soon as its instantaneous duopoly
profit becomes negative (i.e., no real war of attrition takes
place on the equilibrium path).¹⁸ Londregan (1986) extends
this model to allow a complete product life cycle, in
which the market grows and then declines.¹⁹


15. One way of looking at this is as follows. The monopoly profit in a
contestable market turns out to be zero. No expenditure is made to obtain it. In
contrast, in the war of attrition, the monopoly profit is the regular one. The
expenditure corresponds to the duopoly losses incurred prior to giving up or
getting the monopoly situation.

16. In this example, $\Pi^m = v_2 > 2v_1 > 2f$. As long as firms wage Bertrand
competition, $f > 0$ is sufficient for the market to be a natural monopoly.

17. See Mankiw and Whinston 1986 and review exercise 24 for analyses of the
free-entry biases in a homogeneous-good industry in a static context.

Whinston (1986) shows that the Ghemawat-Nalebuff
result depends crucially on the big firms' inability to “go
on a diet.” He argues that, in practice, a big firm may be
able to reduce the number of plants and become a small
firm when demand declines. He then solves for equilibrium
when firms can scrap plants (exit then occurs when
the last plant is closed) and shows that a variety of potential
outcomes are feasible. Indeed, Whinston notes that in
the declining industry producing the antiknock additive
for leaded gasoline, the smallest producer was the first to
leave. Ghemawat and Nalebuff (1985) give a few examples,
including the synthetic-soda-ash industry and the
British steel-castings industry, in which the largest firms
exited first.

There is a simple case for which the outcome can be 
predicted without intimate knowledge of the industry. 
Ghemawat and Nalebuff (1987) and Whinston (1986) 
show that if firms can decrease their capacities after a 
downward shock in demand that calls for exit, the bigger 
firm reduces its capacity until it is equal in size to its rival, 
and thereafter the two firms reduce their capacities
symmetrically (so they remain of equal size).²⁰

In chapter 9 we will consider another aspect of the war 
of attrition: the possibility that each firm has incomplete 
information about its rivals’ production or opportunity 
costs. The length of time already spent in a ruinous 
oligopoly contest is then a signal that a firm is efficient 
(or has low outside opportunities, or that the market 
exerts beneficial spillovers on its other product lines). The 
link between the war of attrition, Bayesian updating, and 
Darwinian selection in an industry will be discussed. 


18. The Ghemawat-Nalebuff model assumes that each firm faces a flow cost of
maintaining capacity, which is proportional to the firm's capacity (there is no
fixed cost independent of productive scale). With $P(K, t)$ denoting the inverse
demand function at time $t$, where $K = K_1 + K_2$ is industry capacity, and
$c$ the maintenance-cum-production cost, firm $i$'s instantaneous profit
(assuming that both firms are still in at $t$) is

$$[P(K_1 + K_2, t) - c]K_i.$$

Assume that $\partial P/\partial K < 0$ and $\partial P/\partial t < 0$ (i.e., the industry is declining). Assume
further that the firm's exit decision is lumpy (so a firm's capacity jumps directly
from $K_i$ to 0). Let $t_i^*$ be defined by

$$P(K_i, t_i^*) = c.$$

If $K_1 > K_2$, then $t_1^* < t_2^*$. That is, firm 1 would exit earlier than firm 2 in a
monopoly situation. Backward induction shows that firm 1 exits first at time
$\hat{t}_1 < t_1^*$ such that $P(K_1 + K_2, \hat{t}_1) = c$, and firm 2 stays until $t_2^*$. (Hint: At date $t_1^*$,
it is a dominant strategy for firm 1 to exit. At date $t_1^* - \varepsilon$ for $\varepsilon$ small, firm 2
would be foolish to exit: At worst, it loses some profit during $\varepsilon$, and then
becomes a profitable monopolist from $t_1^*$ to $t_2^*$; so, assuming costly reentry, firm
2 stays and firm 1 exits.)

19. See [...] 1986 and Fine and Li 1986 for analyses of the war of
attrition when the profits follow a stochastic process.

20. The model of Ghemawat and Nalebuff is a continuous-time,
continuous-capacity-adjustment one. That of Whinston assumes discrete periods and
indivisible plants of equal sizes; it does not require almost continuous reduction,
but uses a Markov-like assumption.

The following is a heuristic description of equilibrium. Consider the
continuous-time model in note 18. Let $R_i(K_j, t)$ denote firm $i$'s static reaction
function at $t$; it maximizes

$$[P(K_i + K_j, t) - c]K_i$$

over $K_i$. Let $(K_1^*(t), K_2^*(t))$ denote the static Nash equilibrium, defined by
$K_i^*(t) = R_i(K_j^*(t), t)$. Under mild assumptions, $\partial R_i/\partial t < 0$, which implies that
$dK_i^*/dt < 0$. Consider now the dynamic model and assume for simplicity that
firms can only reduce capacity. The equilibrium strategies are: If

$$K_i(t) < R_i(K_j(t), t) \quad \text{for } i = 1, 2,$$

no firm reduces its capacity at date $t$. If

$$K_i(t) < R_i(K_j(t), t) \quad \text{and} \quad K_j(t) \ge R_j(K_i(t), t),$$

firm $i$ does not reduce its capacity; firm $j$ stays on or moves to its reaction curve
(that is, it reduces its capacity continuously if it is on its reaction curve and
discontinuously if it is above its reaction curve). If $K_i(t) > R_i(K_j(t), t)$ for $i = 1, 2$,
both firms move to the static Nash equilibrium $(K_1^*(t), K_2^*(t))$. They then reduce
their capacities so as to remain on their reaction curve. The equilibrium is
nothing but a sequence of myopic (static) Cournot outcomes.


8.2 Sunk Costs and Barriers to Entry:
The Stackelberg-Spence-Dixit Model


A fascinating aspect of sunk costs is their commitment
value. A firm that buys equipment today signals that it
will be around tomorrow if it cannot resell the equipment.
Thus, we may conjecture that the buying of equipment,
if it is observed by one's rivals, may have strategic
effects, and therefore is not a purely internal cost-minimization
issue. Rivals may interpret the purchase of equipment
as bad news about the profitability of the market and may
reduce their scale of entry or not enter at all. The purpose
of this section is to verify this conjecture.

For the modeling, we will need an explicitly dynamic
model. Sunk costs are, by definition, a multiperiod
phenomenon, as is entry deterrence. We will also introduce
temporal asymmetries. Some firms will enter the market
early, possibly because of a technological lead. We will
see that these established firms (also called incumbents)
accumulate a quantity of “capital” sufficient to limit the
entry of other firms or even to make their entry unprofitable.
First-mover advantages thus allow the established
firms to restrict or prevent competition. We will think of
“capital” as equipment or machines; however, as will be
discussed later, the concept of capital can be interpreted
more broadly.


8.2.1 Accommodated, Deterred, and Blockaded Entry


We start with a prototypical model whose extremely
simple structure allows us to highlight the concept of
a barrier to entry. This model is due to Heinrich von
Stackelberg (1934).

Consider a two-firm industry. Firm 1 (the existing firm)
chooses a level of capital $K_1$, which is then fixed. (We
shall return to this assumption later.) Firm 2 (the potential
“entrant”) observes $K_1$ and then chooses its level of capital
$K_2$, which is also fixed.

Assume that the profits of the two firms are specified 
by 


$$\Pi^1(K_1, K_2) = K_1(1 - K_1 - K_2)$$
and
$$\Pi^2(K_1, K_2) = K_2(1 - K_1 - K_2).$$


These functions will be interpreted later. (Recall from
chapter 5 that they are the reduced-form profit functions
that come from short-run product-market competition
with given capacities.) For the moment, note that these
functions have two properties that are necessary for the
generalization of the results to more general profit functions:
First, each firm dislikes capital accumulation by the
other firm ($\Pi^i_j < 0$). Second, each firm's marginal value
of capital decreases with the other firm's capital level
($\Pi^i_{ij} < 0$). That is, the capital levels are strategic
substitutes (see the introduction to part II).

For now, assume that there is no fixed cost of entry. 
The game between the two firms is a two-period one. 
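Firm 1 must anticipate the reaction of firm 2 to capital
level $K_1$. Profit maximization by firm 2 requires that

$$K_2 = R_2(K_1) = \frac{1 - K_1}{2},$$

where $R_2$ is the reaction function of firm 2 (that is, $R_2(K_1)$
maximizes $K_2(1 - K_1 - K_2)$ with respect to $K_2$). Therefore,
firm 1 maximizes

$$\Pi^1 = K_1\left(1 - K_1 - \frac{1 - K_1}{2}\right),$$

from which we can determine the “perfect” Nash equilibrium:

$$K_1 = \tfrac{1}{2}, \quad K_2 = \tfrac{1}{4}, \quad \Pi^1 = \tfrac{1}{8}, \quad \Pi^2 = \tfrac{1}{16}.$$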


Despite identical profit functions, firm 1 is in a position to 
obtain more profit than firm 2 by limiting the size of firm 
2's entry. This illustrates the first mover’s advantage. We 
know that if the two firms were to choose their levels
of capital simultaneously, each would react to the other
optimally, so that $K_2 = R_2(K_1)$ and $K_1 = R_1(K_2)$. Using
the symmetry, the simultaneous-move solution yields

$$K_1 = K_2 = \tfrac{1}{3}$$
and
$$\Pi^1 = \Pi^2 = \tfrac{1}{9}.$$
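
The two outcomes can be reproduced with exact rational arithmetic. The sketch below is illustrative rather than part of the original text: the function names are assumptions, and the leader's problem is solved by a simple grid search (the grid step of 1/1000 is arbitrary but contains the exact optimum).

```python
from fractions import Fraction as F


def profit(own, other):
    """Reduced-form profit K_i * (1 - K_i - K_j) from the text."""
    return own * (1 - own - other)


def reaction(other):
    """Best response R(K_j) = (1 - K_j)/2, truncated at zero."""
    return max(F(0), (1 - other) / 2)


def stackelberg(grid=1000):
    """Leader picks K_1 on a grid, anticipating K_2 = R_2(K_1)."""
    candidates = [F(k, grid) for k in range(grid + 1)]
    k1 = max(candidates, key=lambda k: profit(k, reaction(k)))
    k2 = reaction(k1)
    return k1, k2, profit(k1, k2), profit(k2, k1)


def cournot():
    """Symmetric simultaneous-move outcome: K = R(K) gives K = 1/3."""
    k = F(1, 3)
    assert reaction(k) == k  # check that 1/3 is indeed a fixed point
    return k, k, profit(k, k), profit(k, k)


if __name__ == "__main__":
    print("sequential:  ", stackelberg())
    print("simultaneous:", cournot())
```

The leader's profit rises from 1/9 to 1/8 and the follower's falls from 1/9 to 1/16, which is the first-mover advantage described above.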


The simultaneous-move and sequential-move outcomes
are illustrated in figure 8.3. The broken lines represent the
isoprofit curves. By definition of the reaction curves, firm
1's isoprofit curve is horizontal when it crosses $R_1$ and
firm 2's isoprofit curve is vertical when it crosses $R_2$.
To conform with common usage, S and N are used to
denote the equilibrium outcomes in the sequential and the
simultaneous game, respectively. They are usually called
Stackelberg and Nash equilibria, but that terminology is
actually misleading. The equilibrium concept is the same
in both cases: (perfect) Nash equilibrium. The games simply
differ in their timing. In the Stackelberg game, firm 1
has a chance to choose its level of capital before firm 2
and, therefore, to influence firm 2.

We see that temporal asymmetry allows firm 1 to
limit firm 2's capital level. To do this, it accumulates more
capital than it would have done in a simultaneous equilibrium.
Consequently, the profitability of a marginal investment
for firm 2 is diminished, providing an incentive for
this firm not to accumulate too much capital. The intuition
is the same for more general profit functions; by
raising $K_1$, firm 1 reduces the marginal profit from investing
($\Pi^2_2$) for firm 2 (as long as $\Pi^2_{21} < 0$). Thus, firm 2
invests less, which benefits its rival ($\Pi^1_2 < 0$).

The role of the irreversibility of capital levels (i.e., the
fact that they may not be reduced in the future) should be
stressed. Firm 1 is not on its reaction curve ex post; its best
response to $K_2 = \tfrac{1}{4}$ is $K_1 = \tfrac{3}{8} < \tfrac{1}{2}$. If, after the choice of
$K_2$, firm 1 could reduce $K_1$, it would do so. However, firm
2 would then choose $K_2 > \tfrac{1}{4}$ in anticipation of this
response. In this sense, firm 1 loses by being flexible. The
fact that the investment cost is sunk is a barrier to exit and
allows the incumbent to commit to a high capital level.

Therefore, it is important that the capital investment be
somewhat difficult to reverse if it is to have a commitment
value. In particular, if the machines operated by the
established firm may easily be resold on a second-hand
market, then it will not satisfy this condition. The commitment
effect is stronger the more slowly capital depreciates
and the more specific it is to the firm (that is, when
its resale involves large losses).

The value of commitment and the corresponding notion
of “burning one's bridges” have widespread applicability
beyond economics. An oft-quoted example is that of two
armies wishing to occupy an island located between their
countries and connected by a bridge to both (figure 8.4).
Each army prefers letting its opponent have the island to
fighting. Army 1, which is somewhat knowledgeable in
game theory, occupies the island and burns the bridge
behind it. Army 2 then has no option other than to let
army 1 have the island, because it knows that army 1 has
no choice other than to fight back if army 2 attacks. This
is the paradox of commitment: Army 1 does better by
reducing its set of choices.

The above equilibrium demonstrates how the incumbent
(firm 1) can reduce firm 2's scale of entry. Following
Caves and Porter (1977), we denote this as a barrier to
mobility. We will also say that firm 1 accommodates entry,
in that it takes entry for granted and simply tries to affect
firm 2's subsequent behavior. Firm 1 cannot deter entry in
this model. Firm 2 declines to enter ($K_2 = R_2(K_1) = 0$)
only if $K_1 > 1$, which would yield negative profits to firm
1. In economic terms, this means that it is always worthwhile
for firm 2 to enter, even on a small scale. If firm 1
makes positive profits, firm 2 can choose a small level of
capital, hardly affect the market price, and make a profit
itself.


Figure 8.3
Stackelberg outcome.


Figure 8.4
(Labels: Army 1, Island, Army 2.)

Such small-scale entry becomes unprofitable under
increasing returns to scale. To illustrate the possibility of
entry deterrence, let us introduce a fixed cost of entry, $f$,
into our model. Assume that firm 2 has the following
profit function:

$$\Pi^2(K_1, K_2) = \begin{cases} K_2(1 - K_1 - K_2) - f & \text{if } K_2 > 0, \\ 0 & \text{if } K_2 = 0. \end{cases}$$

Suppose that $f < \tfrac{1}{16}$. If firm 1 chooses $K_1 = \tfrac{1}{2}$ as before,
firm 2 chooses $K_2 = \tfrac{1}{4}$ and makes a profit of $(\tfrac{1}{16} - f) > 0$.
However, this choice of $K_1$ may not be optimal for firm 1,
which may be able to increase its profit by completely
preventing the entry of firm 2. $K_1^b$, the capital level that
discourages entry, is given by²¹

$$\max_{K_2}\,[K_2(1 - K_2 - K_1^b) - f] = 0,$$

or

$$K_1^b = 1 - 2\sqrt{f} > \tfrac{1}{2}.$$

Firm 2's reaction curve, depicted in figure 8.5, coincides
with that in figure 8.3 up to $K_1^b$, and then coincides with
the horizontal axis. When entry is deterred, the profit of
firm 1 is

$$\Pi^1 = (1 - 2\sqrt{f})[1 - (1 - 2\sqrt{f})] = 2\sqrt{f}\,(1 - 2\sqrt{f}).$$

If $f$ is close to $\tfrac{1}{16}$, this profit is greater than $\tfrac{1}{8}$. Therefore,
firm 1 is interested in completely discouraging, not simply
restricting, the entry of firm 2. Firm 1 accomplishes
this by choosing $K_1 = K_1^b$, because accumulating beyond
$K_1^b$ would reduce profit ($K_1^b$ is greater than the monopoly
capital level of $\tfrac{1}{2}$).²²

In Bain's terminology, the equilibrium for $f$ a bit below
$\tfrac{1}{16}$ is one of deterred entry, whereas the one for $f = 0$ (or,
more generally, $f$ small) is one of accommodated entry.
With $f > \tfrac{1}{16}$, firm 1 blockades entry simply by choosing
its monopoly capital level, $K_1^m = \tfrac{1}{2}$.²³
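
The three regimes can be tabulated as a function of the fixed cost. The Python sketch below is an illustration under the quadratic profit functions of this section; the function names are assumptions, and so is the tie-breaking rule that firm 2 stays out when it is indifferent (the same convention that defines the deterrence level of capital).

```python
import math


def deterrence_capital(f):
    """Capital level 1 - 2*sqrt(f) at which firm 2's best entry profit is zero."""
    return 1.0 - 2.0 * math.sqrt(f)


def incumbent_strategy(f):
    """Return (regime, K_1, profit of firm 1) for a fixed entry cost f >= 0."""
    if f < 0:
        raise ValueError("the fixed cost must be nonnegative")
    if f >= 1.0 / 16.0:
        # The monopoly capital level 1/2 already makes entry unprofitable.
        return "blockaded", 0.5, 0.25
    k_b = deterrence_capital(f)
    profit_deter = k_b * (1.0 - k_b)   # firm 1 alone in the market at K_1 = k_b
    profit_accommodate = 1.0 / 8.0     # Stackelberg outcome K_1 = 1/2, K_2 = 1/4
    if profit_deter > profit_accommodate:
        return "deterred", k_b, profit_deter
    return "accommodated", 0.5, profit_accommodate


if __name__ == "__main__":
    for f in (0.0, 0.001, 0.01, 0.05, 0.1):
        print(f, incumbent_strategy(f))
```

In this specification deterrence pays once $2\sqrt{f}(1 - 2\sqrt{f})$ exceeds $\tfrac{1}{8}$, that is, for fixed costs above roughly 0.0054 and below $\tfrac{1}{16}$.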


Exercise 8.3* Indivisibilities may, like a fixed cost, lead to
a monopolistic structure if combined with a first-mover
advantage. Suppose that firms must build an integer number
of plants: 0, 1, 2, .... Building $n$ plants costs $(3.5)n$.
Each plant produces one unit of output, there is no variable
cost, and the market price is $p = 6 - K$, where $K$ is the
industry's total capacity (number of plants).

(i) Show that a monopolist installs one plant.

(ii) Consider duopolists simultaneously choosing their
numbers of plants, $K_1$ and $K_2$. Let $p = 6 - K_1 - K_2$.
Show that in the Cournot equilibrium each firm builds one
plant.


Figure 8.5
A fixed cost of entry implies a minimum capital level.

(iii) Suppose that firm 1 builds before firm 2. Show that 
firm 1 builds two plants and firm 2 stays out. Comment 
on the similarities and differences with the continuous- 
investment-cum-fixed-cost case. 
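
The three claims of the exercise can be verified by brute-force enumeration. The following Python sketch is only a check, not a substitute for the argument: the cap of six plants per firm and the rule that the follower builds as little as possible when indifferent are assumptions introduced here.

```python
PLANT_COST = 3.5
MAX_PLANTS = 6  # assumed cap; the price is zero at six plants in total


def profit(own, other=0):
    """Profit of a firm with `own` plants when the rival has `other` plants."""
    price = max(0.0, 6 - own - other)
    return own * price - PLANT_COST * own


def best_responses(other):
    """All profit-maximizing plant numbers against `other` rival plants."""
    payoffs = {n: profit(n, other) for n in range(MAX_PLANTS + 1)}
    top = max(payoffs.values())
    return [n for n, value in payoffs.items() if value == top]


def cournot_equilibria():
    """Pairs (K_1, K_2) that are mutual best responses."""
    plants = range(MAX_PLANTS + 1)
    return [(k1, k2) for k1 in plants for k2 in plants
            if k1 in best_responses(k2) and k2 in best_responses(k1)]


def sequential_outcome():
    """Firm 1 builds first, anticipating firm 2's best response."""
    best = None
    for k1 in range(MAX_PLANTS + 1):
        k2 = min(best_responses(k1))  # assumed tie-breaking rule
        payoff = profit(k1, k2)
        if best is None or payoff > best[2]:
            best = (k1, k2, payoff)
    return best


if __name__ == "__main__":
    print("monopoly:", best_responses(0))
    print("Cournot:", cournot_equilibria())
    print("sequential:", sequential_outcome())
```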


8.2.2 Discussion and Extensions 


8.2.2.1 Reduced-Form Profit Functions 


We now return to the interpretation of the profit func- 
tions. Stackelberg actually wrote his two-stage game in 
terms of quantities. This left (at least) three questions 
unanswered: What does quantity competition mean? Why 
does one of the firms enjoy a first-mover advantage (i.e., 
choose its quantity first)? Why does quantity have a com- 
mitment value? Spence (1977, 1979) and Dixit (1979, 
1980) made the Stackelberg story consistent, basically by 
interpreting Stackelberg’s quantity variable as a capacity 
(as we did notationally). Doing so provides answers to
the three questions: First, the profit functions represent 
reduced-form profit functions after one has solved for 
short-run product-market competition given the capacity 
levels. Second, the first-mover advantage may come from 
the fact that one of the firms obtains the technology 
earlier or is quicker to act than the other firm. Third, 
capacities have a commitment value to the extent that 
they are sunk. 


Remark 1 In chapter 5, we derive the reduced-form profit
functions by solving capacity-constrained price competition.
Spence and Dixit depart from this in two respects.
First, they take short-run competition to be quantity
competition rather than price competition. Second, they allow
firms to accumulate more capacity during product-market
competition. Consider Dixit's (1980) model. In the first
period, firm 1 chooses capacity $\bar{K}_1$ at cost $c_0 \bar{K}_1$. This
capacity may subsequently be increased, but cannot be
reduced. Firm 2 observes $\bar{K}_1$. Then, in the second period,
the two firms choose their outputs ($q_1$ and $q_2$) and their
capacities ($K_1$ and $K_2$) simultaneously, with $K_1 \ge \bar{K}_1$.
Production involves cost $c$ per unit of output. Output cannot
exceed capacity: $q_i \le K_i$ for all $i$. The price is equal to the
market-clearing price, given the outputs.


21. The superscript b stands for barrier.

22. $K_1 = \tfrac{1}{2}$ satisfies the first-order condition and the second-order
condition locally for firm 1. However, since the reaction function of firm 2
is discontinuous at $K_1^b$, firm 1's objective function is not globally concave.
Thus, $K_1 = \tfrac{1}{2}$ is not necessarily the maximum.

23. That the monopoly level coincides with the incumbent's capital level under
accommodated entry is an artifact of the quadratic profit functions we have
chosen.

Firm 2 faces short-run and long-run marginal costs
equal to $c_0 + c$ and clearly chooses identical capacity and
output ($K_2 = q_2$). For $q_1 \le \bar{K}_1$, firm 1 incurs the short-run
marginal cost $c$; any unit of output beyond $\bar{K}_1$ costs
the long-run marginal cost, $c_0 + c$. The short-run marginal-cost
curve is represented in figure 8.6, which suggests
why capacity has a commitment value: It lowers the ex
post marginal cost of producing up to $\bar{K}_1$ and hence makes
the production of the first $\bar{K}_1$ units attractive in the second
period. To make this precise, we can consider two
reaction functions. Were firms to choose their levels of
capital and output simultaneously (that is, were there no
first-mover advantage), they would face cost $c_0 + c$ for
each unit of output at the date of the production decision.
With the demand curve assumed linear ($p = a - bq$), firm
$i$ maximizes $q_i[a - b(q_i + q_j) - c_0 - c]$. (A firm has no
reason to accumulate capacity
that is not used for production here.) Thus, the reaction
function is

$$R_i(q_j) = (a - bq_j - c_0 - c)/2b.$$


Next consider the Dixit two-stage game in which firm
1 chooses its capacity in the first period and its production
in the second period, whereas firm 2 chooses both
capacity and production in period 2. Firm 2's reaction
function at date 2 is

$$R_2(q_1) = (a - bq_1 - c_0 - c)/2b.$$


Figure 8.6
Short-run marginal cost.

However, firm 1 at date 2 has a different reaction function,
called a short-run reaction function. Up to $\bar{K}_1$, it
incurs marginal cost $c$ only, and therefore it has reaction
function

$$\tilde{R}_1(q_2) = (a - bq_2 - c)/2b > R_1(q_2).$$

Beyond $\bar{K}_1$, the short-run and long-run reaction functions
coincide:

$$\tilde{R}_1(q_2) = R_1(q_2).$$

The second-period equilibrium as a function of $\bar{K}_1$ can
thus be obtained from the intersection of $\tilde{R}_1$ and $R_2$ in
figure 8.7.
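
The kinked short-run reaction function and the resulting second-period equilibrium can be sketched in Python as follows. This is an illustration, not the original derivation: the function names are assumptions, the example parameters ($a = 2$, $b = 1$, $c_0 = c = 0.5$) are chosen so that $a - c_0 - c = 1$, and an interior solution for firm 2 is assumed ($a - c > 2c_0$).

```python
def long_run_reaction(q_other, a, b, c0, c):
    """Best response when every unit costs c0 + c."""
    return max(0.0, (a - b * q_other - c0 - c) / (2 * b))


def short_run_reaction(q_other, a, b, c):
    """Best response when capacity is already in place (marginal cost c)."""
    return max(0.0, (a - b * q_other - c) / (2 * b))


def incumbent_best_response(q2, k_bar, a, b, c0, c):
    """Firm 1's date-2 output given installed capacity k_bar.

    Below k_bar the short-run curve applies, above k_bar the long-run
    curve applies, and in between output sticks at k_bar (the kink).
    """
    r_long = long_run_reaction(q2, a, b, c0, c)
    r_short = short_run_reaction(q2, a, b, c)
    return min(max(r_long, k_bar), r_short)


def second_period_equilibrium(k_bar, a, b, c0, c):
    """Intersection of firm 1's kinked curve with firm 2's reaction curve."""
    q_nash = (a - c0 - c) / (3 * b)   # both firms on long-run curves
    q_cap = (a - c + c0) / (3 * b)    # firm 1 on its short-run curve
    q1 = min(max(k_bar, q_nash), q_cap)
    q2 = long_run_reaction(q1, a, b, c0, c)
    return q1, q2


if __name__ == "__main__":
    for k_bar in (0.2, 0.5, 0.9):
        print(k_bar, second_period_equilibrium(k_bar, 2.0, 1.0, 0.5, 0.5))
```

With these numbers a first-period capacity of 0.5 yields outputs (0.5, 0.25), the sequential outcome of section 8.2.1, while a capacity of 0.9 would leave part of the capacity idle, since firm 1's output cannot exceed 2/3 in equilibrium.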

As can be seen from figure 8.7, firm 1 has no incentive
to invest in the first period in capacity that it does
not use ex post.²⁴ Furthermore, firm 1 gains by investing
beyond the Nash capacity $K_1^N$, because this moves the
equilibrium to the right of N along $R_2$, which increases
firm 1's profit.

This does not explain how prices are determined. It is
assumed that the market price “clears the market.” For
instance, the market price for linear demand $p = a - bq$ is

$$p = a - b(q_1 + q_2).$$

Assuming that the firms in the second stage produce to
capacity, $q_i = K_i$ (this actually is part of the derivation),
we can write the reduced-form profit functions

$$\Pi^i(K_i, K_j) = K_i[a - c_0 - c - b(K_i + K_j)],$$

which have the previous form for $a - c_0 - c = 1$ and
$b = 1$.


Figure 8.7
Short-run and long-run reaction functions (the second-period
equilibrium is marked).


24. There would be idle capacity if firm 1's short-run reaction curve intersected
firm 2's reaction curve at some $q_1 < \bar{K}_1$. Firm 1 could obtain the same
product-market outcome by accumulating only $q_1$, thus saving $c_0(\bar{K}_1 - q_1)$.

As usual, the presence of an auctioneer is not completely
satisfactory. A more realistic description of the
Spence-Dixit game might involve a “double-capacity-constrained
game.” The first capacity constraint refers to
the production capacity, which limits the level of output;
the marginal cost is $c$ as long as $q_i \le K_i$. The second
capacity constraint refers to the selling capacity, which
limits the level of sales: firm $i$ cannot sell more than it
has produced, $x_i \le q_i$, where $x_i$ is the level of sales. This
interpretation simply adds a third stage, in which firms
choose prices constrained by their outputs.

A question studied by the literature is whether the
incumbent firm uses its capacity after deterring entry. That
is, does firm 1 hold idle capacity to deter firm 2's entry?
Using quantity competition as the paradigm of short-run
product-market competition, Spence answered in the
affirmative. But Dixit showed that Spence's result was due to
the fact that his equilibrium was not a perfect equilibrium.²⁵
Indeed, with a concave demand function, any
capacity held to deter entry is used by the monopolist.
Bulow et al. (1985a) show that Spence's excess capacity
[...] that the reaction curves are upward sloping.
Schmalensee (1981) uses the Spence-Dixit model; however,
instead of introducing a fixed cost of entry, he
assumes that a firm cannot produce below some minimum
output $K_0$ if it produces at all (so $q_i \ge K_0$). He
interprets $K_0$ as a minimum efficient scale. Using empirical
evidence that the minimum efficient scale is often low
relative to industry demand (generally under 10 percent),
he argues that such barriers to entry cannot account for
established firms' high profits.


Exercise 8.4** The first part of this exercise recalls how
reduced-form profit functions can be deduced from short-run
price competition. The second part (inspired by
Matsuyama and Itoh [1985]) shows how the model of
barriers to entry can be used to analyze the desirability of
protecting an infant industry.

(i) Two firms produce perfect substitutes at zero marginal
cost (up to a capacity constraint). The demand function
is $p = 4 - (q_1 + q_2)$. Firms are capacity constrained:
$q_i \le K_i$. Capacity costs 3 per unit. Use the monopoly
solution to show that $K_i$ cannot exceed 1. Use this upper
bound to conclude that when firms are capacity constrained
and choose their prices simultaneously (with
the capacities fixed and common knowledge), both firms
quote price $p = 4 - K_1 - K_2$; to show this, posit either
the efficient-rationing rule or the proportional-rationing
rule.

(ii) Firm 1 is a foreign firm, firm 2 a domestic one. 
Consider the following “no-protection” 3-stage game: 

1. Firm 1 chooses capacity $K_1$.

2. Firm 2 chooses capacity $K_2$, knowing $K_1$.

3. Firms choose prices simultaneously, knowing $K_1$ and
$K_2$.

(That is, the foreign firm has a first-mover advantage.) The 
domestic firm faces an entry cost f = 7k. Compute the 
equilibrium and the welfare (where welfare = consumer 
surplus + profit of domestic firm). Show that a policy of 
“limited protection,” which forces the foreign firm to wait 
until period 2 to invest domestically (so that both firms 
choose $K_1$ and $K_2$ simultaneously) increases welfare.


Remark 2 Interpreting profit functions as reduced-form
functions for price competition under capacity constraints
allows us to perform some welfare analysis. In the Stackelberg
example (see subsection 8.2.1), let $p = 1 - K$ denote
the demand function, where $K = K_1 + K_2$ denotes industry
capacity and output. (The intercept of the demand


25. See the Game Theory User's Manual on the notion of perfect equilibrium. 
function is net of investment and production costs; see
remark 1.) The social optimum in this industry is to produce
industry output $K = 1$. In duopoly, the welfare loss
is measured by the area of the triangle between the demand
curve and the marginal-cost curve (which, here, is
the horizontal axis, as the marginal cost is normalized to
zero); see chapter 1. If $p$ is the market price, the welfare
loss from monopoly or duopoly pricing is equal to $p^2/2$.
If the entrant enters, the fixed cost $f$ of entry must be
added to the welfare loss, because the socially optimal
production involves only one firm (the entrant does not
bring any cost savings).
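The welfare-loss formula can be illustrated with a few lines of Python. This is only a sketch under stated assumptions: demand is $p = 1 - K$ with zero marginal cost as in the remark; the Stackelberg capacities 1/2 and 1/4 and the entry-deterring capacity $1 - 2\sqrt{f}$ are what the linear reduced form $K_i(1 - K_1 - K_2)$ implies; the entry cost `f = 0.01` and all function names are hypothetical.

```python
import math


def welfare_loss(price, entry_cost=0.0, entrant_enters=False):
    """Deadweight triangle p^2/2 under p = 1 - K, plus f if the entrant enters."""
    loss = price ** 2 / 2.0
    return loss + (entry_cost if entrant_enters else 0.0)


def price_from_capacity(k1, k2=0.0):
    """Market-clearing price when all capacity is used."""
    return 1.0 - (k1 + k2)


f = 0.01  # hypothetical entry cost, chosen only for illustration

# Unthreatened monopoly: K1 = 1/2.
monopoly = welfare_loss(price_from_capacity(0.5))

# Accommodation with Stackelberg capacities (1/2, 1/4); entry cost is wasted.
accommodation = welfare_loss(price_from_capacity(0.5, 0.25), f, entrant_enters=True)

# Deterrence: entrant's best-response profit ((1 - K1)/2)^2 equals f at K1 = 1 - 2*sqrt(f).
k_deter = 1.0 - 2.0 * math.sqrt(f)
deterrence = welfare_loss(price_from_capacity(k_deter))

print(monopoly, accommodation, deterrence)
```

With these illustrative numbers, deterrence produces the smallest loss because industry capacity expands without the entry cost being spent; other parameter values can reverse the ranking between accommodation and deterrence.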


t's addition to the capacity of the industry is
socially beneficial if it is privately beneficial from the
nonappropriability-of-consumer-surplus effect (as long as
the capacity is used). The incumbent's increase in capacity
to deter entry also yields some increase in the industry's
capacity without wasting the entry cost.


8.2.2.2 Multiple Incumbents


Several authors have studied either entry deterrence
with several incumbents or models of sequential entry:
Bernheim (1984), Gilbert and Vives (1986), McLean and
Riordan (1985), Vives (1985), and Waldman (1987), among
others.

One of the issues²⁸ addressed in this literature is
whether entry deterrence is a public good. The
one-incumbent, one-entrant model considered above suggests
the following conjecture: To deter entry, the incumbent
incurs a cost. With several incumbents, entry deterrence
becomes a public good; if the first incumbent deters entry
by accumulating a large amount of capital, the other
incumbents also benefit. Every incumbent would like
entry to be deterred but would prefer not to incur the
associated cost.

To understand why underinvestment by the incumbents
to deter entry might occur, it is useful to come back
to the classic noncooperative subscription problem. Consider
a community with two individuals. This community
can implement a project that costs $1. Each member of the
community has value $2/3 for the project. Thus, no one
is willing to bear the whole cost; however, cooperative
action is desirable, because the social value of the project,
$4/3, exceeds its cost, $1. Suppose that the members
choose simultaneously how much to invest in the project.
If $1 or more is collected, the project is implemented;
otherwise it is not. (Any remaining money is redistributed
according to some rule.) There are two kinds of
pure-strategy Nash equilibria in this game. In the first, no one
contributes and the project is not implemented. In the
second, each member i contributes by subscribing an
amount \(a_i\) such that \(a_1 + a_2 = 1\) and the project is
implemented. (There is a continuum of such equilibria, indexed
by, say, \(a_1\) in \([1/3, 2/3]\), if the money is given back to the
contributors because the project is not realized.)²⁹
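The two kinds of equilibria can be checked by brute force. The sketch below is a hypothetical discretization, not the text's continuous game: subscriptions are restricted to multiples of 1/12, subscriptions are refunded if the project fails, and (as a simplifying assumption about the unspecified redistribution rule) any surplus above the cost is not returned.

```python
from fractions import Fraction

COST = Fraction(1)        # project cost from the text
VALUE = Fraction(2, 3)    # each member's valuation from the text


def payoff(own, other):
    """Payoff of a member who subscribes `own` while the other subscribes `other`."""
    if own + other >= COST:
        # Assumption: any surplus over the cost is not handed back.
        return VALUE - own
    # Assumption: subscriptions are refunded when the project fails.
    return Fraction(0)


def is_nash(a1, a2, grid):
    """True if neither member can gain by switching to another grid subscription."""
    no_gain_1 = all(payoff(d, a2) <= payoff(a1, a2) for d in grid)
    no_gain_2 = all(payoff(d, a1) <= payoff(a2, a1) for d in grid)
    return no_gain_1 and no_gain_2


GRID = [Fraction(k, 12) for k in range(13)]   # subscriptions 0, 1/12, ..., 1

# Equilibria in which the project is implemented.
funded = [(a1, a2) for a1 in GRID for a2 in GRID
          if a1 + a2 >= COST and is_nash(a1, a2, GRID)]

print([(str(a1), str(a2)) for a1, a2 in funded])
print(is_nash(Fraction(0), Fraction(0), GRID))
```

On this grid the funded equilibria all satisfy \(a_1 + a_2 = 1\) with each share between 1/3 and 2/3, and zero contributions by both members is also an equilibrium, matching the two kinds described above.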

Now consider a situation in which two incumbents
(firms 1 and 2) choose their capacities simultaneously. The
entrant (firm 3) stays out if and only if $K_1 + K_2 \ge K^b$,
where $K^b$ is the entry-deterring industry capacity. The
capacities $K_1$ and $K_2$ are analogous to the subscriptions in
the preceding paragraph. It would seem that we face a
public-good problem, with the possibility of too little

26. This assumes that both firms choose their Cournot outputs equal to 1/3.
Note that the entrant in this equilibrium makes profit $1/9 - f > 0$. This is the
only pure-strategy equilibrium under our assumption that $f < 1/16$. (For $f > 1/16$,
there exists another equilibrium, with firm 1 producing its monopoly output,
1/2, and firm 2 not entering.)


27. That is, $2\sqrt{f} - 4f \ge 1/8$, or $f \ge 0.0054$. We assume as before that the
incumbent uses its entry-deterring capacity, $1 - 2\sqrt{f}$. See remark 1 above.
28. Some policy interventions against entry deterrence are ambiguous. For
instance, as Bernheim notes, making entry deterrence harder for the second firm
entering a market reduces this firm's prospects and makes entry deterrence
easier for the first firm in the market.


29. See exercise 11.6 in the Game Theory User's Manual for an
underinvestment result with a continuous-size project choice.
entry-deterring investment from the point of view of the
incumbents. Gilbert and Vives (1986) show that this
intuition may be misleading. The reason is that, contrary
to the usual public-good problem, supplying the public
good (and thus contributing to entry deterrence) is not
necessarily costly. Suppose that entry is deterred and
$K_1 + K_2 = K^b$. Let incumbent i's profit be

$$K_i\,(P(K^b) - c_0 - c),$$

where $P(\cdot)$ is the inverse demand function and where $c_0$
and $c$ are the investment and ex post variable costs per
unit. Because the price must exceed the total unit cost,
each firm would like to have the highest possible capital
level for this given price. Thus, conditional on the actual
deterrence of entry, each firm would like to contribute to
entry deterrence as much as it can (in contrast with the
public-good situation). Gilbert and Vives actually find
that only overinvestment can occur. The exercise below
suggests in more detail why this is so. For more general
models, the conclusions are more ambiguous; see Waldman
1987 and McLean and Riordan 1985.


Exercise 8.5*** Consider the two-incumbent, one-entrant
game above. Let

$$\Pi^i = K_i\,(P(K_1 + K_2 + K_3) - c_0 - c)$$

denote firm i's profit for i = 1, 2, where

$$K_3 = \begin{cases} R_3(K_1 + K_2) & \text{for } K_1 + K_2 < K^b \\ 0 & \text{for } K_1 + K_2 \ge K^b \end{cases}$$

(firm 3 faces a fixed cost of entry). Let

$$\Pi^b = K^b\,(P(K^b) - c_0 - c)$$

denote the industry profit when firms 1 and 2 just deter
firm 3's entry. Show that if the noncooperative equilibrium
between the incumbents allows entry, then $\Pi^1 + \Pi^2 > \Pi^b$
(so there is no underinvestment in entry deterrence
by the incumbents).
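One possible line of argument for this exercise can be sketched as follows. It is offered as a hedged outline rather than the intended solution. The symbol $m$ is shorthand introduced here for the margin $P(K^b) - c_0 - c$, assumed positive, and the key step is that in an equilibrium that allows entry each incumbent could have deviated to the capacity that just deters entry.

```latex
% Sketch only; m := P(K^b) - c_0 - c > 0 is notation introduced for this outline.
\begin{align*}
\Pi^i &\ge (K^b - K_j)\,m, \qquad i = 1, 2,\ j \ne i
  && \text{(firm $i$ could deviate to $K^b - K_j$ and deter entry)} \\
\Pi^1 + \Pi^2 &\ge (2K^b - K_1 - K_2)\,m
  && \text{(sum the two inequalities)} \\
  &> K^b\,m = \Pi^b
  && \text{(since $K_1 + K_2 < K^b$ when entry is allowed).}
\end{align*}
```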


8.2.2.3 Entry for Buyout 


We assumed that the post-entry market organization takes 
the form f competition between the incumbent and the 
entrant (if the entrant enters). However, suppose that 


there are no impediments to mergers; that is, there is no
legal prohibition, there is no asymmetry of information
about the value of assets, there are no direct costs of 
transferring assets, and it is possible for the asset seller to 
commit not to come back and reinvest in this market. The 
market structure may then be a monopoly if the incum- 
bent buys the entrant or vice versa. Indeed, if mergers are 
costless, firms have an incentive to merge after the en- 
trant enters, because a monopoly can do at least as well 
as duopolists as long as it owns the two firms’ assets. Of 
course, the distribution of the gains from monopolization 
is determined in the bargaining process for buyout and 
depends on the “threat point,” i.e., on the profits the two
firms would make if they were to reach no agreement and 
compete in the product market. As long as the entrant has 
some bargaining power, he can extract part of the in- 
crease in industry profits associated with the merger. This 
means that, for a given investment (here, capacity), the 
possibility of a merger increases the entrant's post-entry 
profit. The bottom line is that the prospect of buyouts 
encourages entry. But we should note that the merger ex 
post increases market concentration. One example of a 
socially perverse effect is that the incumbent may buy the 
entrant's capacity and scrap part of it (that is, the incum- 
bent may hold excess capacity after the merger). For more 
on these ideas, see Rasmusen 1987. 


8.2.2.4 Uncertainty 


Maskin (1986) extends Schmalensee’s version of the 
Spence-Dixit model to allow for uncertainty about de- 
mand or short-run marginal cost. He argues that uncer- 
tainty forces the incumbent to choose a higher capacity to 
deter entry than he would under certainty. This increases 
the cost of entry deterrence, making it less likely. 


8.2.2.5 Capital Accumulation 


The basic model is very simplistic in that it assumes that 
firms can accumulate their capacities all at once. Further- 
more, these capacities cannot be reduced and do not 
depreciate. In practice, capacities are accumulated and 
adjusted over time (possibly in a lumpy way, owing to 
technological indivisibilities). Capacity expansion imposes 
adjustment costs.³⁰ Furthermore, demand grows at the
beginning of the product's life cycle, making early complete
capacity accumulation a costly strategy. Thus, it
is worthwhile to study capital-accumulation games, in
which firms vie for a Stackelberg leadership position; see
section 8.6.1.

30. See Prescott and Visscher 1980 for a model of internal organization that
explains adjustment costs.


8.2.3 Other Forms of Capital


We have seen how physical capital may facilitate the erection of
barriers to entry. Other kinds of capital may have the
same effect.³¹


• Learning by doing In certain industries, the experience
acquired by the established firms during previous production
may be considered to be a form of capital. This
experience gives the existing firms a competitive advantage,
and therefore it can discourage others from entering.
Indeed, certain consulting firms (the Boston Consulting
Group, for example) have suggested that intense early
production promotes learning by doing and thus can be
used for this purpose. The argument is, however,
a bit less clear-cut than it seems, as will be shown in
section 8.4.


Exercise 8.6** (i) A monopolist faces demand curve $q = 1 - p$
in each of two periods (A and B). Its unit cost is $c$ in
period A and $c - \lambda q^A$ in period B, where $q^A$ is the
first-period output (the firm learns by doing). The discount
factor between the periods is $\delta = 1$. Show that the
first-period output is $d/(2 - \lambda)$, where $d = 1 - c$.

(ii) Suppose now that the monopolist (firm 1) faces an
entrant (firm 2, with unit cost $c$) in the second period.
They play Cournot (quantity) competition, which yields
profits

$$\Pi_i^B = (1 + c_j^B - 2c_i^B)^2/9$$

and outputs

$$q_i^B = (1 + c_j^B - 2c_i^B)/3.$$

Write the first-order conditions determining $q_1^A$ when (a)
$q_1^A$ is not observed by the entrant before second-period
competition and (b) $q_1^A$ is observed by the entrant. In
which case is the monopolist's first-period output higher?
(You need not compute $q_1^A$; just give the intuition and the
interpretation in terms of business strategy.) What could
change if the entrant were to face a fixed cost of entry?
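Part (i) of the exercise can be sanity-checked numerically. The sketch below is not a solution method from the text: it assumes hypothetical parameter values (`c = 0.4`, learning parameter `lam = 0.5`), uses the fact that a static monopolist facing $q = 1 - p$ with unit cost $k$ earns $((1 - k)/2)^2$, and finds the first-period output by grid search.

```python
def total_profit(q_a, c, lam):
    """Two-period monopoly profit with discount factor 1 and learning by doing."""
    d = 1.0 - c
    first = (d - q_a) * q_a          # (p - c) * q with p = 1 - q in period A
    d_b = d + lam * q_a              # period-B margin intercept: 1 - (c - lam*q_a)
    second = (d_b / 2.0) ** 2        # static monopoly profit given that intercept
    return first + second


def argmax_first_period_output(c, lam, steps=20_000):
    """Grid search over first-period outputs in [0, 1]."""
    grid = (i / steps for i in range(steps + 1))
    return max(grid, key=lambda q: total_profit(q, c, lam))


c, lam = 0.4, 0.5                    # hypothetical parameter values
numeric = argmax_first_period_output(c, lam)
closed_form = (1.0 - c) / (2.0 - lam)
print(numeric, closed_form)
```

For these values the grid maximizer agrees with $d/(2 - \lambda)$ up to the grid resolution, and it exceeds the static monopoly output $d/2$, reflecting the incentive to produce early in order to learn.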


• Developing a clientele The decision to develop a clientele
is a capital decision that increases the demand for the
product of the established firm. Clearly, if the clientele
attached to the existing firm is considerable, the potential
demand for the entrant is weak. This is well understood
by firms that launch advertising and promotional campaigns
not only to make their product known but also to
“preempt” demand. The more imperfect the consumers'
information and the more important the costs of switching
suppliers, the greater the clientele effect.³²

• Setting up a network of exclusive franchises This is a
capital decision that increases the entrant's distribution
costs.³³ The established supplier can assure himself of the
services of the more capable franchisees by selecting them
initially and imposing exclusivity on them.³⁴ Such an
explanation is offered by some economists for the initial
difficulty encountered by foreign producers attempting
to enter the American automobile market (however, this
argument is debatable because exclusive contracts are
often of short duration).


The last two barriers—developing a clientele and fran- 
chising in the distribution network—are preemptive strate- 
gies. Two other important examples of such strategies are 
the following: 


• Choosing a “strategic place” in a geographical or product
space is often important because of its commitment

31. These are enunciations of the conventional wisdom, which may
oversimplify reality. Two of the examples will be discussed in more detail below.


32. Considering a clientele as a form of capital suggests that the existing firm
should overinvest to block the entry of other firms. Even though such a
strategy may be possible, it is not necessarily optimal, for the following reason.
If entry does occur, the established firm has two types of customers after
entry: its own clients (over whom it still has monopoly power) and the other
consumers (for whom it is competing with the entrants). Of course, the firm
wants to set a high price for the captive clientele and a lower price for the other
consumers. If it cannot price-discriminate, the firm must charge an intermediate
price; this intermediate price is higher the more important is the captive
clientele. Consequently, the existing firm is less aggressive after entry, when it
has a large clientele; it has become a “fat cat,” which may make entry profitable.
Therefore, overinvesting in clientele may not necessarily be the best way to
prevent entry. See Schmalensee 1983. (See also Baldini 1983, Fudenberg and
Tirole 1984, and note 43 below.)


33. Salop and Scheffman (1983) include this type of strategic behavior in their
category of behaviors that “raise the rival's costs.”


34. See subsection 4.6.2. 
effect (the fixed cost of establishing oneself cannot be
easily recouped; “the firm is here to stay”). See subsection
8.6.2 for further discussion.


competitive behavior by existing firms. Those responsible
for fostering competition (antitrust authorities) are well
aware that things are not so simple. They have a very
difficult time proving that a certain type of behavior is
detrimental to competition. In fact, they have less information
than the firms about demand functions, cost structures,
the quantities of accumulated capital, and so on.
Government decision-makers face a dilemma. Certainly
they cannot prosecute an existing firm for increasing the
demand for its product by providing information to con-


But how can we know if a firm has accumulated its “capital”
in a totally innocent fashion? The problem is that
most of the decisions that make a firm healthy also elevate
it to a power position with respect to potential

The point of the Stackelberg model is that commitments
matter because of their influence on the rivals' actions.
In the capacity-accumulation game, the incumbent overinvests
to force the entrant to restrict his own capacity.
The goals of this section are to define the notions of
“overinvestment” and “underinvestment” and, more
generally, to supply a two-period framework within
which to think of business strategies, including a taxonomy
of possible strategies. The ideas that underlie this
section have been known informally for a long time.
Recently Fudenberg and Tirole (1984) and Bulow, Geanakoplos,
and Klemperer (1985b) have independently offered
a framework that systematizes these ideas.³⁶ The outcomes
of many strategic interactions in industrial organization
can be predicted using the basic framework of
strategic effects in the simple two-period model.

Consider the following two-period, two-firm model. In
period 1, firm 1 (the incumbent) chooses some variable $K_1$
(for example, capacity). We will call $K_1$ an investment,
although, as we will see, that word must be taken in a
very large sense. Firm 2 observes $K_1$ and decides whether
to enter. If it does not enter, it makes zero profit. The
incumbent then enjoys a monopoly position in the second
period and makes profit

$$\Pi^{1m}(K_1, x_1^m(K_1)),$$

where $x_1^m(K_1)$ is the monopoly choice in the second period
as a function of $K_1$ (for instance, $x_1$ is firm 1's output). If
firm 2 enters, the firms make simultaneous second-period
choices $x_1$ and $x_2$. Their profits are then

$$\Pi^1(K_1, x_1, x_2)$$

and

$$\Pi^2(K_1, x_1, x_2).$$

By convention, firm 2's entry cost is part of $\Pi^2$. These
functions are assumed to be differentiable.

Suppose that firm 1 chooses some level $K_1$ (take it as
given in this paragraph) and that firm 2 enters. The post-entry
choices $x_1$ and $x_2$ are determined by a Nash equilibrium.
The subsequent analysis of the effect of changes
in $K_1$ on the Nash equilibrium assumes that this Nash
equilibrium,

$$\{x_1^*(K_1), x_2^*(K_1)\},$$

is unique and stable. “Stability” has to do with the following
thought experiment: Suppose that firm 1 picks an
arbitrary $x_1$. Let firm 2 react by choosing an action $R_2(x_1)$
that maximizes $\Pi^2(K_1, x_1, x_2)$ over $x_2$. Then let firm 1
react to $R_2(x_1)$ by choosing an action $R_1(R_2(x_1))$ that
maximizes $\Pi^1(K_1, \tilde{x}_1, R_2(x_1))$ over $\tilde{x}_1$. And so forth. This
yields a sequential adjustment process in which both firms
36. The terms strategic complements and strategic substitutes were coined by
Bulow et al. The “animal” terminology is taken from Fudenberg and Tirole.




Figure 8.8
Stable second-period equilibrium.


are myopic (they ignore the effect that their adjustment
has on their rival; alternatively, they are rational
but have discount factor $\delta = 0$). The Nash equilibrium,

$$\{x_1^*(K_1), x_2^*(K_1)\},$$

is stable if such an adjustment process converges to the
equilibrium allocation from any initial position.³⁷ Stability
is illustrated in figure 8.8.
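The adjustment process behind this notion of stability can be simulated. The following Python sketch assumes a hypothetical linear Cournot game (inverse demand $p = 1 - x_1 - x_2$, constant unit costs) in place of the general profit functions; the function names are illustrative. Firms alternate myopic best responses, starting from an arbitrary $x_1$.

```python
def reaction(x_other, cost):
    """Best response in a linear Cournot game with inverse demand p = 1 - x1 - x2."""
    return max(0.0, (1.0 - cost - x_other) / 2.0)


def adjust(x1_start, c1, c2, rounds=60):
    """Alternate myopic best responses: firm 2 reacts to x1, then firm 1 reacts, and so on."""
    x1 = x1_start
    x2 = reaction(x1, c2)
    for _ in range(rounds):
        x1 = reaction(x2, c1)
        x2 = reaction(x1, c2)
    return x1, x2


def nash(c1, c2):
    """Closed-form interior Nash equilibrium of the same linear game."""
    return (1.0 - 2.0 * c1 + c2) / 3.0, (1.0 - 2.0 * c2 + c1) / 3.0


for start in (0.0, 0.3, 0.9):
    print(start, adjust(start, 0.1, 0.2), nash(0.1, 0.2))
```

In this example each reaction curve has slope $-1/2$, so every round shrinks the distance to the equilibrium and the process converges from any starting point; with steeper reaction curves the same iteration could diverge.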

Let us now consider the incumbent's first-period choice
of $K_1$. We will say that entry is deterred if $K_1$ is chosen
such that

$$\Pi^2(K_1, x_1^*(K_1), x_2^*(K_1)) \le 0.$$

(This includes the case where entry is blockaded, that is,
where the monopoly choice of $K_1$ deters entry.) Entry is
accommodated if

$$\Pi^2(K_1, x_1^*(K_1), x_2^*(K_1)) > 0.$$

Which of the two cases must be examined depends on
whether the incumbent finds it advantageous to deter or
to accommodate entry. For simplicity, we will also assume
that

$$\Pi^1(K_1, x_1^*(K_1), x_2^*(K_1)) \quad \text{and} \quad \Pi^{1m}(K_1, x_1^m(K_1))$$

are strictly concave in $K_1$ and that the functions $x_i^*(\cdot)$ are
differentiable.


8.3.1 Deterrence of Entry 


We ignore the uninteresting case in which entry is blockaded
(that case is void of strategic interactions). Thus,
the incumbent chooses a level of $K_1$ so as to just deter
entry³⁸:

$$\Pi^2(K_1, x_1^*(K_1), x_2^*(K_1)) = 0.$$

Let us consider which strategy firm 1 can use to make firm
2's entry unprofitable. For this, let us take the total derivative
of $\Pi^2$ with respect to $K_1$. From the second-period
optimization,

$$\frac{\partial \Pi^2}{\partial x_2}(K_1, x_1^*(K_1), x_2^*(K_1)) = 0.$$

Thus, the effect of $K_1$ on $\Pi^2$ through firm 2's second-period
choice should be ignored (this is the envelope
theorem). Only two terms remain:

$$\frac{d\Pi^2}{dK_1} = \underbrace{\frac{\partial \Pi^2}{\partial K_1}}_{\text{Direct effect}} + \underbrace{\frac{\partial \Pi^2}{\partial x_1}\,\frac{dx_1^*}{dK_1}}_{\text{Strategic effect}}.$$

By changing $K_1$, firm 1 may have a direct effect on firm
2's profit ($\partial \Pi^2/\partial K_1$). For instance, if $K_1$ is the clientele
accumulated by firm 1 before the entry of firm 2, a greater
clientele reduces the size of the market and thus lowers
firm 2's profit independent of any strategic effect. Often,
however, $\partial \Pi^2/\partial K_1 = 0$. This is the case when $K_1$ is an
investment that affects only firm 1's technology, such as
the choice of a capacity or a technique.³⁹ Any effect on firm


37. For more on stability in oligopoly models, see Cournot 1838, Fisher 1961,
Hahn 1962, Seade 1980, and Dixit 1986. The condition for local stability is
$\Pi^1_{11}\Pi^2_{22} > \Pi^1_{12}\Pi^2_{21}$. (Hint: Compare the slopes at the Nash equilibrium.) For
a version in which firms behave rationally (that is, anticipate subsequent
reactions and discount the future), see subsection 8.6.1.1.


38. From the continuity of $\Pi^1$ and $\Pi^2$ and the uniqueness of $x_1^*$ and $x_2^*$,
$x_1^*(K_1)$ and $x_2^*(K_1)$ are continuous in $K_1$ (from the “theorem of the maximum”).
Hence, $\Pi^2$ is continuous in $K_1$. Suppose that

$$\Pi^2(K_1, x_1^*(K_1), x_2^*(K_1)) < 0.$$

Then firm 1 can increase or decrease $K_1$ a bit while still deterring entry
(from the continuity of $\Pi^2$). This means that the constraint that firm 1 deters
entry is not locally binding at the optimal $K_1$. From the concavity of $\Pi^1$ and
that of $\Pi^{1m}$, the entry-deterrence constraint is not binding globally, which
means that entry is blockaded (the case we ruled out).


39. Unless firm 1's investment bids up the price of investment goods for firm
2, or firm 1's investment has spillover or learning effects on firm 2.
2's profit is then channeled through firm 1's post-entry
choice. The strategic effect comes from the fact that $K_1$
changes firm 1's ex post behavior (by $dx_1^*/dK_1$), thus
affecting firm 2's profits (in proportion to $\partial \Pi^2/\partial x_1$). The


tegic effects. 


top dog: Be big or strong to look tough or aggressive.
puppy dog: Be small or weak to look soft or inoffensive.
lean and hungry look: Be small or weak to look tough or
aggressive.
fat cat: Be big or strong to look soft or inoffensive.


If investment makes firm 1 tough, then firm 1 should
“overinvest” to deter entry; that is, it should use the “top
dog” strategy. If investment makes firm 1 soft, that firm
should “underinvest” (i.e., stay lean and hungry) to deter
entry.




Figure 8.9 
A firm’s reaction curve moves outward with a decrease in marginal 
cost. 


stitutes, the effect of an increase in $K_1$ can be represented
as in figure 8.9. When firm 1's cost decreases, that firm
has an incentive to produce more, which lowers the marginal
value of output for firm 2. The new equilibrium
involves a higher output for firm 1 and a lower output
for firm 2. The main point, though, is that investment
makes firm 1 tough (it raises $q_1^*$, which hurts firm 2).
Hence, the “top dog” strategy is appropriate to deter firm
2's entry.
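A small numerical sketch may help fix the decomposition into direct and strategic effects for a cost-reducing investment. Everything specific here is an assumption made for illustration: inverse demand $p = 1 - x_1 - x_2$, firm 1's unit cost equal to `base_cost - k1`, and the function names. In this setup the direct effect on firm 2 is zero, $\partial \Pi^2/\partial x_1 = -x_2$, and $dx_1^*/dK_1 = 2/3$.

```python
def cournot_outputs(c1, c2):
    """Nash outputs for inverse demand p = 1 - x1 - x2 and constant unit costs."""
    x1 = (1.0 - 2.0 * c1 + c2) / 3.0
    x2 = (1.0 - 2.0 * c2 + c1) / 3.0
    return x1, x2


def firm2_profit(k1, base_cost=0.4, c2=0.4):
    """Firm 2's gross profit when firm 1's unit cost is base_cost - k1 (hypothetical rule)."""
    x1, x2 = cournot_outputs(base_cost - k1, c2)
    price = 1.0 - x1 - x2
    return (price - c2) * x2


def strategic_effect(k1, base_cost=0.4, c2=0.4):
    """(dPi2/dx1) * (dx1*/dK1): here dPi2/dx1 = -x2 and dx1*/dK1 = 2/3."""
    _, x2 = cournot_outputs(base_cost - k1, c2)
    return -x2 * (2.0 / 3.0)


k1, h = 0.05, 1e-6
numerical_total = (firm2_profit(k1 + h) - firm2_profit(k1 - h)) / (2.0 * h)
print(numerical_total, strategic_effect(k1))
print(cournot_outputs(0.4, 0.4), cournot_outputs(0.3, 0.4))
```

The finite-difference derivative of firm 2's profit coincides with the strategic effect alone, and it is negative: the investment raises firm 1's output and lowers firm 2's profit, which is what "investment makes firm 1 tough" means in this example.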


Exercise 8.7* Suppose that, in the modified version of the
Spence-Dixit game discussed above (where firm 1's investment
reduces its marginal cost), the second-period
competition is in prices. The two products are differentiated
and are substitutes (see, e.g., the location model of
chapter 7). Prices are strategic complements. Using a
diagram, argue that firm 1 must overinvest to deter entry
(assuming that entry is not blockaded).


Example A case was mentioned earlier in which $K_1$ was
firm 1's pre-entry clientele. (One may, for instance, think
of $K_1$ as firm 1's expenditures that make switching costly

way. Consider the hypothetical situation in which $K_1$ is not observed
by firm 2 before the entry and second-period decisions. The
resulting equilibrium is usually called an open-loop equilibrium, because
firm 2's strategy is not contingent on the actual choice of $K_1$, which is
not observed at the time of its decision. (A closed-loop strategy would
depend on the actual level of $K_1$.) The open-loop case is an interesting
benchmark against which to compare the effect of an observable change
in $K_1$. If investment makes firm 1 tough,
the equilibrium entry-deterring level of $K_1$ exceeds the open-loop level
(overinvestment); and conversely if investment makes firm 1 soft.


41. In section 8.2, the investment was a capacity level, and $c_1$ was not constant
with output. But the important feature is that the investment reduces the
marginal cost.


42. The proof of this is the same as the proof that a monopoly’s optimal price 
increases with its marginal cost (see chapter 1). 
to at least some of its customers.⁴³) The direct effect of
$K_1$ is to reduce firm 2's potential market ($\partial \Pi^2/\partial K_1 < 0$).
However, the strategic effect has the opposite impact on
firm 2's profit if firm 1 is not able to price-discriminate
between its consumers; ideally, firm 1 would like to charge
a high price to its captive clients and a low price to the
noncaptive segment of the market, for which it is competing
with firm 2. In the absence of price discrimination,
however, an intermediate price is quoted, which intuitively
increases with the size of the captive clientele. That
is, a sizable clientele may make one a pacifistic fat cat,
which is bad for entry deterrence. The overall effect
$d\Pi^2/dK_1$ is thus ambiguous, and, depending on the parameters,
either the “top dog” strategy or the “lean and hungry
look” strategy may be appropriate to deter entry.


8.3.2 Accommodation of Entry


Suppose now that firm 1 finds deterring entry too costly.
Whereas firm 1's first-period behavior in the entry-deterrence
case was dictated by firm 2's profit, which had
to be driven down to zero, it is dictated by firm 1's profit
in the entry-accommodation case. The incentive to invest
is given by the total derivative of

$$\Pi^1(K_1, x_1^*(K_1), x_2^*(K_1))$$

with respect to $K_1$.

From the envelope theorem, the effect on $\Pi^1$ of the
change in firm 1's second-period action is of the second
order. Thus, our basic equation in the entry-accommodation
case is

$$\frac{d\Pi^1}{dK_1} = \underbrace{\frac{\partial \Pi^1}{\partial K_1}}_{\text{Direct effect}} + \underbrace{\frac{\partial \Pi^1}{\partial x_2}\,\frac{dx_2^*}{dK_1}}_{\text{Strategic effect}}.$$


Again, we can decompose this derivative into two effects.
The direct or “cost minimizing” effect is $\partial \Pi^1/\partial K_1$. This
effect would exist even if firm 1's investment were not
observed by firm 2 before the choice of $x_2$, and therefore
could not affect $x_2$. Thus, we will ignore this effect for the
purpose of our classification. The strategic effect results
from the influence of the investment on firm 2's second-period
action. In the case of entry accommodation, we
will say that firm 1 should overinvest (underinvest) if the
strategic effect is positive (negative).⁴⁴

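The sign of the strategic effect can be related to the
investment making firm 1 tough or soft and to the slope
of the second-period reaction curve. To do this, assume
that the second-period actions of both firms have the
same nature, in the sense that $\partial \Pi^1/\partial x_2$ and $\partial \Pi^2/\partial x_1$ have
the same sign. For instance, if the second-period competition
is in quantities (prices), $\partial \Pi^i/\partial x_j < 0$ ($> 0$). Using the
fact that

$$\frac{dx_2^*}{dK_1} = \left(\frac{dx_2^*}{dx_1}\right)\left(\frac{dx_1^*}{dK_1}\right) = R_2'(x_1^*)\,\frac{dx_1^*}{dK_1}$$

by the chain rule, and arranging, we obtain

$$\operatorname{sign}\left(\frac{\partial \Pi^1}{\partial x_2}\,\frac{dx_2^*}{dK_1}\right) = \operatorname{sign}\left(\frac{\partial \Pi^2}{\partial x_1}\,\frac{dx_1^*}{dK_1}\right) \times \operatorname{sign}(R_2').$$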


43. For other examples of clienteles, see Schmalensee 1983, Baldini 1983, and
Fudenberg and Tirole 1984. Of particular interest here are the switching-cost
models of Klemperer and that of Farrell and Shapiro. Klemperer (1984, 1985a, b)
analyzes the effect of switching costs in a two-period duopoly model in which
there is competition in the first period and ex post monopoly (due to lock-in) in
the second period. He shows how frequent-flyer discounts given by airlines to
consumers in the first period to be used in the second period lead to weak price
competition in the second period and may not benefit the consumers.
(Frequent-flyer discounts differ from the most-favored-nation clause, discussed
in section 8.4 below, in that the second-period discounts are not attached to the
first-period price. They otherwise have similar collusive second-period effects.)
More generally, the second-period rents stemming from switching costs induce
intense first-period competition. Farrell and Shapiro (1987) introduce successive
cohorts of consumers (via a model in which generations overlap) and show how
a larger firm with a clientele may “milk” that clientele by charging a high price
whereas a smaller firm charges a low price to attract young customers and build
a clientele. For background on switching costs, see von Weizsäcker 1984 and
the discussion of clienteles in chapter 2 of the present volume.
44. Again the concepts of over- and underinvestment can be characterized by
comparing the optimal $K_1$ to the open-loop solution (i.e., the solution of the
same game except that $K_1$ is not observable by firm 2 prior to its decision). See
footnote 40. Given the concavity of $\Pi^1$, the optimal $K_1$ exceeds the open-loop
solution if and only if the strategic effect is positive. Hint: In the open-loop
solution, $K_1$ is given by

$$\frac{\partial \Pi^1}{\partial K_1}(K_1, x_1^*(K_1), x_2^*(K_1)) = 0.$$

This implies that for a positive strategic effect

$$\frac{d\Pi^1}{dK_1}(K_1, x_1^*(K_1), x_2^*(K_1)) > 0.$$

This characterization does not generalize to the case in which both firms make
decisions $K_1$ and $K_2$ in the first period. Even if both firms' strategic effect is
positive, firm 2 (say) may invest less because firm 1 invests more and reduces
the marginal value of firm 2's investment (this may occur, e.g., when the
strategic effect is much stronger for firm 1).




The sign of the strategic effect, and therefore the over- or
underinvestment prescription, is contingent on the sign of
the strategic effect in the entry-deterrence case (which is
equivalent to whether investment makes firm 1 tough or
soft, when there is no direct effect in the entry-deterrence
case) and on the slope of firm 2's reaction curve. We are
thus led to distinguish four cases, depending on whether


results, and those for the entry-deterrence case, are
summarized in figure 8.10.


[Figure 8.10: a two-by-two table. Columns: investment makes firm 1
tough or soft. Rows: strategic complements (R' > 0) or strategic
substitutes (R' < 0).]

Figure 8.10
Optimal business strategies. (A stands for accommodation of entry,
D for deterrence.)
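
The taxonomy behind figure 8.10 can be written as a small lookup. The sketch below is illustrative only: the function name and its boolean arguments are hypothetical, and the four labels are those of the standard Fudenberg-Tirole animal taxonomy (top dog and fat cat mean overinvestment; puppy dog and lean and hungry mean underinvestment).

```python
def optimal_strategy(investment_makes_tough: bool,
                     strategic_complements: bool,
                     accommodate: bool) -> str:
    """Return the label of firm 1's first-period strategy.

    Hypothetical helper: the arguments encode the two questions of the
    taxonomy (tough or soft investment; slope of firm 2's reaction curve)
    plus the incumbent's goal (accommodate entry, or deter it / induce exit).
    """
    if not accommodate:
        # Deterrence or exit inducement: only firm 2's profit matters,
        # so the slope of the reaction curve plays no role.
        return "top dog" if investment_makes_tough else "lean and hungry"
    if strategic_complements:
        # Price-type competition: looking aggressive backfires.
        return "puppy dog" if investment_makes_tough else "fat cat"
    # Quantity-type competition: looking aggressive pays off.
    return "top dog" if investment_makes_tough else "lean and hungry"


# Strategies that call for overinvestment; the other two call for underinvestment.
OVERINVEST = {"top dog", "fat cat"}

if __name__ == "__main__":
    for tough in (True, False):
        for complements in (True, False):
            for accommodate in (True, False):
                label = optimal_strategy(tough, complements, accommodate)
                print(tough, complements, accommodate, label, label in OVERINVEST)
```

With strategic substitutes the label is the same under accommodation and deterrence, which is the point made in the text about the quantity game.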


i.e., be a top dog. Thus, firm 1's strategy is the same
whether it wants to deter or to accommodate entry, be- 
cause being tough both hurts and softens firm 2 in the 
quantity game. 

The picture is different in the price game. A lower price 
for firm 1 forces firm 2 to charge a lower price, which 
hurts firm 1. Thus, firm 1 should underinvest (i.e., keep 
a puppy-dog profile) so as not to look aggressive and 
trigger an aggressive reaction by firm 2. Firm 1's strategy 
is then very different depending on whether it wants to 
deter or to accommodate entry (deterrence calls for the 
“top dog” strategy), because being tough both hurts and 
toughens firm 2 in the price game. 

At this point, one is likely to think: “I have a clear 
picture of the entry-deterrence case, in which firm 1 
ought to overinvest. In the entry-accommodation case, 
however, the optimal strategy relies too much on the 
type of ex post competition (price or quantity); how can 
I make up my mind whether firm 1 should be a top dog 
or a puppy dog?” An element of the answer to this query 
can be found in chapter 5, where we interpreted quantity 
competition as capacity competition. To find the optimal
strategy, we must ask whether the investment $K_1$
reduces the marginal cost of accumulating capacity or that
of producing. In the context of this model, we would thus
predict strong strategic investment to accommodate entry
when this investment reduces the costs of accumulating
capacity. In contrast, a firm may be less eager to reduce


45. We assume that $\partial \Pi^2/\partial K_1 = 0$, so that we can identify “toughness” or
“softness” with the sign of the strategic effect in the entry-deterrence case. If
$\partial \Pi^2/\partial K_1 \neq 0$, the taxonomy under accommodation is relative to the sign of
this effect rather than to “toughness” and “softness.”
production costs and trigger tough price competition


cases, firm 1 wants to make firm 2 unprofitable in the
second period. That is,

$$\Pi^2(K_1, x_1^*(K_1), x_2^*(K_1)) < 0$$

is the relevant objective for firm 1, where $\Pi^2$ includes
entry or exit costs. Thus, firm 1's behavior is driven by
firm 2's profit, and the strategic taxonomy is identical in
both cases. In particular, D can be replaced by “D or E” in
figure 8.10, where E stands for exit inducement.


$K_1$ need not even be an action taken by firm 1; it can be
any variable that influences date-2 competition. In example
5 below, $K_1$ refers to firm 1's presence in another
market. In example 6, $K_1$ denotes some variable outside
the control of firm 1 (a quota, a tariff, or a subsidy). Again,
the taxonomy can be applied as long as we can determine
whether $K_1$ makes firm 1 tough or soft. The only modification
is verbal: When $K_1$ is not controlled by firm 1, the
“over- or underinvestment” prescription is replaced by the
prescription that a higher $K_1$ benefits or hurts firm 1. We
can also enlarge the set of applications to cases in which
all firms play strategically in period 1, or to multiperiod
games. The simple model of section 8.3 is again the key
to understanding these slightly more complicated models.

In the following applications, we will assume that prices 
are strategic complements and that quantities (i.e., capac- 
ities) are strategic substitutes. This crucial assumption will 
be discussed in section 8.5. 


Example 1: Voluntary Limitation of Capacity 


In chapter 5 we analyzed a two-stage (accommodation) 
game in which firms accumulated capacities and then 
charged prices. We observed that firms accumulate non- 
competitive amounts of capacity (under some circum- 
stances, the Cournot levels). Prior capacity accumulation 
was seen as one way out of the Bertrand paradox. What 
prevents a firm from accumulating a large amount (a 
competitive level) of capacity is that by accumulating a 
small capacity, each firm signals that it will not play an 
aggressive price strategy, and there is no point to cutting 
the price if one cannot satisfy demand. This signal softens 
the pricing behavior of the firm’s rivals. Such a voluntary 
limitation of capacity is an instance of “puppy dog” behavior.
Gelman and Salop (1983) make this point nicely.
They consider a model in which an entrant enters on
a very small scale so as not to trigger an aggressive
response by a large-capacity incumbent.⁴⁷ (The entrant is
the strategic player in this example.) As Wilson notes,


46. See Shapiro 1986 for a useful and more extensive list of applications.


similar to the Stackelberg follower behavior in section 8.2. The
game considered by Gelman and Salop is, however, different. The entrant,
firm 2, chooses both a capacity ($K_2$) and a price ($p_2$). The incumbent, firm 1, has


not pick $p_2$ above the monopoly price $p^m$, because firm 1 would
[...] to this monopoly price. Hence, when faced with $\{p_2 < p^m, K_2\}$,
the incumbent’s optimal strategy is either to undercut $p_2$ by $\varepsilon$ (which firm 2
wants to avoid) or to charge $p_1 > p_2$ so as to maximize its profit given the
residual demand. For instance, for the efficient-rationing rule (see chapter 5), the
residual demand is $D(p_1) - K_2$. Firm 2 thus chooses $p_2 < p^m$ and $K_2$ so as to
maximize $(p_2 - c)K_2$ subject to the no-undercutting constraint:

$$\max_{p_1}\,\{(p_1 - c)[D(p_1) - K_2]\} \ge (p_2 - c)D(p_2).$$

To make undercutting unattractive to firm 1, firm 2 chooses a low enough price
and restricts its capacity. Gelman and Salop call this strategy “judo economics.”


Their paper also includes an interesting theoretical account of the 1979 coupon
war among the major airlines in the United States.
A useful example is an incumbent hotel in a resort location: an
entrant that builds a comparable but small hotel with lower
rates (a pension) can expect accommodation from a large hotel,
since it is in the larger one's interest to serve the overflow
from the smaller one rather than to cut its price to compete


le supplies, the entrant can also tailor its product to a
market segment. In the hotel example a menu of health
[...] might suffice. (1984, p. 41)
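
The entrant's problem in the Gelman and Salop game (maximize its profit subject to the incumbent preferring accommodation to undercutting) can be explored numerically. The following is only a sketch under assumptions that are not in the text: linear demand D(p) = 1 - p, a common marginal cost of zero, efficient rationing, and a coarse grid search. All function names are hypothetical.

```python
C = 0.0  # assumed common marginal cost


def demand(p: float) -> float:
    """Assumed linear market demand D(p) = 1 - p."""
    return max(0.0, 1.0 - p)


def incumbent_accommodation_profit(k2: float) -> float:
    """Incumbent's best profit on the residual demand D(p1) - k2."""
    p1 = (1.0 - k2 + C) / 2.0  # maximizer of (p1 - C) * (1 - p1 - k2)
    return (p1 - C) * max(0.0, 1.0 - p1 - k2)


def incumbent_undercut_profit(p2: float) -> float:
    """Incumbent's profit from undercutting p2 by a negligible amount."""
    return (p2 - C) * demand(p2)


def entrant_optimum(steps: int = 400):
    """Grid search for the entrant's best (profit, price, capacity)."""
    p_monopoly = (1.0 + C) / 2.0
    best = (0.0, 0.0, 0.0)
    for i in range(1, steps):
        k2 = i / steps
        accommodate = incumbent_accommodation_profit(k2)
        for j in range(1, steps):
            p2 = p_monopoly * j / steps
            # No-undercutting constraint: accommodation must pay at least
            # as much as undercutting for the incumbent.
            if accommodate >= incumbent_undercut_profit(p2):
                profit = (p2 - C) * min(k2, demand(p2))
                if profit > best[0]:
                    best = (profit, p2, k2)
    return best


if __name__ == "__main__":
    profit, p2, k2 = entrant_optimum()
    print(f"entrant price {p2:.3f}, capacity {k2:.3f}, profit {profit:.4f}")
```

In this parametrization the entrant ends up with both a price well below the monopoly price and a capacity well below market demand, which is the “judo” combination described in the footnote.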


Example 2: The Principle of Differentiation


In chapter 7 we considered two-stage (accommodation)
games of entry and location followed by price competition.
We enunciated the principle of differentiation, according
to which firms may not want to locate next to


price competition. Here the first-period variable is
location. 

It is particularly instructive to recall the location game
studied in section 7.1, in which two firms choose locations
along a segment. When analyzing a firm's optimal
decision, we saw that there are two effects: Moving toward
the center of the segment increases the firm's market
share and profit at given prices. This corresponds
to the direct or cost-minimizing (here, profit-maximizing)
effect of section 8.3. Moving toward the other firm
increases the intensity of price competition. This strategic
effect dictates that a firm locate as far as possible from
the other firm. We saw that firms differentiate their
products. (In our example, the strategic effect is so strong
that maximal differentiation occurs.) Product differentiation
is another instance of “puppy dog” behavior.⁴⁸
Identifying a firm's capital as the closeness of its location
to the center of the segment, each firm is willing
to accumulate less capital (i.e., locate away from the
center) than it would do if its rival’s action (price) were
fixed.


Example 3: Learning by Doing⁴⁹


It is often argued that experience effects can be used for 
strategic purposes. Indeed, in the 1970s some consulting 
firms recommended to their clients that they sacrifice 
short-run profits early in the product life cycle in order to 
gain strategic position, on the ground that by producing 
a lot early (i.e., by cutting its price) a firm can quickly slide
down the learning curve and deter the entry (or at least 
restrict the expansion) of other firms—see, e.g., Boston 
Consulting Group 1972. 

Learning by doing is similar to investing in technology 
or in capacity in that both reduce the firm’s future cost. 
(Here we assume specific learning. Learning externalities 
are analyzed later.) There is an important difference, how- 
ever, between learning by doing and other investments: 
The cost of learning by doing is not exogenous to the 
market, but rather follows from the firm's production 
experience. 

For ease of exposition, let us consider a two-firm 
rivalry. Let each firm's second-period marginal cost decrease
with its first-period output. Suppose first that competition
takes place in quantities in both periods. By
increasing its output in period 1, a firm signals that it will
produce a higher output in period 2 because of the learning
effect. With strategic substitutes, this reduces the
other firm’s output at date 2. Thus, the “top dog” strategy 
of accumulating experience early is optimal under accom- 
modation and quantity competition. It is also optimal to 
deter entry, because the incumbent firm's lower second- 
period cost hurts the entrant. 

Determining the optimal strategy under price competi- 
tion is slightly more complex. The “top dog” strategy is 
still optimal for entry deterrence: By charging a low price 
today, the incumbent accumulates experience, which 
induces it to charge a low price tomorrow. Entry accom- 
modation when there is only one firm in the market at 
date 1 (as in section 8.3) yields the opposite result: Expe- 
rience induces a low price, which triggers a low price from 
the rival. The “puppy dog” strategy of underinvestment 
in experience (i.e., a high first-period price) is then called


48. This is a bit loose, because the case of entry deterrence or exit inducement
involves a direct effect: If $K_1 = a$ (in the notation of chapter 7), then
$\partial \Pi^2/\partial K_1 < 0$. But $(\partial \Pi^2/\partial p_1)(\partial p_1^*/\partial K_1)$ is also negative, so we can identify
“toughness” and the strategic effect.
49. The strategic aspects of learning by doing with and without spillovers have
been analyzed by Spence (1981, 1984), Fudenberg and Tirole (1983a), Stokey
(1986), and Mookherjee and Ray (1986), among others. The present discussion
follows Fudenberg and Tirole 1986. This field owes much to the early analysis
of Arrow (1962).
for. In contrast, accommodation when both firms are in
the market at date 1 yields ambiguous results. On the
one hand, a low price today increases the firm’s output
and hence its experience, making the firm aggressive
tomorrow and triggering a low price from its rival (this is
the previous strategic effect); however, a low price also
reduces the rival’s market share and, therefore, reduces its
experience. The rival faces a higher second-period cost
and, therefore, is less aggressive in that period. This second
effect, which does not exist for quantity competition
because a firm cannot affect its rival’s current output, calls
for the “top dog” strategy. It is not clear a priori which
effect dominates.

To summarize: With specific learning by doing, the
“top dog” strategy of accumulating lots of experience is
optimal to deter entry or induce exit. It is also optimal for
entry accommodation under quantity competition, but it
may or may not be optimal under price competition.
Thus, for entry accommodation it matters whether the
learning by doing refers to a reduction in investment
ity competition) or to a reduction in produc- 
tion costs (price competition). 


Let us now consider the possibilities of diffusion of
learning across firms (spillovers).⁵⁰ Such externalities
can take place through interfirm mobility of employees,
through spying, or through reverse engineering (i.e.,
taking a product apart to learn how it was built). Learning
by doing then somewhat resembles a public good and is
therefore likely to be undersupplied. The new strategic
effect arising from the diffusion of learning runs counter
to the “top dog” tendency associated with specific learning:
No firm is willing to accumulate experience that
helps its rival to reduce its cost and thus to be more

it to charge a high price. As we saw earlier, this can be 
achieved by restraining investments that reduce produc- 
tion costs. There are, of course, other ways to commit to 
a high price. One way is to grant current customers a 
most-favored-customer status or price protection. (See 
Hay 1982 and Salop 1986. The analysis here relies more 
particularly on Cooper's [1986] formal treatment of such
policies.) 

The most-favored-customer policy guarantees a firm's 
current customers that they will be reimbursed the dif- 
ference between the current price and the lowest price 
offered in the future (up to some specified date). For in- 
stance, in the 1960s and the early 1970s the two manufac- 
turers of turbine generators, General Electric and Westing- 
house, offered a price-protection policy effective during 
the six months following a sale.⁵²

Before we consider why such a policy may help firms
collude, it may be useful to recall the Stackelberg price-
leadership story. Consider a duopoly producing differentiated
products. Figure 8.11 depicts the reaction curves
and the Nash (simultaneous-move) equilibrium ($p_1^*$, $p_2^*$).
Suppose now that firm 1 chooses its price before firm 2.
If it raises its price slightly above $p_1^*$ to $\bar p_1$, its profits are
affected only to the second order by the fact that $\bar p_1$ is



Figure 8.11 
Stackelberg price leadership 


50. See Lieberman 1984 for evidence of the diffusion of experience in the
chemical industry.


with linear demand and quantity competition, the firms’
first-period output can be shown to decrease with the degree of diffusion
by learning (in spite of the fact that diffusion increases total experience and
therefore increases second-period output, making first-period learning more
desirable). See, e.g., Fudenberg and Tirole 1983a.


52. These firms ended the practice as part of a settlement to avoid antitrust
action.
not an optimal response to $p_2^*$ (according to the envelope
theorem). To this direct effect can be added the indirect
effect that firm 2 reacts by raising its price. This indirect
effect raises firm 1's profit to the first order. Firm 1, the
Stackelberg leader, thus chooses a price exceeding $p_1^*$.⁵³

The Stackelberg story gives the intuition of why even


firms choose their prices simultaneously in each
period. The demand functions $D_i(p_i, p_j)$ and the cost func-


$\bar q_1 = D_1(\bar p_1, \bar p_2)$,
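Firm 1's second-period profit is thus

$$\tilde\Pi^1(p_1, p_2) = \begin{cases} \Pi^1(p_1, p_2) & \text{if } p_1 \ge \bar p_1 \\ \Pi^1(p_1, p_2) - (\bar p_1 - p_1)\,\bar q_1 & \text{if } p_1 < \bar p_1 \end{cases}$$

where $\Pi^1(p_1, p_2) = (p_1 - c)D_1(p_1, p_2)$.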


Thus, firm 1's marginal profit in the second period
exhibits a discontinuity at $p_1 = \bar p_1$. To see that

$$\{p_1 = \bar p_1,\; p_2 = R_2(\bar p_1)\}$$

is the second-period price equilibrium, draw firm 1's
second-period reaction curve $\tilde R_1$. (Firm 2, which by assumption
has not imposed price protection, has its usual
reaction curve, $R_2$.) Whenever the optimal reaction to $p_2$
calls for $p_1 > \bar p_1$ in the usual (no price protection) case, $\tilde R_1$
and $R_1$ clearly coincide. Let $\hat p_2$ be such that $R_1(\hat p_2) = \bar p_1$.
By definition of $R_1(\cdot)$,

$$\Pi^1_1(\bar p_1, \hat p_2) = 0$$

(the subscript denotes the partial derivative with respect to $p_1$).
This implies that

$$\Pi^1_1(\bar p_1, \hat p_2) + \bar q_1 > 0.$$

It is thus easy to see that for $\hat p_2 - \varepsilon$, firm 1 wants to react
by $\bar p_1$ rather than $R_1(\hat p_2 - \varepsilon)$. This is also true for a range
of prices for firm 2. Only when firm 2's price becomes
very low will firm 1 cut below its first-period price
and (with regret) bring the price-protection policy into
play.⁵⁴ Firm 1's second-period discontinuous reaction
curve is depicted in figure 8.12. A useful way to under-


[Figure 8.12: axes $p_1$ and $p_2$; firm 1's reaction curve for demand $D_1$
and its reaction curve for demand $D_1 + \bar q_1$.]

Figure 8.12
Second-period reaction curves when firm 1 offers price protection
at $\bar p_1$.
and the second inequality from the fact that each firm’s profit increases with its
rival's price. 


54. This very low price, $\underline{p}_2$, is given by

$$\Pi^1(\bar p_1, \underline{p}_2) = \max_{p_1}\,\bigl[\Pi^1(p_1, \underline{p}_2) - (\bar p_1 - p_1)\,\bar q_1\bigr].$$
stand this reaction curve is to notice that for $p_1 < \bar p_1$

$$\tilde\Pi^1 = (p_1 - c)[D_1(p_1, p_2) + \bar q_1] - (\bar p_1 - c)\,\bar q_1.$$

The second term in this expression is irrelevant as far as
marginal choices are concerned, so everything is as if firm
1 faced demand $D_1(p_1, p_2) + \bar q_1$ for $p_1 < \bar p_1$. Thus, the
reaction is that for demand $D_1(p_1, p_2) + \bar q_1$ whenever
the reaction falls strictly below $\bar p_1$, and that for demand
$D_1(p_1, p_2)$ whenever the reaction exceeds $\bar p_1$.

Now recall that we choose $\bar p_1$ just greater than $p_1^*$. From
figure 8.12, the second-period price equilibrium is given
by $p_1 = \bar p_1$ and $p_2 = R_2(\bar p_1)$.⁵⁵ In words: Firm 1 has succeeded
in becoming the Stackelberg leader, thus driving
firm 2's price up. Firm 1 increases its profit to the first

price-protection policy, even if the other firm does not
ch a policy. By making future price cuts costly, 


Despite its strategic attractiveness, the most-favored-
customer clause is not widespread. Several rea-


include the cost of indexing the price to inflation and


little applicability.⁵⁶ (3) The practice may face antitrust
prosecution. (4) Price-protection policies are not very
profitable when other firms threaten to enter the market.
Indeed, a “puppy dog” strategy (here, the established
firms’ commitment to a high price) encourages entry. (5)
Even in the accommodation case, the application of price-
protection policies may be delayed by the fact that each
firm wants to be the follower rather than the leader (as is
the case when only one firm elects to offer the policy in
the simultaneous-move game). This may give rise to
situations similar to the war of attrition.⁵⁷
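
The price-protection argument of this example can be checked on a small numerical case. The sketch below rests on assumptions that are not in the text: symmetric linear demands D_i = 1 - p_i + 0.5 p_j, zero marginal cost, a protected price of 0.7 and protected first-period sales of 0.6 (both chosen only for illustration). All function names are hypothetical.

```python
def demand1(p1: float, p2: float) -> float:
    """Assumed linear demand for firm 1 (firm 2's demand is symmetric)."""
    return 1.0 - p1 + 0.5 * p2


def profit1(p1: float, p2: float) -> float:
    """Firm 1's profit without any rebate (zero marginal cost assumed)."""
    return p1 * demand1(p1, p2)


def usual_reaction(p_rival: float) -> float:
    """Best response of either firm when no price protection is in force."""
    return (1.0 + 0.5 * p_rival) / 2.0


def protected_profit(p1: float, p2: float, p_bar: float, q_bar: float) -> float:
    """Second-period profit net of the rebate owed to first-period customers."""
    rebate = max(0.0, p_bar - p1) * q_bar
    return profit1(p1, p2) - rebate


def protected_reaction(p2: float, p_bar: float, q_bar: float) -> float:
    """Firm 1's best response under price protection at p_bar."""
    # Best price among p1 >= p_bar: the usual reaction, floored at p_bar.
    upper = max(usual_reaction(p2), p_bar)
    # Best price among p1 <= p_bar: the reaction for demand D1 + q_bar,
    # capped at p_bar.
    lower = min(usual_reaction(p2) + q_bar / 2.0, p_bar)
    return max((upper, lower), key=lambda p: protected_profit(p, p2, p_bar, q_bar))


def second_period_equilibrium(p_bar: float, q_bar: float, rounds: int = 100):
    """Iterate best responses, starting from the Nash prices (2/3, 2/3)."""
    p1 = p2 = 2.0 / 3.0
    for _ in range(rounds):
        p1 = protected_reaction(p2, p_bar, q_bar)
        p2 = usual_reaction(p1)
    return p1, p2


if __name__ == "__main__":
    p1, p2 = second_period_equilibrium(p_bar=0.7, q_bar=0.6)
    print(p1, p2, profit1(p1, p2), profit1(2.0 / 3.0, 2.0 / 3.0))
```

In this parametrization firm 1 stays at the protected price, firm 2 responds with a price above its Nash level, and firm 1's second-period profit is slightly higher than in the simultaneous-move equilibrium, in line with the Stackelberg-leader interpretation.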


Remark 2 Price protection is one method of softening 
future price competition. Another method (as in example 
2) is to increase product differentiation. Klemperer (1984) 
has argued that discounts for repeated purchases increase 
the cost of customers’ interbrand switching and thus dif- 
ferentiate the products in the future. This raises prices in 
the future. However, price competition is more intense at 
the beginning, because the value of a customer to a firm 
is raised.⁵⁸


Example 5: Multimarket Oligopoly 


The presence of a firm in one market may affect its strate- 
gic position in another market if the two markets are 
somehow related. This is the case when producing for 
two markets involves economies (or diseconomies) of 
scale or scope. Alternatively, the demands on the two 
markets may be interdependent.⁵⁹

Bulow et al. (1985b) consider a duopoly model in which 
firms 1 and 2 are rivals in market 1 and firm 1 is a 
monopoly in market 2. (For concreteness, the two markets 
can be thought of as two different regions.) Here $K_1$,
rather than a choice variable for firm 1, is a parameter 
related to profitability in market 2 (it can be thought of as 
a demand parameter). Bulow et al. show that an increase 
in firm 1's profitability in market 2 may actually reduce 
its total profit. This is because of the strategic effect in 


55. The reaction curves for demands $D_1$ and $D_1 + \bar q_1$ do not converge to each
other (i.e., stay far apart) when $\bar p_1$ converges to $p_1^*$.


56. As Cooper (1986) observes, General Electric and Westinghouse published
books that contained relative prices for each component in order to face the
issue that turbine generators are custom-made. They changed the prices by
adjusting the multiplier.


57. Still another possibility is that firms compete in capacities. We know that
results under accommodation are usually reversed with quantity competition. In
this case, a firm wants to look tough (which a price-protection policy does not
help it achieve). Checking this intuition would require solving the game in
which firms choose capacities, prices, and protection policy.


58. See note 43 above. 


59. For an example of multimarket rivalry with interdependent demands, see 
subsection 8.6.2 below. 
market 1. Suppose that the firms compete in quantities,
that firm 1's production cost depends on the sum of its
outputs in the two markets, and that this technology
exhibits decreasing returns to scale. All quantities (the
two outputs for firm 1 and the output for firm 2) are


chosen simultaneously and noncooperatively. Suppose
that demand increases in market 2. This induces firm 1 to
sell more in that market, which raises firm 1's marginal


2, observing the increase in demand in market 2, will infer
that firm 1 will decrease its output in market 1, and so


stitutes).⁶⁰ Similarly, if firms compete in prices and firm
1's technology exhibits increasing returns to scale, an
increase in firm 1's profitability in market 2 lowers its
marginal cost in market 1 and therefore makes it aggressive
in this market, which triggers a low price by firm 2.
Again, this strategic effect may offset the profitability
increase for firm 1. Either with quantity competition and
economies of scale or with price competition and decreasing
returns to scale, an increase in the profitability of
market 2 unambiguously increases firm 1's profit.
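
The possibility that a more profitable market 2 lowers firm 1's total profit can be illustrated numerically. The sketch below is not the model of Bulow et al.; it assumes, for illustration only, a linear inverse demand a - q1 - q2 in market 1, a fixed price s at which firm 1 can sell any amount m in market 2, a quadratic cost (q1 + m)^2 / 2 for firm 1 (decreasing returns), and zero cost for firm 2. All names are hypothetical.

```python
def equilibrium(a: float, s: float, rounds: int = 200):
    """Cournot equilibrium in market 1 by best-response iteration.

    Valid for a/5 < s < a/2, where firm 1 sells a positive amount in both
    markets and its marginal cost equals the market-2 price s.
    """
    q1 = q2 = 0.0
    for _ in range(rounds):
        # Firm 1: marginal revenue in market 1 equals marginal cost s.
        q1 = max(0.0, (a - q2 - s) / 2.0)
        # Firm 2: standard Cournot best response with zero cost.
        q2 = max(0.0, (a - q1) / 2.0)
    m = max(0.0, s - q1)  # market-2 sales: total output q1 + m equals s
    return q1, q2, m


def firm1_profit(a: float, s: float) -> float:
    """Firm 1's total profit over both markets at the equilibrium."""
    q1, q2, m = equilibrium(a, s)
    price1 = a - q1 - q2
    total_output = q1 + m
    return price1 * q1 + s * m - 0.5 * total_output ** 2


if __name__ == "__main__":
    a = 100.0
    for s in (21.0, 22.0, 23.0):
        q1, q2, m = equilibrium(a, s)
        print(s, round(q1, 3), round(q2, 3), round(m, 3), round(firm1_profit(a, s), 3))
```

With a = 100, raising the market-2 price from 21 to 22 makes firm 1 shift output toward market 2, firm 2 expands in market 1, and firm 1's total profit falls, which is the strategic effect described in the text.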


Example 6: Quotas and Tariffs 


Strategic interaction in an international context is affected
by countries’ trade policies. Exercise 8.4 showed how
a protection policy can help a domestic firm to gain an
edge in a domestic capacity-accumulation contest with a
foreign firm (when goods are costly to trade between
countries). More generally, subsidies, tariffs, and quotas
can be interpreted as the variable $K_1$ of the general
model. They have a non-negligible impact on the strategic

induces the domestic firm to expand its output, which 
induces the foreign firm to contract its own. That is, 
the export subsidy makes the domestic firm a top dog 
(to its advantage). The following exercises develop other 
examples. 


Exercise 8.8° “A foreign firm that competes in prices 
with a domestic firm in the domestic market suffers from 
facing a quota.” True or false? 


Exercise 8.9° Suppose that two firms, producing sub- 
stitute but differentiated products, compete in prices. (The 
equilibrium is unique and “stable,” and the profit functions 
are concave.) Show that a government-imposed floor on 
firm 1's price may increase that firm's profit. Explain. 


Example 7: Vertical Control 


The contracts signed between owners and managers or 
between manufacturers and their retailers influence com- 
petition between downstream units (managers or retail- 
ers) if these contracts are observable. For instance, Rey 
and Stiglitz (1986) show how exclusive territories may 
soften not only intrabrand competition but also inter- 
brand competition. Exclusive territories may allow firms 
to behave like puppy dogs in a price game.⁶² Bonanno
and Vickers (1986) show that in duopoly a manufacturer 
may prefer to sell his product through an independent 
retailer rather than directly to consumers, in order to in- 
duce more friendly behavior from the rival manufacturer 
(see also McGuire and Staelin 1983 and Moorthy 1987). 
For some general results on the link between observable
agency contracts and interbrand competition, see Fershtman
and Judd 1986 and Katz 1987.


Example 8: Tying 


Whinston (1987) reconsiders the old leverage theory, 
according to which tying may allow a firm with monopoly 
power in one market to monopolize a second market. His 
simplest model is as follows: Suppose there are two firms 
and two completely unrelated markets (the reasoning can 
be extended to the case in which the goods are comple- 
62. See review exercise 19. In a financial context, Brander and Lewis (1986)
show that the contract between a bank and a firm affects market competition.
In their model, a high level of debt makes a firm a top dog in a quantity
competition. See also Mathewson and Winter 1985 for a strategic analysis of
exclusive dealing.
ments). Market A is monopolized by firm 1. Consumers
all have willingness to pay $v$ for good A. Normalize the
demand in this market to be 1 (as long as the price does
not exceed $v$). Market B is a differentiated market and is
served by firms 1 and 2. Let $q_i = D_i(p_i, p_j)$ denote firm
$i$'s demand in this market. For simplicity, assume that
the consumers are the same and have unit demands in the
two markets, so that $D_i(\cdot, \cdot) < 1$. Let $c$ and $c_1$ denote firm
1's unit production costs in markets A and B. Does firm 1
have an incentive to tie its products? For simplicity,
assume that firm 1 offers the two products either separately
or together (i.e., that there is no mixed bundling, in
the language of chapter 4).

Suppose first that firm 1 takes price $p_2$ as given. This
situation arises when firms 1 and 2 choose their prices,
and firm 1 chooses whether to tie its two products simul-


least the tying profit by selling the two goods separately
at prices $v$ and $\tilde P_1 - v$, respectively:

$$(v - c) + [(\tilde P_1 - v) - c_1]\,D_1(\tilde P_1 - v, p_2) > (\tilde P_1 - c - c_1)\,D_1(\tilde P_1 - v, p_2),$$

where $D_1 < 1$. Absent strategic considerations (as is the
case here, where firm 1 takes $p_2$ as given and acts as a
monopolist on its residual-demand curve), tying generally
hurts firm 1 by reducing the number of degrees of freedom
in its pricing strategy. (We know from chapter 4 that
this conclusion is not general, as a monopolist may gain
from tying. On this, see also review exercise 27. But this
modeling choice will make the conclusions more striking
and help us to identify the strategic effect.)

An important property is that under pure bundling, the
fictitious price $\tilde p_1 = \tilde P_1 - v$ is lower than the price $p_1$ in
market B under no bundling, for any $p_2$. That is, bundling
shifts firm 1's reaction curve westward in market B. To see
this, note that under bundling $\tilde P_1$ maximizes

$$(\tilde P_1 - c_1 - c)\,D_1(\tilde P_1 - v, p_2),$$

which means that $\tilde p_1$ maximizes

$$\{\tilde p_1 - [c_1 - (v - c)]\}\,D_1(\tilde p_1, p_2).$$

In the absence of bundling, however, $p_1$ maximizes

$$(p_1 - c_1)\,D_1(p_1, p_2).$$

(The constant term (v — ¢) in firm 1's profit function can 
be ignored.) Thus, everything is as if bundling reduced 
firm 1's cost of producing in market B by v — c as far as
pricing in market B is concerned. This is very natural, as 
a unit loss of sales in market B costs v — c to firm 1 in 
market A under bundling, so that the “real” marginal cost 
of selling in market B is reduced by v — c. Now, we know 
from chapter 1 that a monopoly price increases with mar- 
ginal cost. The consequence of this is that a firm’s reaction 
curve in oligopoly shifts outward when the marginal cost 
increases (because this firm is a monopoly on its residual- 
demand curve). For any $p_2$, therefore, $\tilde p_1 < p_1$. Furthermore,
in this model bundling is formally identical to an
investment in cost reduction. Firm 1 pays a fixed investment
cost $v - c$ (which corresponds to the loss in revenue
from selling in market A separately) for a fictitious technology
that reduces its marginal cost in market B from
$c_1$ to

$$c_1 - (v - c).$$


As the number of units sold in market B is generally lower 
than the number sold in market A (D, <1), such an 
investment cannot be profitable in the absence of strate- 
gic considerations. 
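
The equivalence between bundling and a fictitious cost reduction can be checked on a small example. The sketch below assumes, purely for illustration, a linear demand D_1 = 0.8 - p_1 + 0.4 p_2 in market B and the parameter values v = 1, c = 0.5, c_1 = 0.2; none of these numbers, nor the function names, come from the text.

```python
ALPHA, BETA = 0.8, 0.4  # assumed demand parameters for market B


def demand_b(p1: float, p2: float) -> float:
    """Assumed linear demand for firm 1's good in market B."""
    return ALPHA - p1 + BETA * p2


def best_price(p2: float, unit_cost: float) -> float:
    """Price maximizing (p - unit_cost) * demand_b(p, p2), taking p2 as given."""
    return (ALPHA + BETA * p2 + unit_cost) / 2.0


def separate_outcome(p2: float, v: float, c: float, c1: float):
    """No bundling: good A sold at v, good B priced against cost c1."""
    p1 = best_price(p2, c1)
    profit = (v - c) + (p1 - c1) * demand_b(p1, p2)
    return p1, profit


def bundled_outcome(p2: float, v: float, c: float, c1: float):
    """Pure bundling: the fictitious B price is set as if cost were c1 - (v - c)."""
    p1_fictitious = best_price(p2, c1 - (v - c))
    bundle_price = p1_fictitious + v
    profit = (bundle_price - c - c1) * demand_b(p1_fictitious, p2)
    return p1_fictitious, profit


if __name__ == "__main__":
    p2, v, c, c1 = 0.5, 1.0, 0.5, 0.2
    print(separate_outcome(p2, v, c, c1))
    print(bundled_outcome(p2, v, c, c1))
```

Holding firm 2's price fixed, the fictitious price under bundling is lower than the unbundled price (here by (v - c)/2, a feature of the linear specification), and firm 1's profit is lower under bundling, as the text argues for the case without strategic considerations.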

Suppose now that firm 1 decides whether to bundle
before the two firms compete in prices. Thus, firm 1
first chooses to market the two goods separately (no
bundling) or together (pure bundling); then the two firms
choose their prices simultaneously. Think of a technological
decision concerning the packaging of the product or
(more likely in the case of complements) the decision
whether to make the product intended for market A
incompatible with firm 2's product.⁶³ Here bundling hurts
not only firm 1 but also firm 2, as it commits firm 1 to
charge a low fictitious price in market B. Thus, bundling
is not a good strategy if firm 2's entry or exit decision
is not at stake. It hurts firm 1 both directly and indirectly,
63. See example 9 for related arguments.
as it forces firm 2 to reduce its price. However, if firm


puter hardware and software; cameras, lenses, and film
processing; tape decks, amplifiers, and speakers). The
products in these lines cannot be consumed separately,
but they can be purchased individually: consumers can
“mix and match” products as long as they are compatible.
In contrast, a manufacturer that makes its system
incompatible with other systems imposes a de facto tie-in
(see chapter 4) of its various components.

Matutes and Regibeau (1986) analyze the compatibility
decisions of two producers of competing lines. They consider
a duopoly in which each firm produces two complementary
products, X and Y, which constitute a system.
Their model extends the differentiation on the line model
of chapter 7 to a two-dimensional case. The consumers
are uniformly located on a square of dimension 1, and


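[Figure 8.13: the unit square of consumer locations, with product X
on the horizontal axis and product Y on the vertical axis; firm 1 and
firm 2 sit at opposite corners.]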


Figure 8.13 


$$\tilde p_1 = p_1 + t(x_1 + y_1).$$

Similarly, the generalized price for the system sold by
firm 2 is

$$\tilde p_2 = p_2 + t[(1 - x_1) + (1 - y_1)].$$


Under incompatibility, the consumer chooses the system
with the lower generalized price, $\tilde p = \min\{\tilde p_1, \tilde p_2\}$. His
demand is then downward sloping: $q = a - b\tilde p$.⁶⁴ If the
systems are compatible and the products are sold separately,
the consumer can mix and match. For example, the
generalized cost of buying product X from firm 1 and
product Y from firm 2 is

$$p_1^X + p_2^Y + t[x_1 + (1 - y_1)],$$

where $p_1^X$ and $p_2^Y$ are respectively the price charged by
firm 1 for product X and the price charged by firm 2 for
product Y. By mixing, the consumer can choose among
four systems; product variety is increased. Again, the consumer
chooses the system with the lowest generalized
price.

Thus, as in chapter 7, the demand function for each 
firm's system (under incompatibility) or each firm's prod- 
uct (under compatibility) can be determined as a function 
of the prices. One can then solve for the Nash equilibrium 
in prices. Suppose, for simplicity, that the unit production 
cost is the same for both firms and both products. The


64. In chapter 7 we assumed that demand was equal to 1 (although we could
just as well have considered the case of a downward-sloping demand at each
location). For audio systems, the demand function is mainly a unit demand per
consumer. The way to justify a downward-sloping demand at a given location is
to envision a large number of consumers with different tastes for the system at
this location.
[Figure 8.14: two panels, “Incompatibility” and “Compatibility.”]


Figure 8.14 


equilibrium demands for each product are symmetric and
are as represented in figure 8.14 (assuming that the whole
market is covered in equilibrium). In the figure, $\{X_i, Y_j\}$
means that the consumer consumes good X from firm $i$
and good Y from firm $j$. Under compatibility, the consumers
located in the northwest and in the southeast
corner of the square buy a system that is better suited to
their preferences than the system they would buy under
incompatibility.

What are the incentives to achieve compatibility? First,
compatibility raises demand, because it makes products
better adapted to the consumers’ tastes. Second, compatibility
softens price competition, as Matutes and Regibeau
show. To see the latter, note that when firm 1 decreases
good X, it increases the demand for systems 
X, (because the generalized prices for those 


increase iry demand reduces firm 1's incentive to cut its 
s, the firms price their components less aggres- 


sively than they would do if the components were bundled 
in an incompatible system. These two effects imply that 
firms have a common interest in achieving compatibility.°° 


Remark  The desire for compatibility stems from the assumption
that the firms accommodate each other (i.e., do
not try to force each other out of the market). In contrast,
we know from Whinston's version of the leverage theory
(see example 8) that tie-ins may serve as a barrier to entry.
Similarly, a dominant firm that wants to induce its rival's
exit might well want to make its products incompatible
with those of the rival.⁶⁷ Incompatibility hurts the rival in
two ways: It reduces demand and it leads to more aggressive
price competition. It is therefore apt to induce exit.
Thus, a firm's optimal strategy (here, concerning the
compatibility decision) again hinges on whether it wants to
accommodate its rivals or to deter entry or induce exit.⁶⁸


8.5 Epilogue: Prices versus Quantities 


A crucial assumption in the interpretation of examples 1
through 9 is that prices are strategic complements and
quantities are strategic substitutes. This characterization is
particularly crucial in accommodation games, where firms
wanting not to look aggressive when they compete in
prices may take actions that will later turn them into
puppy dogs and where firms competing in quantities may
try to become top dogs in the future. It is therefore not
surprising that two-period price games (respectively,
quantity games) are often more collusive (respectively,
more competitive) than their static (one-period)
counterparts.⁶⁹ The strategies in an entry-deterrence or an
exit-inducement situation usually differ less between price and
quantity games than in an entry-accommodation situation.
As we saw in section 8.3, the important thing is then


66. [...] incompatibility. The consumers purchasing $X_1Y_2$ and $X_2Y_1$
under compatibility buy a more suitable system than under incompatibility but
pay a higher price, so the welfare analysis is ambiguous without further [...]
67. It has been alleged that IBM makes its products incompatible with those
of its rivals in order to maintain dominance.


68. See Ordover and Willig 1981 for a discussion of predatory incompatibility 
decisions. 


69. The infinite-horizon Markov analysis of Maskin and Tirole (1987, 1988b)
also emphasizes the role of the cross-partial $\Pi^i_{ij}$ and suggests that these results
are somewhat robust. There, repetition yields a collusive outcome in the price
game and a more competitive outcome than the Cournot one in the quantity
game.
to be tough. For instance, by reducing costs, one hurts
one's rival, whether competition is in price or in quantity.
The bottom line is that, before applying the above
taxonomy, one should look at the microstructure of the
[...] and determine the type of competition that is
waged.

The characterization of prices and quantities as strategic
complements and substitutes is a presumption, not a
law, as the following will show.


Quantities  Assume that profits have the exact Cournot
form (see chapter 5):

$$\Pi^i(q_i, q_j) = q_i P(q_i + q_j) - C_i(q_i),$$

where $C_i$ is firm i's cost function. A simple computation
yields the cross-partial derivative:

$$\Pi^i_{ij} = P' + q_i P''.$$

We already know that $P' < 0$. To obtain the strategic-substitute
property, it suffices that the price function be
linear ($P'' = 0$) or concave ($P'' < 0$). The property may
fail for sufficiently convex price functions.⁷⁰
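As a rough numerical illustration of this sign condition, the following sketch estimates the cross-partial $\Pi^i_{ij}$ by central finite differences. The function name `cournot_cross_partial`, the step size, and the two inverse-demand curves are assumptions made for the example and do not come from the text. The cost term is left out because it depends on $q_i$ only and therefore drops out of the cross-partial.

```python
def cournot_cross_partial(inverse_demand, q_i, q_j, h=1e-3):
    """Finite-difference estimate of d^2 Pi^i / (dq_i dq_j).

    Pi^i = q_i * P(q_i + q_j) - C_i(q_i); the cost term depends on q_i
    only, so it vanishes from the cross-partial and is omitted here.
    """
    def revenue(own, rival):
        return own * inverse_demand(own + rival)

    return (revenue(q_i + h, q_j + h) - revenue(q_i + h, q_j - h)
            - revenue(q_i - h, q_j + h) + revenue(q_i - h, q_j - h)) / (4 * h * h)


# Hypothetical linear inverse demand: P'' = 0, so the cross-partial equals P' = -1.
linear = lambda Q: 10.0 - Q
# Hypothetical convex inverse demand P(Q) = Q ** (-0.5).
convex = lambda Q: Q ** -0.5

linear_case = cournot_cross_partial(linear, 2.0, 3.0)
big_firm_case = cournot_cross_partial(convex, 9.0, 1.0)    # firm i is much larger
small_firm_case = cournot_cross_partial(convex, 1.0, 9.0)  # firm i is much smaller

print(linear_case, big_firm_case, small_firm_case)
```

Under these assumed curves, the linear case gives strategic substitutes, while the convex case gives a positive cross-partial for a sufficiently large firm and a negative one for a small firm.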
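Prices  Let $q_i = D_i(p_i, p_j)$ denote the demand curves. The
profit functions are

$$\Pi^i(p_i, p_j) = p_i D_i(p_i, p_j) - C_i(D_i(p_i, p_j)).$$

This yields the cross-partial derivative:

$$\Pi^i_{ij} = \frac{\partial D_i}{\partial p_j} + (p_i - C_i')\frac{\partial^2 D_i}{\partial p_i \partial p_j} - C_i''\,\frac{\partial D_i}{\partial p_i}\frac{\partial D_i}{\partial p_j}.$$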
As in the case of quantities, this cross-partial derivative
depends on the details of the demand function. Assume
that demand is linear (over the relevant range),

$$D_i(p_i, p_j) = a - b p_i + d p_j,$$

and that the marginal cost is constant. If the goods are
demand substitutes ($d > 0$), then $\Pi^i_{ij} > 0$, so the goods
are strategic complements. If they are demand complements
($d < 0$), then they are strategic substitutes. More
generally, if we assume that the goods are demand
substitutes, and we note that $p_i - C_i' > 0$ in equilibrium (from
firm i's first-order condition), it suffices that $\partial^2 D_i/\partial p_i \partial p_j$
be non-negative in order for the goods to be strategic
complements in the neighborhood of a price equilibrium.⁷¹
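The linear-demand case can be checked with a small sketch. With $D_i = a - b p_i + d p_j$ and constant marginal cost c, profit is $(p_i - c) D_i$, the cross-partial equals d, and the first-order condition gives a reaction curve with slope $d/(2b)$. The function names and parameter values below are illustrative assumptions, not taken from the text.

```python
def price_best_response(p_j, a, b, d, c):
    """Firm i's profit-maximizing price against the rival price p_j.

    Demand is D_i = a - b*p_i + d*p_j with constant marginal cost c, so
    profit (p_i - c) * D_i is concave in p_i (for b > 0) and the
    first-order condition gives p_i = (a + d*p_j + b*c) / (2*b).
    """
    return (a + d * p_j + b * c) / (2.0 * b)


def reaction_slope(a, b, d, c, p_low=1.0, p_high=2.0):
    """Slope of the reaction curve between two rival prices."""
    rise = (price_best_response(p_high, a, b, d, c)
            - price_best_response(p_low, a, b, d, c))
    return rise / (p_high - p_low)


# Hypothetical parameter values, chosen only for illustration.
slope_substitutes = reaction_slope(a=10.0, b=2.0, d=1.0, c=1.0)   # d > 0
slope_complements = reaction_slope(a=10.0, b=2.0, d=-1.0, c=1.0)  # d < 0
print(slope_substitutes, slope_complements)
```

An upward-sloping reaction curve corresponds to strategic complements (demand substitutes), and a downward-sloping one to strategic substitutes (demand complements).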


70. For instance, Bulow et al. (1985b) note that for $P(q_1 + q_2) = (q_1 + q_2)^{-\alpha}$
with $0 < \alpha < 1$, $\Pi^1_{12}$ is proportional to $\alpha - q_2/q_1$. Thus, if because of cost
differences the equilibrium involves a big firm and a small firm ($q_1/q_2$ very large),
an increase in the small firm's output raises the big firm's optimal reaction to this output.


71. It is easy to construct examples in which this property is not satisfied.
Furthermore, the goods generally are not strategic complements in prices over
the whole range of potential prices, as Maskin and Tirole (1988) note. To see
this, assume that the goods are fairly good demand substitutes. Fix $p_j$ and let
$p_i$ vary. For $p_i \gg p_j$, firm j obtains the whole demand, and firm i's demand
and profit are not affected much by a unit change in firm j's price. (In the limit,
with perfect substitutes, demand remains 0 and thus is not affected at all.) So
$\Pi^i_j$ is very small. Similarly, when $p_i \ll p_j$, a unit change in $p_j$ has little effect on
firm i's demand and profit; again, $\Pi^i_j$ is very small. When $p_i$ is close to $p_j$, a unit
change in $p_j$ has a big effect on firm i's demand and profit (think of perfect
substitutes); hence, $\Pi^i_j$ is big. Thus, $\Pi^i_j$ cannot be monotonic in $p_i$. Now, this did
not matter in our applications, because the second-period simultaneous-move
price equilibrium occurred in the region where $\partial D_i/\partial p_j$ is large and $\Pi^i_{ij}$ is
positive. In more dynamic games, this may have some relevance. For instance,
in Maskin and Tirole 1988a,b the reaction curves are monotonic (downward
sloping) in the quantity game and nonmonotonic in the price game.
8.6 Supplementary Section: Strategic Behavior and Barriers to Entry or Mobility


This section, which covers some of the recent research on
barriers to entry, serves two purposes. First, on a technical
level, it goes beyond the somewhat contrived two-period
model of sections 8.2-8.4 to analyze the full-fledged
dynamic interaction between firms. Second, and perhaps
more importantly, it studies in detail two distinct barriers
to entry. Section 8.6.1 compares short-term and long-term
capital accumulation. (The analysis follows Fudenberg and
Tirole 1986 and 1987.) Section 8.6.2 deals with differentiated
markets; it shows how a firm may want to preempt
its rivals to occupy the profitable market niches and how
a firm can use product proliferation to restrict entry.⁷²


8.6.1 Capital Accumulation 


The commitment value of capital is higher the longer
its lifetime and the more costly its disposal or resale.
Thus, the extent to which capital is sunk determines the
monopoly power and profit enjoyed by established firms.
We will examine two polar cases here: one in which
investments are sunk only in the very short run and one
in which capital cannot be resold and does not depreciate
(i.e., is completely sunk).


8.6.1.1 Short-Term Capital Accumulation and
Contestability


This subsection explores two related investment-based
dynamic models of a natural monopoly. In these models
there is room for only one firm in the market, and there
is actually a single firm in equilibrium. This firm makes
a profit and deters entry through capital accumulation.
Capital is sunk only in the short run and must be "renewed"
periodically. The length of time over which capital
is sunk determines the period of commitment. When
commitment is short, the established firm enjoys only
small incumbency advantages over potential entrants
(because an entrant can kick the incumbent out of the market
quickly). Thus, it must accumulate capital to deter entry.
In the limit for very short commitments, the incumbent
firm makes almost no profit, so Posner's rent-dissipation
postulate (which says that monopoly profit is dissipated
through competition; here, potential competition) is
satisfied for very-short-run commitments. Posner's
wastefulness postulate (according to which profits are dissipated
in a socially wasteful way) may or may not hold, depending
on whether the incumbent's capital is excess capital or
contributes to production.


Wasteful Rent Dissipation 


The first theory of short-run commitments, developed by
Eaton and Lipsey (1980), considers an industry with two
firms. Time is continuous, and the horizon is infinite.
One unit of capital (e.g., a plant) is necessary for production
and gives access to constant marginal cost, c. A
second unit of capital is useless in the sense that it does
not reduce the marginal cost of production. One unit of
capital costs f per unit of time and has deterministic
durability H (after the unit of capital is installed, it undergoes
no physical depreciation for H units of time and full
depreciation thereafter⁷³). The fixed cost of production
(equal to $\int_0^H f e^{-rt}\,dt$, where r is the rate of interest) is paid
when the unit is installed, so the firm cannot avoid paying
the fixed cost by leaving the market before H units
of time have elapsed. Therefore, with equipment of age
$t < H$, the firm never has an incentive to leave the market,
even if another firm enters. Thus, H is a measure of
commitment.

If at date t only one firm is active (i.e., has at least one
unit of capital), that firm's flow profit, gross of capital
cost, is

$$\Pi^m = \max_q \{P(q)q - cq\}.$$


Suppose that $f < \Pi^m < 2f$. A monopoly is feasible,
because $\Pi^m > f$. If two firms operate (i.e., have at least one
unit of capital each), they wage Bertrand competition
with marginal cost c and make zero gross profit; thus,
each loses f per unit of time. The Bertrand assumption
is meant to simplify computations. More generally, the
firms could make a positive gross duopoly profit; the
assumption $\Pi^m < 2f$ would still guarantee a negative net
profit for at least one of them, since the gross monopoly


72. Readers not familiar with dynamic games may want to skip subsection
8.6.1 in a first reading; it is technically more difficult than the rest of the section.
73. This is the "one-horse shay" manner of depreciation.


profit is an upper bound on gross industry profits under [...]

[...] the incumbent buys a new unit of capital $\Delta$
($< H/2$) years before its current unit depreciates. The
other firm invests in a unit of capital if the incumbent has [...]

In equilibrium, the length $\Delta$ is chosen such that when the
incumbent's unit of capital is $H - \Delta$ old, the potential
entrant is indifferent between entering and not entering. [...]
forever, and the entrant makes no profit. If he enters, he
makes a profit of $-f$ for $\Delta$ years (because the incumbent
is still committed: The fixed cost on his current unit is
sunk) and enjoys monopoly profit forever after. The

incumbent's investment path is represented in figure 8.15.
Along the equilibrium path, the incumbent always renews
his capital before it depreciates. The potential entrant
never enters; he is kept out of the market by the
incumbent's commitment to stay in for at least $\Delta$ years after
entry (which inflicts short-term losses on the entrant).

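Let us now compute $\Delta$. In equilibrium, the incumbent's
present discounted profit from date 0 on (or from any
date at which he buys one new unit of capital) is

$$V = \int_0^{H-\Delta} \Pi^m e^{-rt}\,dt - \int_0^{H} f e^{-rt}\,dt + e^{-r(H-\Delta)}\,V. \qquad (8.1)$$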


Figure 8.15
Incumbent's equilibrium investment strategy.

Now suppose the potential entrant wants to enter.
Obviously, there is no point in entering strictly before the
incumbent buys a second unit of capital (because the net
duopoly profit flow is negative), so the entrant will wait
and preempt the incumbent just before the latter buys its
second unit (i.e., when the incumbent's current unit is
$H - \Delta$ old). If the entrant does so, the incumbent does
not buy a second unit, but he sticks around with his old
unit for $\Delta$ units of time before exiting the market. The
entrant's profit from the entry date on is thus equal to V
minus the monopoly profit forgone during the entrant's
first $\Delta$ years of existence (that is, the only difference
between the incumbent at date 0 and an entrant who
takes over is that the entrant is in a duopoly situation for
$\Delta$ units of time):


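$$V - \int_0^{\Delta} \Pi^m e^{-rt}\,dt = V - \Pi^m\,\frac{1 - e^{-r\Delta}}{r}.$$

Because the second unit of capital is costly and is
useless for productive purposes, the incumbent chooses $\Delta$
as small as is consistent with deterring entry:

$$V - \Pi^m\,\frac{1 - e^{-r\Delta}}{r} = 0 \qquad (8.2)$$

or, substituting V,

$$\frac{\Pi^m}{f} = \frac{1 - e^{-rH}}{e^{-r\Delta} - e^{-rH}}. \qquad (8.3)$$

Under our assumptions, equation 8.3 implies that $\Delta < H/2$.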

We are particularly interested in what happens in cases
of very short commitments. Let H (and thus $\Delta$) tend
to zero. Performing first-order Taylor approximations on
equation 8.3, we get

$$\frac{\Pi^m}{f} \approx \frac{rH}{rH - r\Delta} = \frac{H}{H - \Delta}. \qquad (8.4)$$


This means that the incumbent has two units of capital
a fraction

$$\frac{\Delta}{H - \Delta} = \frac{\Pi^m - f}{f}$$

of the time. More interesting, perhaps, equation 8.2 yields

$$V \to 0. \qquad (8.5)$$


Thus, even though there is only one firm in the industry
in equilibrium, this firm does not make profits. The
monopoly rent is entirely dissipated by the accumulation
of a second unit of capital. This is natural. If the value V
of being a monopolist were large, the entrant would come
in, lose money for a very short period of time (because H
is small and therefore $\Delta$ is small), and capture V. Thus, for
short commitments, potential competition drives the
incumbent's profit down to zero.

Hence, for short-run commitments, we do not obtain
the contestability outcome. Actually, we obtain Posner's
wasteful-rent-dissipation postulate exactly: The monopolist
charges the monopoly price and yet makes no profit.
The welfare loss per unit of time is equal to the
lost consumer surplus (see the triangle in figure 1.2)
plus the net monopoly profit ($\Pi^m - f$) (see chapter 1).
This should not be surprising; the only possible avenue
for rent dissipation in this model is excess capital.
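To make the limit argument concrete, here is a minimal numerical sketch of the entry-indifference condition. It assumes, as in the model above, that the fixed cost $f(1 - e^{-rH})/r$ is paid when a unit is installed and that the incumbent renews every $H - \Delta$; setting the entrant's payoff in equation 8.2 to zero then gives $e^{-r\Delta} = e^{-rH} + (f/\Pi^m)(1 - e^{-rH})$. The function names and the parameter values are hypothetical and serve only as an illustration.

```python
import math


def renewal_lead_time(monopoly_profit, f, r, H):
    """Delta that leaves the entrant indifferent (closed form).

    Assumes the fixed cost f*(1 - exp(-r*H))/r is paid at installation
    and that the incumbent renews every H - Delta, which yields
    exp(-r*Delta) = exp(-r*H) + (f / monopoly_profit) * (1 - exp(-r*H)).
    """
    if not f < monopoly_profit < 2 * f:
        raise ValueError("requires f < monopoly_profit < 2f")
    decay = math.exp(-r * H)
    return -math.log(decay + (f / monopoly_profit) * (1.0 - decay)) / r


def incumbent_value(monopoly_profit, f, r, H, delta):
    """Present discounted profit of a firm that renews every H - delta."""
    cycle = 1.0 - math.exp(-r * (H - delta))
    revenue = monopoly_profit * cycle / r
    capital_cost = f * (1.0 - math.exp(-r * H)) / r
    return (revenue - capital_cost) / cycle


# Hypothetical numbers: monopoly profit 1.5, capital cost 1 per unit of time.
PI_M, F, R = 1.5, 1.0, 0.05
for H in (10.0, 1.0, 0.01):
    delta = renewal_lead_time(PI_M, F, R, H)
    value = incumbent_value(PI_M, F, R, H, delta)
    # Columns: H, Delta, share of time with two units, incumbent's value V.
    print(H, delta, delta / (H - delta), value)
```

As H shrinks, the printed ratio $\Delta/(H - \Delta)$ should approach $(\Pi^m - f)/f$ and the value V should approach zero, in line with equations 8.4 and 8.5.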


Contestability 


Our second model is due to Maskin and Tirole (1988).
Though it is similar in spirit to the first model, the
formulation differs in some respects. Time is discrete, and the
horizon is infinite. There are two firms, which compete
in capacities. A capacity is locked in for two periods once
chosen. Let $\Pi(K_1, K_2)$ denote the per-period profit of a
firm with capacity $K_1$ when its rival has capacity $K_2$ (gross
of the per-period fixed cost f). As usual, $\Pi$ decreases with
$K_2$, and the cross-partial derivative $\partial^2 \Pi/\partial K_1 \partial K_2$ is negative
(capacities are strategic substitutes). The period length
is T, and the discount factor between periods is $\delta = e^{-rT}$.

The firms choose their capacities sequentially. (The
model actually is equivalent to a continuous-time model
in which firms choose capacities K which, as in Eaton and
Lipsey 1980, depreciate in a "one-horse shay" manner,
but according to a Poisson process; i.e., H is stochastic.⁷⁴)
Firm 1 picks capacities in odd periods and firm 2 in even
periods.⁷⁵ A firm picks a capacity for two periods of
production and sinks in the first of these two periods a
fixed cost: $f(1 + \delta)$ for capacity $K > 0$, zero for $K = 0$.
Let

$$\Pi^m = \max_K [P(K)K - (c + c_0)K],$$

where c is the marginal cost of production and $c_0$ is the
marginal cost of installing capacity. As before, assume
that $f < \Pi^m < 2f$. Thus, one firm is viable, but not two.
Strategies are required to be "Markov" (i.e., payoff
relevant); that is, firm i reacts to the capacity $K_j$ chosen by


74. [...] period of commitment to that capital is stochastic. The probability
that commitments will lapse between date t and date $t + \Delta t$ is independent
of time and is equal to $\lambda \Delta t$. One can think of this technology as an uncertain
[...] (the time independence of depreciation is clearly an extreme
assumption). Letting $V^i(K_j)$ (respectively, $W^i(K_i)$) denote the present discounted
value of firm i's profit when firm i renews its capital and reacts to firm j's [...]


V'(K;) = mpax((T1'(K, Ky) — PAP 
+ At WK 


+ (1 — AANVEK Je) 
Thus, the continuous-time model is equivalent to the discrete-time,
sequential-move model with gross profit function
MK, Ky) 
1 a Ky 
MK, Ky) = ee 


and discount factor 


For the dynamic programming equations in the discrete time framework, see 
the next subsection. 


75. When we "endogenize" timing by letting firms choose their capacities
whenever they want, subject to the constraint that a capacity is locked in for
two periods once chosen, the symmetric equilibrium is the same as described
below. (Another way of endogenizing the timing is given in note 74.)
its rival in the last period and still in place for the current period.⁷⁶


As in Eaton and Lipsey 1980, there exists a unique
symmetric equilibrium. For $\delta$ sufficiently large, it takes
the following form (illustrated in figure 8.16): In equilibrium
only one firm operates, at capacity level $K^*$. A firm
chooses to enter if and only if its rival's capacity is
less than the entry-deterring capacity $K^*$; if it enters, it
accumulates capacity $K^*$ itself. In equilibrium, $K^*$ is such
that the entrant is indifferent between entering and not
entering:

$$[\Pi(K^*, K^*) - f] + \frac{\delta}{1 - \delta}[\Pi(K^*, 0) - f] = 0. \qquad (8.6)$$

This equation reflects the fact that the entrant gets
$\Pi(K^*, K^*) - f < 0$ when entering (recall that $2\Pi(K, K) \le \Pi^m < 2f$).
[...], and the entrant becomes a monopolist who
keeps deterring entry by choosing $K^*$ forever. Then the
entrant's future profits are given by

$$\delta[\Pi(K^*, 0) - f] + \delta^2[\Pi(K^*, 0) - f] + \cdots = \frac{\delta}{1 - \delta}[\Pi(K^*, 0) - f].$$


The incumbent chooses its own capacity so as to just
deter entry; accumulating beyond $K^*$ is costly because, as
we will see shortly, $K^*$ already exceeds the monopoly
capacity.

Summary: The equilibrium involves a single firm,
operating at a capacity level $K^*$.⁷⁷ This firm engages in
some form of limit pricing. It accumulates more capacity
than a monopolist facing no threat of entry would. It
therefore charges a price that is less than the monopoly
price (see below).

We will now investigate the case of short-run commitments
(where T tends to 0, i.e., $\delta$ tends to 1). From
equation 8.6 we see that when $\delta$ converges to 1, $\Pi(K^*, 0) - f$
converges to 0. That is, the monopolist's profit
converges to 0. (Note in particular that $K^*$ exceeds $K^m$.) The
intuition for this rent-dissipation result is the same as
before. The established firm enjoys an important
incumbency advantage only if it can inflict a duopoly loss on
the entrant for a sufficient amount of time. The entrant is
lured by the prospect of becoming a monopolist after
only a brief fight, so the incumbent must raise its capacity
to deter entry.

Figure 8.16

An important difference from the previous model is
that the rent dissipation need not be wasteful. Indeed, if
the established firm's capacity $K^*$ is used (so output q
is equal to $K^*$), the dissipation is socially useful. Rent
dissipation occurs through price reduction rather than
through excess capacity. The outcome in the limit is the
one predicted by the contestability school (see section
8.1). Whether the monopolist uses all of his capacity $K^*$ is
an empirical question. As in chapter 5, $K^*$ installed is used
if the marginal cost of investment $c_0$ is sufficiently large
relative to the marginal cost of production c.

For smaller discount factors, entry is blockaded. That is,
by accumulating the monopoly capacity, the established
firm deters entry.
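The entry-indifference condition 8.6 can be solved numerically once a profit function is specified. The sketch below assumes a hypothetical per-period gross profit $\Pi(K_1, K_2) = K_1(d - K_1 - K_2)$, so that $K^m = d/2$ and $\Pi^m = d^2/4$; this functional form, the function name, and the parameter values are illustrative assumptions rather than part of the model as stated.

```python
def entry_deterring_capacity(f, delta, d=1.0, tol=1e-12):
    """Solve equation 8.6 for K* by bisection.

    Assumes the hypothetical per-period gross profit
    Pi(K1, K2) = K1 * (d - K1 - K2), so K^m = d/2 and Pi^m = d**2 / 4.
    Returns K^m when entry is already blockaded at the monopoly capacity.
    """
    monopoly_profit = d * d / 4.0
    if not f < monopoly_profit < 2 * f:
        raise ValueError("requires f < Pi^m < 2f")
    if not 0.0 < delta < 1.0:
        raise ValueError("requires 0 < delta < 1")

    def profit(k_own, k_rival):
        return k_own * (d - k_own - k_rival)

    def entrant_payoff(k):
        # Left-hand side of 8.6: one duopoly period, then monopoly at k forever.
        return (profit(k, k) - f) + delta / (1.0 - delta) * (profit(k, 0.0) - f)

    low = d / 2.0                                 # monopoly capacity K^m
    if entrant_payoff(low) <= 0.0:
        return low                                # entry is blockaded
    high = (d + (d * d - 4.0 * f) ** 0.5) / 2.0   # capacity with zero monopoly profit
    # entrant_payoff is decreasing on [low, high], positive at low, negative at high.
    while high - low > tol:
        mid = (low + high) / 2.0
        if entrant_payoff(mid) > 0.0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


# Hypothetical fixed cost f = 0.2 with d = 1 (so Pi^m = 0.25).
for discount in (0.3, 0.9, 0.99):
    k_star = entry_deterring_capacity(0.2, discount)
    print(discount, k_star, k_star * (1.0 - k_star) - 0.2)
```

Under these assumptions, the computed capacity equals the monopoly capacity for a small discount factor and rises toward the zero-profit capacity as the discount factor approaches 1.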


No Fixed Cost: The Dynamics of Cournot Competition 


In the two preceding subsections we assumed the exis- 
tence of large fixed costs, which made the industry a 
natural monopoly. The incumbent firm overinvested to 
deter entry. In the absence of fixed costs (or in the pres- 
ence of low fixed costs), there is room for two firms. 


76. In other words, the strategies do not depend on the payoff-irrelevant
history of the game.


77. In both the Eaton-Lipsey model and the Maskin-Tirole model, there are
two asymmetric Markov-perfect equilibria for sufficiently short commitments.
In these equilibria, one of the firms enjoys unconstrained monopoly power in a
steady state (that is, it does not renew its capital early in the first model, and it
accumulates $K^m$ in the second). This firm never quits the market, and it reacts to
entry by assuming that the entrant will exit once its commitment has elapsed.
This aggressive behavior is self-fulfilling and ends up deterring entry.
Rather than deter entry, the firms accommodate each
other. This subsection analyzes accommodation in an
industry with short-run commitments and presents the [...]

Consider the sequential-move capacity competition
model of the preceding subsection, but assume that the
firms incur no fixed cost ($f = 0$). (The analysis here
follows one presented by Maskin and Tirole [1987] and is
based on the earlier model of Cyert and DeGroot [1970].)
Firm 1 chooses capacities in odd periods (which are locked
in for two periods and can be freely changed after two
periods), and firm 2 chooses capacities in even periods.
Firm i's intertemporal profit at time t is


$$\sum_{s=0}^{\infty} \delta^s\,\Pi^i(K_1(t+s), K_2(t+s)).$$

As before, we make the usual assumptions on the profit
function: $\Pi^i_{ii} < 0$, $\Pi^i_j < 0$, $\Pi^i_{ij} < 0$. We look for a pair
of dynamic reaction functions, $R_1(\cdot)$ and $R_2(\cdot)$, that form
a Markov perfect equilibrium. Thus, if firm 2's current
(locked-in) capacity is $K_2$, firm 1 reacts by choosing
capacity $K_1 = R_1(K_2)$ to maximize its present discounted
profit [...]

As in section 6.7, let $V^i(K_j)$ denote the present
discounted profit of firm i when it reacts to its rival's
capacity $K_j$, and let $W^i(K_i)$ be firm i's present discounted
profit when it is locked into $K_i$ and its rival reacts. The

equilibrium conditions are the following:

$$V^1(K_2) = \max_K \{\Pi^1(K, K_2) + \delta W^1(K)\}, \qquad (8.7)$$

$$R_1(K_2) \text{ maximizes } \{\Pi^1(K, K_2) + \delta W^1(K)\}, \qquad (8.8)$$

$$W^1(K_1) = \Pi^1(K_1, R_2(K_1)) + \delta V^1(R_2(K_1)), \qquad (8.9)$$

and similarly for firm 2.
The first common feature with traditional analysis is
that, because capacities are strategic substitutes ($\Pi^i_{ij} < 0$),
the reaction curves are downward sloping. To show this,
it suffices to write the optimality of reaction functions (an
identical technique is used to prove the monotonicity of
incentive-compatible allocations in incentive problems).
Consider two capacity levels, $K_2$ and $\tilde{K}_2$. Let $R_1(K_2)$ and
$R_1(\tilde{K}_2)$ denote the optimal reactions to $K_2$ and $\tilde{K}_2$. By
definition, $R_1(K_2)$ is a better response to $K_2$ than $R_1(\tilde{K}_2)$:

$$\Pi^1(R_1(K_2), K_2) + \delta W^1(R_1(K_2)) \ge \Pi^1(R_1(\tilde{K}_2), K_2) + \delta W^1(R_1(\tilde{K}_2)). \qquad (8.10)$$

Similarly, $R_1(\tilde{K}_2)$ is a best response to $\tilde{K}_2$:

$$\Pi^1(R_1(\tilde{K}_2), \tilde{K}_2) + \delta W^1(R_1(\tilde{K}_2)) \ge \Pi^1(R_1(K_2), \tilde{K}_2) + \delta W^1(R_1(K_2)). \qquad (8.11)$$

Adding equations 8.10 and 8.11, we obtain

$$\Pi^1(R_1(K_2), K_2) - \Pi^1(R_1(\tilde{K}_2), K_2) + \Pi^1(R_1(\tilde{K}_2), \tilde{K}_2) - \Pi^1(R_1(K_2), \tilde{K}_2) \ge 0, \qquad (8.12)$$

which is equivalent to

$$\int_{\tilde{K}_2}^{K_2} \int_{R_1(\tilde{K}_2)}^{R_1(K_2)} \Pi^1_{12}(x, y)\,dx\,dy \ge 0. \qquad (8.13)$$

But, by assumption, $\Pi^1_{12} < 0$. Thus, equation 8.13 implies
that $R_1(K_2) \le R_1(\tilde{K}_2)$ if $K_2 > \tilde{K}_2$. The reaction curves are
necessarily downward sloping.

To find equilibrium reaction functions, we must solve
the system of equations 8.7-8.9.⁷⁸ For quadratic profit
functions, such as

$$\Pi^i = K_i(d - K_i - K_j),$$

there exists a particularly simple solution. Each firm's
reaction function is linear in its rival's capacity: $R_1 = R_2 = R$,
where $R(K) = a - bK$. This solution also has the remarkable
property that it is the limit of each firm's reaction
function at any date when the horizon is finite but tends
to infinity.⁷⁹


78. To find a differentiable solution (if such a solution exists), we can
differentiate equation 8.9 and take the first-order condition in 8.7. After some
substitutions, we obtain a system of difference-differential equations in the
two reaction functions. This system is generally hard to solve, but is easy in the
case of quadratic profit functions.


79. The finite-horizon solution is too complex to be derived in closed form.
Cyert and DeGroot (1970) computed it numerically. To show convergence
toward the infinite-horizon, linear, Markov-perfect equilibrium, one
shows that the finite-horizon solution belongs to the class of linear reaction
functions with slopes between $-1/2$ and 0 and intercepts between 0 and d, that
it is obtained by backward induction through a contraction mapping in the
space of such functions, and that the fixed point of the contraction mapping
(which is the limit of the reaction function for large horizons) satisfies the
difference-differential equations for $(R_1, R_2)$ derived from equations 8.7 through
8.9.
Figure 8.17

The dynamics of the game are illustrated in figure 8.17.
The solid lines depict the dynamic reaction functions for
$\delta$ in (0,1), the broken lines represent the static Cournot
reaction functions $R_1^0$ and $R_2^0$, E denotes the steady-state
allocation, and C denotes the Cournot outcome.

For $\delta = 0$, the firms are myopic. They react according
to the static reaction function

$$R^0(K) = \frac{d}{2} - \frac{K}{2},$$

which maximizes $R(d - K - R)$. Thus, $a = d/2$ and $b = 1/2$.
The industry dynamics are then called the tâtonnement
process. The steady state is the Cournot allocation C. For
$\delta > 0$, each firm takes into account not only its current
profit but also its rival's future reaction. Because reaction
curves are downward sloping, the intuition is that a firm
should invest beyond its short-run interest so as to induce
its rival to curtail capacity (as in the Stackelberg game
of section 8.2). Indeed, it can be shown that when $\delta$
increases, the steady-state symmetric level of capacity,
given by $K = a - bK$ or $K = a/(1 + b)$, increases and
thus moves away from the Cournot level. The process is
dynamically stable: for any initial level of capacity, the
capacities of the two firms converge to the steady-state
capacities. This generalizes the Cournot tâtonnement
process in that each firm rationally anticipates the influence
of its capacity choices on its rival's behavior.
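The linear solution can be explored numerically. The sketch below assumes the quadratic profit $\Pi^i = K_i(d - K_i - K_j)$ and a linear reaction function $R(K) = a - bK$. Substituting this guess into the first-order condition of 8.7, using 8.9 and the envelope condition, gives a slope equation $\delta^2 b^4 + 2\delta b^2 - 2(1 + \delta)b + 1 = 0$ and an intercept $a = (1 + \delta) b d / [1 + \delta b - \delta^2 b^2 (1 - b)]$. These two expressions are a derivation made for this example, not formulas quoted from the text, and the function names are hypothetical.

```python
def linear_mpe(delta, d=1.0, tol=1e-14):
    """Coefficients (a, b) of the linear reaction function R(K) = a - b*K.

    Assumes Pi^i = K_i * (d - K_i - K_j). The slope b is the root in
    (0, 1/2] of delta^2 b^4 + 2 delta b^2 - 2 (1 + delta) b + 1 = 0,
    found by bisection; the polynomial is decreasing on that interval.
    """
    if not 0.0 <= delta < 1.0:
        raise ValueError("requires 0 <= delta < 1")

    def slope_condition(b):
        return delta ** 2 * b ** 4 + 2 * delta * b ** 2 - 2 * (1 + delta) * b + 1

    low, high = 0.0, 0.5
    while high - low > tol:
        mid = (low + high) / 2.0
        if slope_condition(mid) > 0.0:
            low = mid
        else:
            high = mid
    b = (low + high) / 2.0
    a = (1 + delta) * b * d / (1 + delta * b - delta ** 2 * b ** 2 * (1 - b))
    return a, b


def simulate(delta, k1, k2, periods=200, d=1.0):
    """Alternating moves: firm 1 reacts in odd periods, firm 2 in even ones."""
    a, b = linear_mpe(delta, d)
    for t in range(1, periods + 1):
        if t % 2 == 1:
            k1 = a - b * k2
        else:
            k2 = a - b * k1
    return k1, k2


for discount in (0.0, 0.5, 0.9):
    a, b = linear_mpe(discount)
    # Columns: delta, intercept a, slope b, steady-state capacity a / (1 + b).
    print(discount, a, b, a / (1 + b))
print(simulate(0.9, 0.0, 0.8))
```

With $\delta = 0$ the sketch reproduces the static reaction function ($a = d/2$, $b = 1/2$) and the Cournot steady state $d/3$; for positive $\delta$ the steady-state capacity is larger, and the simulated path converges to it from an arbitrary starting point.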

The moral of such a simple infinite-horizon model is
that the intuitions derived in two-period models carry
over: Strategic substitutes yield downward-sloping reaction
curves, so each firm overinvests for strategic reasons.
The outcome can be thought of as symmetric Stackelberg
leadership.


8.6.1.2 Long-Term Capital-Accumulation Games 


In the other polar case, investment creates a long-term 
commitment to be in the market. Specifically, we assume 
that investment, once in place, does not depreciate and 
cannot be resold. That is, investment is irreversible. The 
following model is due to Spence (1979); the version 
presented here is from Fudenberg and Tirole 1983b. 

Consider a duopoly, with firms indexed by $i = 1, 2$.
Time is continuous, and the horizon is infinite. Firm i's
flow profit at any time t, gross of investment expenditures,
is given by

$$\Pi^i(K_1(t), K_2(t)),$$

where $K_i(t)$ is firm i's capital stock at date t (as usual,
$\Pi^i_{ii} < 0$, $\Pi^i_j < 0$, and $\Pi^i_{ij} < 0$).

Capital at date t is equal to cumulative investment to
date:

$$K_i(t) = \int_0^t I_i(s)\,ds,$$

where $I_i(t)$ is the rate of investment. It is assumed that the
cost of investment is linear. One unit of investment costs
$1. To avoid instantaneous investment at date 0, we
bound each firm's investment above by $\bar{I}_i$. This
technology is an example of convex investment cost.
Investment must be non-negative, and there is no depreciation.
Thus, the capital stocks are nondecreasing. Firm i's net
profit at time t is

$$\Pi^i(K_1(t), K_2(t)) - I_i(t).$$


Firm i's strategy is a path of investment $\{I_i(t)\}$ satisfying
$0 \le I_i(t) \le \bar{I}_i$. Each firm's investment at date t depends
on the current capital stocks $(K_1(t), K_2(t))$ (again, we
assume that the strategies are of the Markov type, in
that they depend only on the payoff-relevant state of
the game and not on the whole history). Both firms enter
the market at time $t = 0$ without any capital.

Firm i's objective function is equal to its present
discounted profit:

$$\int_0^{\infty} [\Pi^i(K_1(t), K_2(t)) - I_i(t)]\,e^{-rt}\,dt.$$
In this subsection we will only consider the limit game in
which both firms become infinitely patient (that is, r tends
to 0). In this case, the firms maximize their time-average
payoffs, so that only the eventual steady-state capital
levels matter (no firm will choose to invest forever). Thus,
firm i's objective function is $\Pi^i(K_1^{ss}, K_2^{ss})$, where ss stands
for steady state. [...]
the private cost of investment and focus on its strategic
[...] to use a simple diagrammatic approach.⁸¹


[...] exactly the same as if both firms built their entire capital
stocks at the start (because there is no discounting). In the
resulting "Cournot" equilibrium, each firm invests to the
point at which the marginal productivity of capital equals
zero, given the steady-state capital level of its opponent.
All of the many different paths that lead to this steady
state are precommitment equilibria. For example, each
firm's strategy could be to invest as quickly as possible
to its Cournot level. We can highlight the similarity of
this solution to a Cournot equilibrium by defining the
"steady-state reaction curves" that give each firm's desired
steady-state capital level as a function of its rival's
steady-state capital level. Under our assumptions, these
reaction curves look the same as usual "nice" Cournot
reaction curves. The reaction curves $R_1$ and $R_2$ are
displayed in figure 8.18, where IGP is the investment-growth
path (the path along which both firms are investing as
rapidly as they can). The precommitment (open-loop)
equilibrium is at $C = (C_1, C_2)$, the intersection of the two
curves. We have seen that the use of the concept of
precommitment transforms an apparently dynamic game
into a static one. As a modeling strategy, this transformation
is ill-advised: "... one should not allow precommitment
to enter by the back door.... If it is possible, it
should be explicitly modeled ... as a formal choice in the
game." (Kreps and Spence 1984)

Now allow firm i's investment at time t to depend on
the capital stocks at that time (the firms employ closed-loop
strategies). The capital stocks are the "state variables"
(i.e., the capital stocks at any date and the investment
programs from that date on are all the information
one needs to compute the payoffs). A Markov perfect
equilibrium is a pair of Markov strategies

$$\{I_i(K_1(t), K_2(t))\}_{i=1,2}$$

that form a "closed-loop" Nash equilibrium from any
possible initial state $(K_1^0, K_2^0)$ and not only from the initial
state (0, 0).

Consider figure 8.19, which depicts a Markov perfect
equilibrium. The arrows indicate the direction of motion.
The motion is vertical if only firm 2 is investing, horizontal
if only firm 1 is investing, [...] as quickly as possible,
and nonexistent if neither firm invests. [...]
which firm 2 has a head start. Thus, this equilibrium
demonstrates how an advantage in investment
speed or initial conditions can be exploited. The conditions
of the growth phase (which firm got there first, the
costs of adjustment, and so on) have a permanent impact [...]

Figure 8.18

Figure 8.19

80. [...] of the discounting case, see Fudenberg and Tirole 1983b and [...]

81. The analysis here follows Fudenberg and Tirole 1986, pp. 8-13.

It turns out that the equilibrium depicted in figure 8.19
is not unique. There are many others. To understand why,
consider point A in the figure. This point is close to firm
2's reaction curve and past firm 1's reaction curve. The
strategies specify that from point A on both firms invest
until $R_2$ is reached. However, both firms would prefer the
status quo at A. Firm 1 in particular would not want to
invest even if firm 2 stopped investing; it just invests in
self-defense to reduce firm 2's eventual capital level. Both
firms' stopping at A is an equilibrium in the subgame
starting at A, enforced by the credible threat of going to [...]

study we presume that capital does not depreciate.
An open issue, analyzed by Hanig (1985) and
Reynolds (1987), is the behavior of investment in the


industry when capital depreciates. Intuition suggests that 
capital ought to lose some of its commitment value and 
that the steady-state levels of capital should be less sensi- 
tive to the initial head start of one of the firms. Hanig and 
Reynolds consider quadratic payoff functions, 


$\Pi^i = K_i(1 - K_i - K_j)$,
and quadratic investment costs,
$C(I) = cI^2/2$.


They allow depreciation ($\dot{K}_i = I_i - \mu K_i$) and discounting,
and they look for Markov-perfect-equilibrium investment
strategies that are linear in the capital levels
($I_i(t) = -\alpha K_i(t) - \beta K_j(t) + \gamma$, where $\alpha, \beta, \gamma > 0$). They use
differential-games techniques⁸³ to obtain such a solution.
The main result is that the steady-state level of capital for 
both firms strictly exceeds the Cournot level; thus, both 
firms are beyond their reaction curves in the long run. 
The intuition is the same as for the model of short-run 
commitments (and no fixed cost) described above. Each 
firm at each instant keeps more capacity than it would 
if it could not influence its rival's accumulation. It thus 
forces its rival to reduce its capacity. Because both firms 
behave in this Stackelberg fashion, their capital levels 
exceed the Cournot ones. The commitment value of cap- 
ital is inversely related to its rate of depreciation. In par- 
ticular, capital that depreciates rapidly involves only a 
short-term commitment. 

If we ignore fixed costs and barriers to entry, these 
models point at the following conclusion for dynamic 
competition under Markov strategies: Relative to static 
competition (see chapter 5), repeated interaction pro- 
motes collusion under price competition (see chapter 6) 
and fosters competition under capacity competition in the 
Hanig (1985), Maskin-Tirole (1987), and Reynolds (1987) 
models.⁸⁴ This conclusion makes economic sense. By raising
its price, a firm creates incentives for its rival to do the
same; by increasing its capital level, it induces its rival to 
reduce its own. Thus, the distinction between strategic 


(i.e., an equilibrium with steady state under the upper envelope
of the reaction curves) through arguably intuitive arguments, including the
ability of the investment strategies in the capital levels required by the theory
of differential games is not an innocuous assumption. It rules out the above
early-stopping equilibria, in which a firm invests to some level, stops, and
threatens to resume investment if its rival does so.


84. We must be careful here because of the potential multiplicity of equilibria.
The previous no-depreciation capital-accumulation game admitted nondifferentiable
equilibria, which are quite collusive. See note 82.

substitutes and complements has some relevance for the


they face discrete choice: They invest in plants that
are most efficient in terms of scale (as in the case of a
U-shaped cost curve); they choose among a limited set of
products; they locate at a restricted set of geographical
so on. The advantage of being the first mover 
an extreme form: that of preemption. Cer- 
mption occurs in the above long-term capital- 


(in the absence of depreciation, the steady 
volve a bit less capital for this firm and a bit 


occupy the right market niche on time may 
to prevent an entrant from installing a plant 
or occupying the niche. A little delay may allow entry
and thus have large consequences for the firms’ profits.⁸⁵

In this subsection we will study discrete choice and
preemption. The discussion will be limited to situations
in which the preempting firm does not physically deter 


entry but, rather, makes it unprofitable. (Exclusionary 
investments will be studied in chapter 10 in the context of 
patentable innovations.) 

A natural focus of preemption games is the timing of 
the introduction of plants or products. As in the model of 
Eaton and Lipsey (1980), the established firms will tend to 
invest early. Another focus is the persistence of monopoly. 
Will the established firm always be able to deter its rivals 
from entering by investing early? Should we expect a 
monopolistic or an oligopolistic structure in the long run?


8.6.2.1 Product Proliferation 


As we saw in chapter 7, firms want to differentiate their 
products so as to avoid intense price competition (with 
some exceptions). Therefore, potential entrants look for 
unfilled market niches. To deter entry, the established 
firms may try to pack the product space and leave no 
profitable market niche unfilled. For instance, Scherer
(1980, pp. 258-259) describes General Motors’ 1921 de- 
cision to offer a complete spectrum of automobiles, and 
GM chairman Alfred P. Sloan’s strategic approach to 
the decision. He also notes how the Swedish Tobacco 
Company, upon losing its legal monopoly position in 
1961, reacted by offering twice as many brands (and 
by increasing its advertising twelvefold in the following 
years). Schmalensee (1978) observes that the six leading 
manufacturers of ready-to-eat breakfast cereals introduced 
eighty brands between 1950 and 1972 (the year in which 
the Federal Trade Commission issued a complaint against 
the four largest manufacturers, who had cornered 85 
percent of the market and who enjoyed large profits). 
Schmalensee (1978) shows formally how a cartel (a 
group of firms that act as a single monopolist) crowds a 
product space. In the context of a circular location model, 
he asks how many products a cartel should introduce to 
make further entry unprofitable; and he shows that it is 
indeed in a cartel’s interest to deter entry in this way. 
Schmalensee’s model is static and therefore is silent on the 
optimal timing of preemption. Subsequent research has
developed models in which, over time, demand grows or 
the cost of introducing new products decreases, and the 
date of introduction of a new product is a choice variable. 


85. A similar phenomenon would occur in an investment game with a fixed
cost of entry.

Further persistence-of-monopoly results were discovered
by Eaton and Lipsey (1979), who described preemption in
a location model⁸⁶; Gilbert and Newbery (1982), who
demonstrated a similar result and clearly identified the
reason for persistence in the context of a patent race; and
Gilbert and Harris (1984), who determined the “threat
dates” at which an established firm can build indivisible
plants so as to deter entry.⁸⁷ We now look at the results
for a simple product-differentiation model.⁸⁸

Let us return to the simple model developed in chapter
7 and consider a linear city of length 1. We will assume
that there are only two possible store locations: one
at each of the extremities of the city. This assumption,
although not crucial, simplifies the presentation. The consumers,
who are distributed evenly along the segment,
have a transportation cost t per unit of distance. Time is
continuous and belongs to the interval [0, +∞). At date
0, the consumer density is unitary. It remains unitary until
date T, when it doubles instantly; then it remains 2 forever.
(The discontinuous growth of the population, which
is somewhat contrived, gives us a simple way of modeling
locational choice in an expanding city.)

There are two firms. At date 0, firm 1 (the existing
firm) serves the entire market from its single store at the
left end of the city. At any future moment, each of the
two firms can build a shop at the right end of the city
with an accompanying fixed investment cost f. For the
time being, we assume that a firm does not exit after
sinking the investment cost.

We could assume that each firm can build a store at
a location where its rival already has a store. However,
because Bertrand competition with undifferentiated products
yields zero profit, it is easy to see that such a policy
is not profitable in our model; therefore, we will not
consider it (but see below). The problem is to determine
which firm will invest in building at the second location
and at what moment this will occur.

Before date T, firm 1 earns a profit per unit of time of
$\Pi_0^m$ if neither of the two firms has built at the right end of
the city, $\Pi_1^m$ (without subtracting the fixed building cost)
if it has built, and $\Pi^d$ if firm 2 (the entrant) was the first
to build. In the last case, firm 2 also earns $\Pi^d$ per unit
of time. If the unit production cost (net of the fixed building
cost of the store) is constant, these flow profits are
doubled after date T because of the population growth.
We assume that $\Pi_1^m > \Pi_0^m$ and $\Pi_1^m > 2\Pi^d$. The first of
these inequalities simply says that, if we ignore the cost
of building the store, the existing firm prefers to have two
stores rather than one; the second inequality says that for
a given number of stores (here, two), the total profit of
the industry is smaller for a duopoly because of competition.
These conditions are quite general. They hold, for instance,
in the case where the consumers have unitary demands,
where $\bar{s} > 2t$ ($\bar{s}$ is the consumers’ valuation for the good
sold by both stores), and where the production cost c is
zero: then $\Pi_0^m = \bar{s} - t$, $\Pi_1^m = \bar{s} - t/2$, and $\Pi^d = t/2$.

Let $t_1 > 0$ denote the preemption date, that is, the date
at which one of the firms invests (first) and builds at the
right end of the city. Let $L_i(t_1)$ (respectively, $F_i(t_1)$) be the
present discounted value of profit, at date 0, for firm i
when it is first to invest and does so at date $t_1$ (respectively,
is preempted). L and F stand for leader and follower.
(The leadership is endogenous.) For $t_1 < T$, these functions
are given by


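$$L_1(t_1) = \int_0^{t_1} \Pi_0^m e^{-rt}\,dt + \int_{t_1}^{T} \Pi_1^m e^{-rt}\,dt + \int_T^{\infty} 2\Pi_1^m e^{-rt}\,dt - f e^{-rt_1},$$

$$F_1(t_1) = \int_0^{t_1} \Pi_0^m e^{-rt}\,dt + \int_{t_1}^{T} \Pi^d e^{-rt}\,dt + \int_T^{\infty} 2\Pi^d e^{-rt}\,dt,$$

$$L_2(t_1) = \int_{t_1}^{T} \Pi^d e^{-rt}\,dt + \int_T^{\infty} 2\Pi^d e^{-rt}\,dt - f e^{-rt_1},$$

$$F_2(t_1) = 0,$$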


Ws include Hay 1976, Prescott and Visscher 1977, and Rothschild 
account of these contributions, see Gabszewicz and Thisse 1986. 
Bonanno (1987) analyzes a Prescott-Visscher-type model of sequential entry 
in a spatial market. He allows firms to open several stores, whereas Prescott
restrict the analysis to one store or none per firm. At date i 
fp) firm ; decides whether to enter and (if it chooses to enter) how 


monopoly persists: Firm 1 deters entry. In fact, for some values of
the parameters, entry deterrence is not achieved through product proliferation:
rather, firm 1 opens the same number of stores as a protected monopolist, but
rearranges the locations of these stores to deter entry. If this strategic location
choice is not sufficient or not the most profitable way to deter entry, product
proliferation occurs.


87. See Rao and Rutenberg 1979 for the optimal timing of plant installation
when entry cannot be deterred.


88. The following analysis follows Fudenberg and Tirole 1986, pp. 41-45,
which builds on Eaton and Lipsey 1979.


where r is the interest rate and f is the investment cost.
For $t_1 > T$, we can define $L_i$ and $F_i$ in a similar way.
Now suppose that


$$\frac{2\Pi^d}{r} > f > \frac{\Pi^d}{r}.$$

The first inequality tells us that after the doubling of
the population, the present discounted value of duopoly
profit is greater than the investment cost. This condition
guarantees that firm 2's entry into the industry is profitable.
The second inequality says that at any moment
before T the duopoly profit ($\Pi^d$) does not cover the
interest (rf) on the investment cost. These two inequalities
imply that in the absence of a preemptive threat by
firm 1, firm 2 wishes to invest exactly at date T (that is,
$L_2$ reaches its maximum at date T). The functions $L_i$ and
$F_i$ are depicted in figure 8.20.

We define $T_2 < T$ such that, at date $T_2$, firm 2 is
indifferent between preempting and being preempted; i.e.,


$L_2(T_2) = F_2(T_2) = 0$.


We can verify that $L_2(t_1) > F_2(t_1)$ if and only if $t_1 > T_2$,
and we can verify that $L_1(t_1) > F_1(t_1)$ for any $t_1 \geq T_2$
(using $\Pi_1^m - \Pi^d > \Pi^d$).

Suppose that $(\Pi_1^m - \Pi_0^m) < rf$. In other words, in the
absence of a threat of entry, the established firm does not
choose to invest before T.⁸⁹ That is, $L_1$ is increasing
before T. Figure 8.20 fully summarizes the preemption
game between the two firms.

We can now solve the preemption game. To do this,
we consider the problem by looking back in time from
date T. At that moment, the established firm (firm 1)
wishes to invest (if no one has done so before) regardless
of firm 2's subsequent strategy. Knowing this, firm 2 will
not allow investment by firm 1; it will preempt at some
earlier moment $T - \varepsilon$, because $L_2(T - \varepsilon) > F_2(T)$. Firm
1, knowing the entrant’s preemptive choice at $T - \varepsilon$, will
wish to preempt by investing just before that moment,
and so forth. This preemptive spiral stops at moment $T_2$,
when firm 2 finds further preemption too costly. Therefore,
in order to preempt firm 2, it is sufficient for firm 1

Figure 8.20 (vertical axis: discounted profits; curves shown for firm 1 and firm 2)


to invest just before $T_2$. Since $L_1$ is increasing before $T_2$,
firm 1 waits until that date (or just slightly before it) to
preempt. Therefore, equilibrium is characterized by the
following two properties:


• The established firm preempts the entrant and retains
its monopoly.

• Preemption occurs before the increase in the population,
at the first date when the entrant would have been
willing to enter in the absence of preemption.


A correct formalization of the equilibrium strategies can
be found for a similar game in Fudenberg and Tirole
1985.⁹⁰
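As a rough numerical illustration of the preemption date, the entrant's indifference condition $L_2(T_2) = 0$ can be solved in closed form, which gives $T_2 = T - (1/r)\ln[\Pi^d/(rf - \Pi^d)]$. The Python sketch below evaluates this expression and checks the properties used in the argument. The parameter values (t = 1, valuation 3, r = 0.1, f = 8, T = 10) and all function names are illustrative assumptions, not taken from the text; they are chosen only so that the inequalities assumed above hold.

```python
import math

# Illustrative (assumed) parameters: transport cost t, valuation s_bar, zero production cost.
t, s_bar = 1.0, 3.0
pi_m0, pi_m1, pi_d = s_bar - t, s_bar - t / 2, t / 2   # flow profits before date T
r, f, T = 0.1, 8.0, 10.0                               # interest rate, fixed cost, growth date

# Assumptions of the model: 2*pi_d/r > f > pi_d/r and (pi_m1 - pi_m0) < r*f.
assert 2 * pi_d / r > f > pi_d / r
assert pi_m1 - pi_m0 < r * f


def leader_entrant(t1):
    """L_2(t1): entrant's discounted payoff if it builds first at t1 < T."""
    return (pi_d * (math.exp(-r * t1) - math.exp(-r * T)) / r
            + 2 * pi_d * math.exp(-r * T) / r
            - f * math.exp(-r * t1))


def leader_incumbent(t1):
    """L_1(t1): incumbent's discounted payoff if it builds first at t1 < T."""
    return (pi_m0 * (1 - math.exp(-r * t1)) / r
            + pi_m1 * (math.exp(-r * t1) - math.exp(-r * T)) / r
            + 2 * pi_m1 * math.exp(-r * T) / r
            - f * math.exp(-r * t1))


def follower_incumbent(t1):
    """F_1(t1): incumbent's discounted payoff if the entrant builds first at t1 < T."""
    return (pi_m0 * (1 - math.exp(-r * t1)) / r
            + pi_d * (math.exp(-r * t1) - math.exp(-r * T)) / r
            + 2 * pi_d * math.exp(-r * T) / r)


# Closed-form solution of L_2(T2) = 0.
T2 = T - math.log(pi_d / (r * f - pi_d)) / r

print(f"T2 = {T2:.4f}")
print(f"L1(T2) - F1(T2) = {leader_incumbent(T2) - follower_incumbent(T2):.4f}")
```

With these assumed numbers the entrant is indifferent at a date strictly between 0 and T, while the incumbent strictly prefers leading to following at that date, which is the asymmetry the persistence result relies on.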

The basic result of the above example is the persistence 
of monopoly. The intuitive reasoning behind the more 
general property goes as follows: Competition is destruc- 
tive of profits; a monopolist with the same production 
technology as a duopoly industry earns more profit than 
the two rival firms together (at worst, the monopolist can 
always make its stores choose the strategies followed by 
the competing firms). This property, called the efficiency 
effect and reflected here by the inequality $\Pi_1^m > 2\Pi^d$, is
very general and forms the basis for the phenomenon 
of monopoly persistence. At the time of entry, the poten- 
tial entrant bases his decisions on duopoly profit per 
unit of time $\Pi^d$. Now consider the choices available


89. This inequality was satisfied above: $\Pi_1^m - \Pi_0^m = t/2 = \Pi^d < rf$.

90. The above reasoning is very loose. Familiar strategies for continuous-time
games, called “distributional strategies” and specifying a (right-continuous)
cumulative probability distribution that a firm has moved by any date t,
are not “rich” enough to describe such preemption games. Richer and more
satisfactory strategies are obtained by taking the limit of the discrete-time
model while allowing reasonable behavior. See Simon and Stinchcombe 1986
and Simon 1987 for useful elaborations on this theme.

to the existing firm, i.e., either to allow or to preempt
entry. Allowing entry implies a loss of $\Pi_1^m - \Pi^d$
per unit of time. Since $\Pi_1^m - \Pi^d > \Pi^d$, the existing firm
has more of an incentive to preempt than the entrant has to

dissipation turns out to be socially wasteful, as in Eaton
and Lipsey 1980. Thus, a social planner would wish to
eliminate the threat of entry.⁹¹

It is worthwhile to compare preemption games such as
the one just solved with war-of-attrition games such as
the one considered in section 8.1. Both are “games of
timing.” In such games, each firm makes a single decision
(when to enter in the preemption game; when to exit in
the war of attrition). In the preemption game, each firm
wants to be first (at least over a period of time preceding
the optimal date for moving), but would like to move
“late” if it could be sure that its rival would not preempt.
In the war of attrition, each firm prefers to “move” second
(e.g., not to exit) but would like to move “early” if it could
be sure that it would be outlasted by its rival. These two
standard games are only polar examples of games of
timing, and more general industrial-organization situations
may involve different patterns; however, the techniques
and intuitions derived for these games help us to
understand the more complex situations (see Katz and
Shapiro 1984).


8.6.2.2 Is Spatial Preemption Credible?


The above line of reasoning based on the efficiency
effect suggests that monopoly situations remain monopoly
situations, which of course is not always the case. We
will consider what may be wrong with this reasoning. For
the moment, let us observe that the incumbent's investment
has a preemptive value only if the incumbent is
committed to this investment (see Judd 1985).
A multiproduct incumbent who can withdraw some of his

products at low cost may not be able to use crowding 
as a barrier to entry. This seems logical; we have been 
insisting all along that investment deters entry more 
easily when committed. Judd’s interesting insight is that 
if a multiproduct firm competes with a single-product 
rival on some market, the multiproduct firm has more incen- 
tive to quit the market than its rival as long as a low price in 
this market depresses the demand for its other goods. Thus,
existing products may have little commitment value. 

To see how the multiproduct firm may be forced to 
exit a market, consider the previous model of a linear city. 
Suppose that the incumbent has preempted the entrant 
and has two stores, located at the two extremities. Suppose
further that the entrant follows suit and enters the
right-end location itself. If no firm exits, Bertrand competition
drives the price of the two right-end stores down
to marginal cost c. Hence, each firm makes a zero profit
at its right-end store. Firm 1 makes a positive profit at 
its left-end store. Because of the transportation cost, the 
goods sold by the left- and right-end stores are differen- 
tiated, and firm 1 can keep its price a bit above c without 
losing all its customers (see chapter 7). However, its profit 
is meager, because the good sold at the right-end location 
is sold at the low price c. Now compare the two firms’
incentives to quit the right-end location, assuming that 
the firms do not recoup their building cost f when exiting 
and do not incur any extra exit cost. Firm 2 has little 
incentive to exit, because it makes a zero profit whether it 
exits or stays when firm 1 stays. Firm 1, however, makes 
more money by exiting than by staying, if firm 2 stays. 
By exiting the right-end location, it raises the price at this 
location and therefore increases the residual demand for 
the good sold at the left-end location. For instance, for 
linear transportation costs, uniform density of consumers, 
and a city of length 1, the duopoly price is c + t > c (see
chapter 7). Because firm 1 was not making any money 
from the consumers purchasing at its right-end branch, 
it cares only about the residual demand faced by its left- 
end branch; thus, it increases its profit by exiting. To sum- 
marize loosely: Exiting the right-end location, a weakly 
dominated strategy for firm 2, raises firm 1's profit. Thus, 


91. Because of the inelastic demand structure, the flow increase in welfare
(before T) associated with the introduction of the right-end store is equal to the
savings in average transportation costs: $t/2 - t/4 = t/4$ (assuming that the
monopoly always covers the market). The flow cost of the store is equal to rf.
But, by assumption, $rf > \Pi^d = t/2 > t/4$.

in equilibrium, firm 1 exits immediately and firm 2 stays; the
result is a duopoly.⁹²
Solving for the overall game, notice that firm 2 kicks


preemptive purposes. Thus, the equilibrium is such that
firm 1 never enters the right-end location; firm 2 enters this
location at date T. No preemption occurs. We conclude
that small exit costs together with product substitution
may place the incumbent firm at a disadvantage and prevent
it from credibly preempting the entrant through
product proliferation.
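The asymmetry in exit incentives can be checked with a small calculation. The Python sketch below assumes the linear-city demand of chapter 7 with a covered market and an interior solution, so that the left-end store facing a right-end price p_R sells $(p_R - p_L + t)/2t$; the function name and the values c = 0, t = 1 are hypothetical choices made for illustration only.

```python
def left_store_profit(p_right, c, t):
    """Best-response flow profit of the left-end store when the right-end price is p_right.

    Assumes a covered market and an interior solution, so demand for the
    left-end store is (p_right - p_left + t) / (2 t).
    """
    p_left = (p_right + c + t) / 2              # first-order condition
    demand = (p_right - p_left + t) / (2 * t)
    return (p_left - c) * demand


c, t = 0.0, 1.0  # assumed marginal cost and transportation cost

# Both firms stay at the right end: Bertrand competition there gives price c,
# so firm 1 earns nothing at the right end and only its left-store profit.
firm1_if_it_stays = left_store_profit(c, c, t)

# Firm 1 exits the right end: ordinary duopoly, right-end price c + t.
firm1_if_it_exits = left_store_profit(c + t, c, t)

# Firm 2 earns zero at the right end whether it stays (price c) or exits,
# as long as firm 1 stays; it earns the duopoly profit if firm 1 exits.
firm2_if_firm1_stays = 0.0
firm2_if_firm1_exits = (c + t - c) * 0.5

print(firm1_if_it_stays, firm1_if_it_exits, firm2_if_firm1_exits)
```

Under these assumptions firm 1's flow profit rises from t/8 to t/2 when it abandons the right-end store, whereas firm 2 never loses by staying, which is the sense in which exiting is weakly dominated for firm 2 only.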


Exercise 8.10" Two differentiated goods, apples and
oranges, are located at the two extremes of a linear prod-


(i) Show that the demand functions are
$D_1 = (p_2 - p_1 + t)/2t$
and
$D_2 = (p_1 - p_2 + t)/2t$


in the relevant range ($|p_2 - p_1| \leq t$ and prices not too
high).

(ii) Solve for the Bertrand equilibrium. Compute the 
profits. 

(iii) Suppose that firm 1 is an apple monopoly, but that 
both firms produce oranges. Compute the Bertrand equilibrium.
Show that $\Pi^1$ is smaller (by a factor of 4) than in
question ii. Explain. 

(iv) Suppose that there are no exit costs, that entry 
costs are sunk, that firm 1 is in both markets, and that firm 
2 is in the orange market (as in question iii). Which firm 
has an incentive to exit the orange market? What do you 
conclude about the role of sunk costs or exit costs with 
regard to the possibility of entry deterrence through 
product proliferation (e.g., firm 1 entering first in orange 
markets)? 


8.6.2.3 Do Monopolies Persist? 


Subsection 8.6.2.1 unveiled an important factor favor- 
ing the persistence of monopolies: the efficiency effect. 
Because competition destroys industry profits, an incumbent 
has more incentive to deter entry than an entrant has to 
enter.⁹³ In general, however, this efficiency effect is not
sufficient for the persistence-of-monopoly result. (This is 
fortunate for the theory: In the United States there are 


strategy for firm 2, because by always charging c it can guarantee
intertemporal profit. Furthermore, staying is profitable if firm 1


a more sophisticated argument can be used to derive the same 
hen reentry is allowed.) Suppose now that we rule out weakly 
strategies as eligible for equilibrium behavior (as is done, for


If firms could recoup part of their fixed cost f when exiting, exiting would
no longer be a weakly dominated strategy for firm 2, but firm 1 would still gain
more by exiting than firm 2. The exit game would then resemble a war of


probability of exit exceeds firm 1's.) 
93. This efficiency effect rests on the comparison between a monopoly and a
duopoly. One might conjecture that, more generally, a big firm has more
incentive to preempt than a small firm. This, however, is not correct. Suppose,
for instance, that the initial market structure is a duopoly. Firm 1 (the big firm)
has unit cost 1 and firm 2 (the small firm) has unit cost 3. There is no fixed cost,
and the firms wage Cournot competition. Suppose that an innovation comes
along that makes a technology with unit cost 2 accessible at low adoption cost.
Even in the case where a firm can buy an exclusive right to this technology
(thus excluding its rival), it is not clear that firm 1 will preempt its rival
and purchase the new technology (which it would do only for competitive
purposes, and not for productive purposes). It may be the case that the cost
reduction for firm 2 offsets the loss in industry revenues stemming from
more intense competition. Thus, firm 2 may have more incentive to buy the
technology than firm 1 (see Leung 1984 and Kamien and Tauman 1983 for
related ideas). This comes from the fact that we are comparing an initial
duopoly situation with a subsequent duopoly situation. If the initial industry
configuration were an unconstrained monopoly (firm 2 starting with a large
unit cost), then the efficiency effect would prevail.

very few pure monopolies. In the absence of regulatory
restrictions, multifirm markets are the norm.)

First, preemption must be effective. Either it allows the
preemptor to establish a property right on the technology
(e.g., through a patent or exclusive licensing) or it
commits the firm to intense price competition if the rival
follows suit. An example in which preemption is not fully
effective (in the sense that it allows entry) is the Stackelberg
game without fixed costs (see section 8.2). In this
example, the Stackelberg leader does not own property
rights on capital; furthermore, capacity constraints prevent
fierce price competition, so that the only way the
leader can deter entry is to accumulate enough capacity to
serve the whole market at a price equal to the marginal
cost of investment plus production (and thus make no
profit itself). By simply accumulating the Stackelberg follower’s
capacity on top of the leader's, the leader does not
deter entry. Another example in which preemption is not
effective is when the incumbent's investment has no commitment
value, as in the example of product dropping
considered in the preceding subsection.

Second, the technology of preemption must be deterministic.
That is, firms must have a means of preempting
their rivals. With a nondeterministic technology (as in the
patent contests considered in chapter 10), there may be
no way for the incumbent to guarantee that it will obtain
the technology first.⁹⁴

Third, even in a situation in which preemption is both
effective and deterministic, it is hard to believe that
a monopolist always keeps his privileged situation. Indeed,
several variations of the preemption model give rise
to a positive probability that an oligopolistic structure
emerges:


(1) The incumbent does not possess the entrant's technology. In
this obvious case, the incumbent cannot duplicate the
entrant's strategy a bit earlier, which may leave scope for
entry. In a product-differentiation model, for instance,
the incumbent might not be able to build at the right-end
location.


(2) The incumbent may not have time to preempt the entrant. 
This is the case when an innovation appears that both the 
incumbent and entrant would like to adopt immediately. 
Preemption would then require the incumbent to adopt 
the innovation before its appearance, which is impossible. 
This lack of time is also present (in a disguised way) in the 
simultaneous models of entry, in which there is only one 
period to invest. The models of competition in location 
and of monopolistic competition analyzed in chapter 7 
belong to this category. In the case of the linear city with 
two locations, it is easy to see that there exist two equi- 
libria in pure strategies if entry decisions are made simul- 
taneously. In one equilibrium, the existing firm is the 
only one to invest and build at the right end of the city 
(monopoly persistence); in the other, the entrant is the 
only one to invest and build at this location (entry).⁹⁵

The lack of time is also implicit in the investment 
models in which firms are not allowed to build more than 
one plant or introduce more than one product. The implicit
assumption there is that rapid investment (in a second
or third plant or product) is very costly, and that
single-plant or single-product firms can enter before the 
established firms can pursue their expansion. 

In a sense, the simultaneous-entry models of chapter 7 
correspond to very long information lags: There is no 
way a firm can observe its rivals’ choices before making 
its own. This is clearly an extreme assumption, even in 
situations where the firms’ investment decisions take a 
while to be observed.⁹⁶ More generally, one may consider
dynamic rivalry under non-negligible information lags 
(imperfect information). Fudenberg et al. (1983) consider a 
game in which, under perfect information (no information 
lags), only the incumbent conducts research and develop- 
ment, whereas with information lags competition may 
arise. If the entrant did not try to enter, the incumbent 
would delay its investment decision (as in the preemption 
game, the incumbent would like to move “slowly”) and 
it would pay the entrant to enter. But the entrant will 
try to enter only if it has a chance of being first. Thus, 


94. Furthermore, we will see in chapter 10 that, because the date of preemption
is random, the incumbent may not want to hasten its own replacement and may
therefore have less incentive than the entrant to invest in R&D.


95. As is usual in the simultaneous-entry models of a natural monopoly (where
“natural monopoly” refers to the right-end location and not to the whole
market), there exists a third, mixed-strategy equilibrium, in which both firms are
indifferent between entering and not entering.

96. And where firms cannot communicate their investment decisions in a
credible way. Indeed, it may be in the interest of a firm to release investment
decisions so as to deter entry. (Firms actually do announce construction of
plants or, like IBM, preannounce their products.) Conversely, the absence of
announcement signals the absence of investment (unless announcements reveal
precious technological information).
calculate exactly the optimal preemption date
e case of an information lag, the existing firm


optimal date of preemption, $T_2$, is much later for a high-cost
entrant than for a low-cost entrant. Intuitively, if
the probability that the entrant has a high entry cost is
sufficiently high, it does not pay for the incumbent to
make sure he preempts the low-cost entrant. The benefits
of obtaining a higher $L_1$ offset the loss associated with the
probability of being preempted.

Answers and Hints

Exercise 8.1


The situation is depicted in figure 8.21. Let $C = \{p^c, q^c\}$
denote the point at which the average-cost curve intersects
the demand curve, let $q^*$ denote the most efficient
scale, and let $p^*$ denote the minimum average cost. A
market-clearing, profit-making allocation must lie on the
demand curve to the northwest of C. In particular, the
market price must (weakly) exceed $p^c$. Now suppose that
an entrant comes in at a lower price $p^e$ between $p^*$ and $p^c$,
and produces $q^e = q^*$. That is, the entrant rations the
consumers at price $p^e$. Because the price charged by the
entrant strictly exceeds its average cost (which is $p^*$), the
entrant makes a strictly positive profit and the initial
allocation is not sustainable. Thus, there exists no sustainable
allocation.


Exercise 8.2 


(i) There are several meanings to “natural monopoly” 
(the notion depends on the application to be made). One 
meaning refers to the socially efficient production pat- 
tern, Because of increasing returns to scale, one firm is 
the optimal arrangement (if its price can be controlled). 
Another meaning looks at the maximum (upper bound 
on the) number of firms in the industry. Here, even 
if firms can somehow collude, they make at most
$\Pi^m = \max_p [p(1 - p)] = 1/4$. Since $f = 3/16$, if there are two firms
at least one of them loses money.


Figure 8.21

(ii) The contestable price satisfies $p(1 - p) = f$, or $p = 1/4 = p^m/2$.
Welfare is equal to net consumer surplus plus
profit (i.e., here, net consumer surplus). Here,

$w^c = (1 - p)^2/2 = (3/4)^2/2 = 9/32$.


(The social optimum is reached for p = c = 0, which
yields $w^* = 1/2 - 3/16 = 5/16$.) In continuous time, welfare is
equal to $W^c = w^c/r$.

(iii) In the symmetric equilibrium of a war of attrition,
each firm exits with probability $x\,dt$ between $t$ and $t + dt$
if both firms are still in. Indifference requires

$$\frac{x}{r}\left(\frac{1}{4} - \frac{3}{16}\right) = \frac{3}{16}.$$

To compute the expected intertemporal welfare, first
notice that instantaneous welfare is equal to

$$w_2 = \frac{1}{2} - 2\left(\frac{3}{16}\right) = \frac{1}{8}$$

when there are two firms (because they charge the competitive
price) and

$$w_1 = \frac{1}{8} + \left(\frac{1}{4} - \frac{3}{16}\right) = \frac{3}{16}$$

when only one firm is left. The arrival of a monopoly
situation is a Poisson process with parameter $2x$. Hence,
intertemporal welfare is equal to

$$W = \int_0^\infty \left[e^{-2xt} w_2 + (1 - e^{-2xt}) w_1\right] e^{-rt}\,dt.$$

(The probability that there are still two firms at $t$ is $e^{-2xt}$.)
Hence,

$$W = \frac{r}{2x + r}\left(\frac{w_2}{r}\right) + \frac{2x}{2x + r}\left(\frac{w_1}{r}\right).$$

($W$ is a weighted average of $w_2/r$ and $w_1/r$.) That is,

$$W = \frac{5}{28r} < W^c.$$

Here we know, trivially, that $W < W^c$, because both $w_2$
and $w_1$ are lower than $w^c$. But these inequalities hold
more generally.
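The arithmetic of parts (ii) and (iii) can be checked with exact fractions. The following is a minimal sketch, assuming (as the numbers in this answer imply) demand $q = 1 - p$, zero marginal cost, and a flow fixed cost $f = 3/16$; the function and variable names are illustrative and are not taken from the text. Welfare in the war of attrition is reported as $rW$, so that the interest rate drops out.

```python
from fractions import Fraction as F

FIXED_COST = F(3, 16)        # f per unit of time (assumed from the answer above)
MONOPOLY_PROFIT = F(1, 4)    # max over p of p(1 - p), reached at p = 1/2


def consumer_surplus(price):
    """Net consumer surplus under the assumed demand q = 1 - p."""
    return (1 - price) ** 2 / 2


def contestable_welfare():
    """Flow welfare w^c at the zero-profit (contestable) price p = 1/4."""
    price = F(1, 4)
    assert price * (1 - price) == FIXED_COST  # the firm just breaks even
    return consumer_surplus(price)


def war_of_attrition_welfare():
    """Return r * W for the symmetric war of attrition."""
    # Indifference: x * (monopoly profit - f) / r = f, hence x / r = 3.
    exit_rate_over_r = FIXED_COST / (MONOPOLY_PROFIT - FIXED_COST)
    w_two = consumer_surplus(F(0)) - 2 * FIXED_COST                    # duopoly, price 0
    w_one = consumer_surplus(F(1, 2)) + MONOPOLY_PROFIT - FIXED_COST   # monopoly
    weight_two = 1 / (2 * exit_rate_over_r + 1)                        # r / (2x + r)
    return weight_two * w_two + (1 - weight_two) * w_one


print(contestable_welfare(), war_of_attrition_welfare())
```

Comparing the two returned values confirms the ranking $W < W^c$ for these parameter values only; it says nothing about the more general claim.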


Exercise 8.3


(i) Profit is 1.5 for one plant and 1 for two plants.
Accumulating three or more plants is irrational: Profit is
negative if all plants are used and the corresponding
output is sold on the market, and there is no point in
building capacity that is not used.

(ii) Each firm makes equilibrium profit 0.5. Building a 
second plant drives the market price down to 3, which is 
below the unit cost of constructing a plant. 

(iii) By building two plants, the Stackelberg leader appropriates
the Cournot industry profit $2 \times 1/2 = 1$. Building
one plant allows entry of firm 2 with one plant and
yields the same outcome as in (ii).

The first-mover advantage results in the monopoliza- 
tion of the industry. However, in contrast with the fixed- 
cost case discussed in the text, the monopoly structure 
yields the same social welfare as the duopoly structure 
corresponding to simultaneous entry. 

See Gilbert and Harris 1984 for a description of com- 
petition with lumpy investments with a more sensible 
time structure and time-dependent demand; see also the 
discussion of the persistence of monopoly in section 8.6.


Exercise 8.4 


(i) See chapter 5. 
(ii) Because any capacity is used ex post, everything is
as if the consumers' demand were

$$p = 1 - K_1 - K_2$$

(subtracting the capacity cost 3 from the intercept). The
consumer surplus is

$$(K_1 + K_2)^2/2,$$

and the domestic firm's profit is

$$K_2(1 - K_1 - K_2) - f$$

if it enters, 0 otherwise. Welfare is the sum of the two.

No protection. From our derivations in the text, entry
is blockaded by the foreign firm. The latter invests the
monopoly capacity $K_1 = 1/2$, and the domestic firm stays
out:

$$\max_{K_2}\,[K_2(1 - K_1 - K_2) - f] = 0.$$

Welfare is equal to consumer surplus: $W_1 = 1/8 = 0.125$.

Limited protection. In this case, $K_1$ and $K_2$ are chosen
simultaneously. In equilibrium, $K_1 = K_2 = 1/3$. The consumer
surplus is $2/9$, and the domestic firm's profit is
$1/9 - 1/16$. Thus, the welfare is given by $W_2 \simeq 0.271 > W_1$,
because by preventing entry from being blockaded the
manent) protection is detrimental, as it leads to high
domestic prices. They argue that the model fits well with
the Japanese experience in the 1960s. (See Matsuyama
1987 for an analysis of the case in which the government
can commit to a protection policy in the short run but not
in the long run.)


Exercise 8.5


Let $K_1$ and $K_2$ denote an entry-accommodating equilibrium.
They satisfy

$$\Pi^1 + \Pi^2 = (K_1 + K_2)\,[P(K_1 + K_2 + R_3(K_1 + K_2)) - c_0 - c].$$

Suppose that

$$\tilde\Pi > \Pi^1 + \Pi^2,$$

and let

$$\Delta = K^b - (K_1 + K_2).$$

We claim that firm 1 would be better off investing $K_1 + \Delta$
instead of $K_1$. By definition of $\Delta$, this would deter entry.
Firm 1's profit would be

$$\tilde\Pi^1 = (K_1 + \Delta)[P(K^b) - c_0 - c]$$
$$= \tilde\Pi - K_2[P(K^b) - c_0 - c]$$
$$> \Pi^1 + K_2[P(K_1 + K_2 + R_3(K_1 + K_2)) - P(K^b)].$$

Because $P$ is decreasing, it suffices to prove that

$$K_1 + K_2 + R_3(K_1 + K_2) < K^b$$

to obtain a contradiction. Suppose that

$$K_1 + K_2 + R_3(K_1 + K_2) > K^b.$$

Then firm 1 could have invested $K_1 + R_3(K_1 + K_2)$ instead
of $K_1$ and made profit

$$\Pi^1 + (\Pi^3 + f) > \Pi^1 + f > \Pi^1,$$

where $f$ is firm 3's entry cost (that firm 1 need not pay to
accumulate more capacity), where use is made of the
fact that

$$[K_1 + R_3(K_1 + K_2)] + K_2$$

deters entry. Again, we obtain a contradiction.
Exercise 8.6 
(i) The monopolist maximizes 
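$$(1 - q^A - c)\,q^A + [1 - q^B - (c - \lambda q^A)]\,q^B$$

with respect to $q^A$ and $q^B$. The first-order conditions are

$$(1 - c - 2q^A) + \lambda q^B = 0$$

and

$$1 - c + \lambda q^A - 2q^B = 0.$$

(ii) If $q_1^A$ is not observed, it is rationally anticipated to
be $\hat q_1^A$, say. Thus, Cournot competition occurs relative to
cost $\hat c_1^B = c - \lambda \hat q_1^A$. Firm 1 cannot affect firm 2's behavior
by changing $q_1^A$. The first-order condition is thus

$$(1 - c - 2q_1^A) + \lambda q_1^B = 0,$$

where

$$q_1^B = [1 + c - 2(c - \lambda \hat q_1^A)]/3 = (1 - c + 2\lambda \hat q_1^A)/3.$$

Simple computations, together with $\hat q_1^A = q_1^A$ in equilibrium,
yield

$$q_1^A = \frac{(3 + \lambda)d}{6 - 2\lambda^2}, \quad \text{where } d \equiv 1 - c.$$

If $q_1^A$ is observed by firm 2, then firm 1 can influence firm
2's output. Thus, the first-order condition is

$$(1 - c - 2q_1^A) + \lambda q_1^B + \frac{\partial \Pi_1^B}{\partial q_2^B}\,\frac{\partial q_2^B}{\partial c_1^B}\,\frac{dc_1^B}{dq_1^A} = 0,$$

or

$$q_1^A = \frac{(9 + 4\lambda)d}{18 - 8\lambda^2}.$$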
Firm 1 gains when firm 2 observes its first-period output.
By slightly increasing its output beyond $(3 + \lambda)d/(6 - 2\lambda^2)$,
it incurs only a second-order loss if its rival
does not change its output. But firm 2 knows that $c_1^B$ is
lower, and therefore that $q_1^B$ will be higher. Firm 2 curtails
its output, which benefits firm 1 to the first order. With a
fixed cost of entry, firm 1 may want to push its output
beyond $(9 + 4\lambda)d/(18 - 8\lambda^2)$ (assuming that $q_1^A$ is observed)
to deter entry (as in the Stackelberg game discussed
in the text).

Of course, learning by doing is just one way of reducing
future costs. Capacity accumulation (as in the text)
and investment that reduces marginal cost (as in Brander
and Spencer 1983) serve somewhat the same purpose but
are independent of the firm's current pricing behavior.
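The two first-period outputs in part (ii) can be verified by substituting them back into the first-order conditions. The sketch below is an illustration under stated assumptions: second-period competition is Cournot with inverse demand $p = 1 - q_1 - q_2$, firm 2's cost is $c$, and firm 1's second-period cost is $c - \lambda q_1^A$. Under those assumptions firm 2's output falls by $\lambda/3$ per unit of observed first-period output, which is the source of the extra strategic term. All function names are hypothetical.

```python
from fractions import Fraction as F


def second_period_output(q_first, cost, lam):
    """Firm 1's Cournot output in period B when its cost is cost - lam * q_first."""
    return (1 - cost + 2 * lam * q_first) / 3


def foc_unobserved(q_first, cost, lam):
    """Marginal profit of q_first when firm 2 cannot see it (no strategic term)."""
    return (1 - cost - 2 * q_first) + lam * second_period_output(q_first, cost, lam)


def foc_observed(q_first, cost, lam):
    """Same as above plus the strategic term: firm 2 cuts output by lam / 3."""
    q_b = second_period_output(q_first, cost, lam)
    return (1 - cost - 2 * q_first) + lam * q_b + lam * q_b / 3


def output_unobserved(cost, lam):
    """Candidate solution (3 + lam)(1 - c) / (6 - 2 lam^2)."""
    return (3 + lam) * (1 - cost) / (6 - 2 * lam ** 2)


def output_observed(cost, lam):
    """Candidate solution (9 + 4 lam)(1 - c) / (18 - 8 lam^2)."""
    return (9 + 4 * lam) * (1 - cost) / (18 - 8 * lam ** 2)


cost, lam = F(1, 5), F(1, 2)   # arbitrary illustrative parameter values
print(foc_unobserved(output_unobserved(cost, lam), cost, lam))
print(foc_observed(output_observed(cost, lam), cost, lam))
```

Both residuals vanish for the illustrative parameters, and the observed-output case gives the larger first-period quantity, in line with the overinvestment argument in the answer.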


Exercise 8.7


A decrease in firm 1's marginal cost shifts that firm's
reaction curye to the left, as figure 8.22 shows. The equi- 
librium x} 4 pf is thus reduced, which hurts firm 2. To 
make entry unattractive, firm 1 should reduce its marginal
cost (i.e., should overinvest).


Exercise 8.8


False. A quota transforms the foreign firm into a puppy
dog. If the quota is "not too small" (just below the Nash-equilibrium
output), the quota commits the foreign firm to
charge a price a bit above the Nash price and induces the
domestic firm to raise its price. The quota imposes a direct
second-order loss if the quota is close to the Nash output,
and the strategic (indirect) gain for the foreign firm is a
first-order one.


Exercise 8.9 


Consider the (Bertrand) Nash equilibrium $(p_1^*, p_2^*)$ in prices
in the absence of government intervention. For $\varepsilon$ small,
$\Pi^1(p_1^* + \varepsilon, p_2^*)$ is equal to $\Pi^1(p_1^*, p_2^*)$ to the second order
in $\varepsilon$ (because firm 1 optimizes). Now consider a price
floor equal to $\underline{p}_1 = p_1^* + \varepsilon$. See figure 8.23. Then

$$(p_1^{**}, p_2^{**}) = (p_1^* + \varepsilon,\; R_2(p_1^* + \varepsilon))$$

is the new Nash equilibrium. Given $p_2^{**}$, firm 1 would like
to reduce its price; however, it cannot. Firm 1 is better off
in spite of (actually, because of) the fact that its set of
choices has shrunk. The price floor commits it to charge a
high price. This is the "puppy dog" effect.
Exercise 8.10


(i) and (ii) See chapter 7. 
(iii) The price of an orange is $c$. Hence, the residual
demand for apples is

$$D_1 = (c + t - p_1)/2t.$$

Maximizing profit

$$(p_1 - c)(c + t - p_1)/2t$$

yields $p_1 = c + t/2$ and profits $\Pi^1 = t/8$.

(iv) Firm 1 gains $t/2 - t/8 = 3t/8$ by exiting the orange
market. Firm 2 does not increase its profit by exiting.
Thus, in the absence of exit barriers, firm 1 exits the
orange market. Firm 2, therefore, enters the orange market,
even if firm 1 is already in both markets (as long as
the entry cost does not exceed $t/2$).
Information and Strategic
Behavior: Reputation, Limit 
Pricing, and Predation 


Oligopolists are affected by many variables they cannot 
observe or estimate precisely: their own cost function, the 
cost functions of their rivals, the state of demand or the 
potential of the market, and their rivals’ strategic deci- 
sions. To the extent that some pieces of information are 
private (for instance, a firm may have a good estimate of 
its own cost but no good estimate of its rivals’ costs), we 
must envision market interaction as a game with asym- 
metric information. 

The simplest case is when oligopolists compete in a 
static way (i.e., they meet only once). The simplicity
of this case stems from the fact that, although a firm's 
information is at least partly revealed by its actions, no 
use is made of the revealed information, as there is no 
future. Firms thus behave myopically, and simply max- 
imize their static profit given their private information 
and their assessment of their rivals’ actions on the basis 
of their own private information. Section 9.1 addresses 
the case of static price competition in which firms have 
private information about their own cost (a special case of 
which is a one-shot auction). Most of the intuition for the
dynamic case can be derived from this static analysis, as 
we will see in section 9.2. 

We saw in chapter 8 how a firm can use tangible 
variables (capital, experience, etc.) to protect and extend 
its market share. However, a firm's history also matters 
because it conveys information to the firm’s rivals and 
thus affects intangible variables such as beliefs. In multiperiod
oligopolistic interactions, a firm's behavior reveals
some of its private knowledge, which the firm's rivals 
can later exploit. A rational firm, realizing this, seeks to 
manipulate its rivals’ information in order to derive bene- 
fits later on. Such manipulations can be called “invest- 
ments in disinformation.” The investment analogy is not 
accidental. We will see that chapter 8's taxonomy of
rival-softening strategies applies to situations of asymmetric
information. Section 9.3 considers accommodation
(repeated interaction in a stable industry structure) and
argues that, under some conditions, asymmetric information
gives producers of substitute goods incentives to
raise their prices.

Sections 9.4 through 9.6 look at the more aggressive
strategies associated with entry deterrence or exit inducement
(predation). Section 9.4 deals with the Milgrom-Roberts
reconsideration of limit pricing. Section 9.5
analyzes predation in cases of merger, section 9.6 the
reputation of a multimarket firm. Sections 9.7 and 9.9
turn to other predatory behaviors: the "long purse"
story (in which the predator tries to exhaust the prey's
resources) and the war of attrition.¹


9.1 Static Competition under Asymmetric Information


9.1.1 Simple Model of Price Competition 


Intuition concerning games of asymmetric information
can be gained from a simple static example. Consider a
two-period rivalry. The second period is the last, so the
firms then behave as in the static game with the information
structure defined by their posterior beliefs at the
beginning of the second period. That is, the study of the
static game also yields the firms' second-period payoffs as
functions of the posterior beliefs. But these beliefs are the
prior (date 1) beliefs updated to reflect the information
conveyed by each firm's first-period behavior. The static
analysis therefore demonstrates what each firm wants its
rivals to believe about its private information and suggests
how first-period behavior may be used for this
purpose.

Here, we will consider a differentiated duopoly price
game in which, for simplicity, one of the two firms has
incomplete information about its rival's cost. Suppose that
the demand curves are symmetric and linear over the
relevant range:

$$D_i(p_i, p_j) = a - bp_i + dp_j,$$

where $0 \le d < b$. (If each firm raises its price by \$1,
both should lose sales. This requires $d < b$.) Assume that
the two goods are substitutes and strategic complements
($d > 0$),² and that the firms have constant-returns-to-scale
technologies. Firm 2's marginal cost, $c_2$, is common
knowledge, but firm 1's marginal cost, $c_1$, is known only
to firm 1. Suppose that for firm 2 $c_1$ can a priori take two
values: $c_1^L$ (with probability $x$) and $c_1^H$ (with probability
$1 - x$), where $c_1^L < c_1^H$. Let

$$c_1^e = xc_1^L + (1 - x)c_1^H$$

denote firm 1's expected marginal cost from the point of
view of firm 2. The ex post profits can be written as

$$\Pi^i(p_i, p_j) = (p_i - c_i)(a - bp_i + dp_j).$$

The two firms choose their prices simultaneously, so
we look for the Bertrand equilibrium.³ Firm 2 charges
$p_2 = p_2^*$. Firm 1's price naturally depends on its marginal
cost. Let $p_1^L$ and $p_1^H$ denote the prices chosen by firm 1 if
it has cost $c_1^L$ or $c_1^H$.

Firm 1's profit maximization yields, for a given $c_1$ and
price $p_2^*$ charged by its rival,

$$a - 2bp_1 + dp_2^* + bc_1 = 0,$$

or

$$p_1 = \frac{a + dp_2^* + bc_1}{2b}. \quad (9.1)$$

$p_1$ is an increasing function of firm 1's cost, whatever price
firm 2 is expected to charge. From the viewpoint of firm
2, which does not know $c_1$, firm 1's expected price is

$$p_1^e = xp_1^L + (1 - x)p_1^H$$
$$= x\left(\frac{a + dp_2^* + bc_1^L}{2b}\right) + (1 - x)\left(\frac{a + dp_2^* + bc_1^H}{2b}\right)$$
$$= \frac{a + dp_2^* + bc_1^e}{2b}. \quad (9.2)$$

Firm 2 is risk-neutral, so it maximizes its expected profit,

$$E\Pi^2 = x(p_2 - c_2)(a - bp_2 + dp_1^L) + (1 - x)(p_2 - c_2)(a - bp_2 + dp_1^H)$$
$$= (p_2 - c_2)(a - bp_2 + dp_1^e),$$

with respect to $p_2$. This yields⁴

$$p_2 = \frac{a + dp_1^e + bc_2}{2b}. \quad (9.3)$$

In Nash equilibrium, $p_2 = p_2^*$. Thus, equations 9.2 and 9.3
yield prices $p_1^e$ and $p_2^*$. Equation 9.1 then yields $p_1^L$ and $p_1^H$.
For the following we will need only

$$p_2^* = \frac{2ab + 2b^2c_2 + ad + bdc_1^e}{4b^2 - d^2}. \quad (9.4)$$

3. Technically, we compute a Bayesian equilibrium (see the Game Theory
User's Manual).

Figure 9.1, which depicts the equilibrium, reflects the
fact that firm 1's reaction function is contingent on its
cost: it moves to the right when the cost increases. Under
symmetric information the Bertrand equilibrium would be
at B or C, depending on whether $c_1$ is low or high. Under
asymmetric information, everything is "as if" firm 1 had
an "average reaction curve" $R_1^e$. Prices $p_2^*$ and $p_1^e$ are given
by the intersection A between firm 2's reaction curve and
firm 1's average reaction curve.

Figure 9.1

The crucial property for our discussion of dynamics is
that (from equation 9.4) firm 2's price is an increasing
function of firm 1's expected cost and therefore an increasing
function of $1 - x$. Firm 2 raises its price when
it becomes more likely that firm 1's cost is high, because
a high $c_1$ induces a high $p_1$ and the goods are strategic
complements (i.e., the reaction curves are upward
sloping).
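The comparative static just described can be illustrated numerically. The following is a minimal sketch that evaluates the closed forms of equations 9.1 and 9.4 for the linear demand $D_i = a - bp_i + dp_j$; the function name, the dictionary keys, and the parameter values used in the example call are illustrative assumptions, not part of the text.

```python
def bayesian_bertrand(a, b, d, c2, c1_low, c1_high, x):
    """Equilibrium prices when firm 2 puts probability x on firm 1 having the low cost."""
    c1_expected = x * c1_low + (1 - x) * c1_high
    # Equation 9.4: firm 2's equilibrium price depends on c1 only through its mean.
    p2 = (2 * a * b + 2 * b**2 * c2 + a * d + b * d * c1_expected) / (4 * b**2 - d**2)
    # Equation 9.1: firm 1's best response to p2 for each cost type.
    p1_low = (a + d * p2 + b * c1_low) / (2 * b)
    p1_high = (a + d * p2 + b * c1_high) / (2 * b)
    p1_expected = x * p1_low + (1 - x) * p1_high
    return {
        "p2": p2,
        "p1_low": p1_low,
        "p1_high": p1_high,
        "p1_expected": p1_expected,
    }


# Hypothetical parameters: a = 10, b = 2, d = 1, c2 = 1, c1 in {1, 3}.
for prob_low in (0.8, 0.5, 0.2):
    eq = bayesian_bertrand(10, 2, 1, 1, 1, 3, prob_low)
    print(prob_low, round(eq["p2"], 4), round(eq["p1_low"], 4), round(eq["p1_high"], 4))
```

As the probability of the low cost falls, firm 2's price rises, and with it both of firm 1's type-contingent prices, which is the strategic-complements effect described above.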


9.1.1.1 Information Exchange 


Now suppose that firm 1 can report verifiable information
about $c_1$ costlessly. That is, firm 1 can, if it wants,
perfectly reveal its cost to firm 2. It is easy to see that firm
1 will indeed do so, despite the fact that, when it has a low
cost, revelation makes it look aggressive and triggers a
low price $p_2$. When firm 1 has cost $c_1^H$, revelation induces
(full-information) posterior beliefs $x' = 0$. From equation
9.4 and the fact that $c_1^e$ has become equal to $c_1^H$, firm 2's
price is as high as it can be in equilibrium.⁵ Thus, a
high-cost firm has every incentive to reveal its cost to
soften firm 2's pricing behavior. A low-cost firm has no
such incentive; however, not releasing the cost information
signals that costs are low: if they were high, the
information would have been released.⁶

This simple reasoning carries an important moral for 
future dynamic analysis. Firm 1 has an incentive to prove 
that it has a high cost before the firms engage in price 
competition. In the rest of this chapter, we will assume 
that no such direct exchange of information is feasible 
(cost data are not verifiable, say); rather, firm 1 will try to 
signal that its cost is high⁷ through its market behavior.
As we will see, such signaling will not be costless. 


Exercise 9.1** Consider the previous model, but with
symmetric cost uncertainty. The demand functions are
$D_i(p_i, p_j) = 1 - p_i + p_j$. The firms' marginal costs are
drawn independently from a common distribution with
mean $c^e$ and variance $\sigma^2$. A firm's cost is private information.
Each firm can make its cost information available to
its rival (information is verifiable). However, for some
reason, firms cannot reveal their information ex post unless
they have signed an agreement ex ante to do so. Consider
the following sequence of events: (1) The firms, before
knowing the realization of their costs, decide whether

4. The fact that $p_2$ depends only on firm 1's expected price is, of course, special
to the linear structure of this model.

5. That is, $(2ab + 2b^2c_2 + ad + bdc_1^H)/(4b^2 - d^2)$.

6. The argument is formally similar to the argument that a seller with unknown
quality should give a full warranty if he can do so costlessly. See example 3 in
section 5 in the Game Theory User's Manual.
The argument is not limited to two potential cost levels. For instance, with
three types, the high-cost type always releases cost information. Hence, the
absence of release signals that the cost is low or intermediate, which induces the
intermediate type to also release cost information.

7. In other instances, firm 1 will try to signal that its cost is low.




to share cost information ex post. (2) Firm $i$ learns $c_i$
($i = 1, 2$). (3) Firms share information if they decided to
do so earlier; otherwise they do not. (4) They compete in
prices.

(i) Solve for the price equilibrium in the absence of
information sharing. (Hint: Use the symmetry. Show that
each firm's average price is $p^e = 1 + c^e$.) Show that firm
$i$'s ex ante profit is

$$E\Pi^i = 1 + \sigma^2/4.$$

(ii) Solve for the price equilibrium under information
sharing. Show that firm $i$'s ex ante profit is now

$$E\Pi^i = 1 + 2\sigma^2/9.$$

(iii) Conclude that the firms do not agree to share cost
information. Discuss this result.⁸
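The two profit formulas in the exercise can be checked by exact enumeration over a small cost distribution. The sketch below is only an illustration: the two-point distribution is an arbitrary assumption, and the pricing rules are the ones implied by the first-order condition $1 - 2p_i + p_j + c_i = 0$ for demand $D_i = 1 - p_i + p_j$ (with $p_j$ replaced by its mean $1 + c^e$ when costs are not shared). The helper names are hypothetical.

```python
from fractions import Fraction as F
from itertools import product


def ex_ante_profit(price_rule, cost_dist):
    """Expected profit of firm i; cost_dist is a list of (cost, probability) pairs."""
    total = F(0)
    for (c_i, w_i), (c_j, w_j) in product(cost_dist, repeat=2):
        p_i = price_rule(c_i, c_j)
        p_j = price_rule(c_j, c_i)
        total += w_i * w_j * (p_i - c_i) * (1 - p_i + p_j)
    return total


def make_rules(cost_dist):
    """Return the (no_sharing, sharing) pricing rules for the given distribution."""
    mean = sum(c * w for c, w in cost_dist)

    def no_sharing(own, rival):
        # Best response to the rival's average price 1 + mean; rival's cost is unseen.
        return 1 + (mean + own) / 2

    def sharing(own, rival):
        # Complete-information Bertrand price once both costs are known.
        return 1 + (2 * own + rival) / 3

    return no_sharing, sharing


# Assumed example: cost is 0 or 1 with equal probability (mean 1/2, variance 1/4).
dist = [(F(0), F(1, 2)), (F(1), F(1, 2))]
no_sharing, sharing = make_rules(dist)
print(ex_ante_profit(no_sharing, dist), ex_ante_profit(sharing, dist))
```

For this distribution the values agree with $1 + \sigma^2/4$ and $1 + 2\sigma^2/9$ respectively, and the no-sharing profit is the larger of the two, as part (iii) asks the reader to conclude.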


9.1.1.2 Auctions


An interesting special case of price competition under
asymmetric information is the case of a first-price auction.
In its simplest form, an auction has the following features:
The buyer (who stands for consumers in the price-competition
model) has a unit demand. Each seller (firm)
knows its private cost of supplying one unit of the good,
but not its rivals'. A first-price auction chooses the lowest
bidder, who receives his bid. This first-bid auction is thus
equivalent to price competition with perfect substitutes
(and a particular demand curve).⁹


relies on our assumptions on the demand functions, the
nature of asymmetric information, and the type of competition.
In doing so, we will focus on pre-competition
incentives to reveal information.


• Complements versus substitutes. We assumed that the
goods produced by the two firms were demand substitutes
($d > 0$). Were they complements ($d < 0$), each firm
would also want to pretend that its cost was high. To
understand why, suppose that firm 2 believes that firm 1's
cost is high. Then firm 2 believes that firm 1 will charge
a high price, reducing the demand facing firm 2. Firm 2 is
then led to reduce its price, which raises firm 1's demand.
With demand complements, charging a low price is like
supplying a public good; each firm claims that it will not
supply the good in order to force its rival to do so.¹⁰,¹¹

• Information about cost versus information about demand.
We assumed that the firms had private information about
their costs. They can also have private information about
demand. In the above example, they may not know the
intercept $a$, for instance, but they may receive private
information about it: $a_i = a + \varepsilon_i$, where $\varepsilon_i$ is a zero-mean
error around the unknown true value $a$. In this case it is
easy to see that each firm will want to pretend that the
demand is high (i.e., to announce that $a_i$ is high). Doing so
gives one's rivals an incentive to raise their prices.¹²

• Price competition versus quantity competition. As in chapter 8,
the results are usually reversed if price competition
is replaced with quantity competition (i.e., "capacity"
competition).¹³ For instance, a firm wants to demonstrate
to its rivals that it has a low cost. This signals high
production, which forces the rivals to curtail their own
production.


9.2 Dynamics: A Heuristic Approach 


In the remainder of this chapter, direct disclosure of 
private information is ruled out. Firms try to manipulate 
their rivals’ knowledge indirectly, by adopting a market 
behavior that differs from the optimal behavior under 
symmetric information. To focus this discussion, let us 


8. Information sharing in oligopoly has been studied by Clarke (1983),
Fried (1984), Gal-Or (1985, 1986), Li (1985), Nalebuff and Zeckhauser (1985),
Novshek and Sonnenschein (1982), Ponssard (1979), Shapiro (1986), and Vives
(1984).

9. For extensive surveys of the literature on auctions, see McAfee and
McMillan 1987 and Milgrom 1987a. For a simple introduction to the main
[...] case of two types per bidder, see Maskin and Riley 1985. For an
[...] solution of equilibrium in a first-bid auction, see example 3 in
section 4 of the Game Theory User's Manual.

10. In the terminology of chapter 8, goods are strategic substitutes, and firm
2's low price is equivalent to softness.

11. In the rest of this chapter, it will be assumed that the goods are substitutes
and strategic complements.

12. In the case of uncertainty about the slope $b$, the firms would like to pretend
that $b$ is low (i.e., that demand is high). Similarly, they would like to pretend
that $d$ is high (i.e., that demand is high). See equation 9.4.

13. As long as prices are strategic complements and quantities are strategic
substitutes.
suppose that firms compete in prices over two periods,
and that private information pertains to own cost. If entry
or exit is not an issue, a firm wants to appear unaggressive
so as to induce its rivals to raise their prices. Thus, in
the first period, it charges a high price to signal that it has
high costs.¹⁴ Accommodation therefore calls for the "puppy
dog" strategy, in the language of chapter 8. The outcome
is a more collusive one than under symmetric information.
Section 9.3 informally reviews arguments along these
lines.

To deter the entry or induce the exit of its rival, a firm
adopts the more aggressive "top dog" strategy. Indeed,
this firm wants to signal a low cost so as to induce its rival
to doubt its viability in the market, so the firm charges a
low price rather than a high one. This is the basis for the
limit-pricing model of section 9.4. Two important variations
on that model are given in sections 9.5 and 9.6. In
the limit-pricing story, a low price is meant to dissuade
the entry or induce the exit of a rival. When mergers are
feasible, it may also help a firm to buy out a rival cheaply,
because a rival who has been convinced that the market
is not very profitable is prepared to sell out at a low
merger price (see section 9.5). Also, predation may not
be primarily geared toward inducing the exit of a current
rival. It may, rather, signal to prospective entrants
that entry is not profitable. For instance, a firm that has
monopoly power on several geographical markets may
prey on a rival that enters one of its markets not because
such predation raises its profit on this market, but because
it deters entry by the same or other competitors in other
markets (see section 9.6).


about its cost (or about the demand), the same principles
apply as in the signaling framework, except that firm 1's
signal is garbled by the noise in the demand function.
However, in the case of price secrecy firm 1 need not
have private information to try to manipulate firm 2's
information. Suppose, for instance, that demand is uncertain
and correlated over time. Firm 2's demand then
depends on two unobserved variables: demand uncertainty
and firm 1's price. Firm 2 faces a "signal-extraction problem,"
and is unable to fully recover the demand parameter.
As we will see, a low price for firm 1 may be mistaken
by firm 2 for a low state of demand, because both reduce
firm 2's demand. Firm 1 then has an incentive to jam firm
2's inference process. Fortunately, the techniques and the
intuitions for the signaling model and the signal-jamming
model tend to be very similar.¹⁵


9.3 Accommodation and Tacit Collusion¹⁶


The idea that a firm may raise its price to signal high cost
and soften the behavior of its rivals was developed by
Ortega-Reichert. In chapter 8 of his 1967 thesis, he considered
a duopoly model of two consecutive first-price
auctions. (Think of auctions organized by government
agencies for oil leases, medicines, or electric power equipment.)
Two firms ($i = 1, 2$) draw their costs $c_i$ of supplying
one unit of the good from a common cumulative distribution
$F(c_i \mid \lambda)$, where $\lambda$ is an unknown parameter of the
distribution (itself drawn from a distribution $G(\lambda)$). The
costs are drawn independently between firms and between
periods, so the results do not depend on serial
correlation. Demand is common knowledge. The lowest-price
bidder gets to supply one unit at the price he bids,
and the other bidder does not sell; and this is true in each
of two periods, A and B. If the first-period costs drawn, $c_1^A$
and $c_2^A$, were common knowledge, the two firms could


14. The reason why such a strategy signals cost is that raising one's price is
marginally less costly to a high-cost firm than to a low-cost firm. Mathematically,

$$\frac{\partial}{\partial c_i}\left\{\frac{\partial}{\partial p_i}\left[(p_i - c_i)D_i(p_i, p_{-i})\right]\right\} > 0,$$

where $D_i(\cdot)$ is the residual demand curve and $p_{-i}$ denotes the price(s) of the
rival(s). Readers who are familiar with the Spencian signaling model (see Game
Theory User's Manual, supplementary section) will note that this is nothing but
the usual sorting condition for signaling games.
15. Readers who are knowledgeable in dynamic games of incomplete information
(see Game Theory User's Manual) may, however, note that signal-jamming
models tend to have fewer equilibria. (There usually are fewer out-of-equilibrium
beliefs and, therefore, less leeway in specifying firm 2's perception of the
unknown parameter.) See Milgrom 1987b for some relationships between
hidden-action and hidden-information models.

16. This section, though informal, will be difficult for readers who are not
familiar with games of incomplete information. Section 5 of the Game Theory
User's Manual should be helpful for those readers.
use them to update their beliefs about $\lambda$. However, it is
assumed that firm $j$ observes only $c_j^A$ and its rival's first-period
price, $p_i^A$. Thus, in the first period (A), firm $i$, by
raising its price above what would be optimal for the


i convinces firm j that firm i’s second-period 
likely to be high. This information, by the 


to raise its own price by $\nu$ dollars, where $\nu$ is the coefficient
of conjectural variations. As we saw in section 6.2, such
beliefs are not rational in a static model: Because the rival
has no time to react, the only rational conjecture is $\nu = 0$.
In a dynamic model, however, a firm may induce its rival
to raise its price (in the future, not in the current period).

To see how this works, suppose that each firm's marginal
cost is 0. Firm $i$'s demand is

$$D_i = a - p_i + p_j.$$

The demand intercept $a$, which is unknown to both firms,
is distributed along the real line (for simplicity), with mean
$a^e$. Consider first a one-period model. Firm $i$ maximizes

$$E[(a - p_i + p_j)p_i] = (a^e - p_i + p_j)p_i$$

over $p_i$, which yields

$$p_i = (a^e + p_j)/2.$$

By symmetry, the static Bertrand equilibrium is

$$p_1 = p_2 = a^e.$$


Now consider a two-period version of this model. The 
intercept a is the same in each period. Each firm observes 
only the realization of its demand, so its rival can secretly 


3) = Oa Tin 


iH. Then, because there constantly is new information, firms keep 
con charging high prices forever. 
19. See chapter 6 on tacit collusion. Bikhchandani (1986) supplies an example in
which repeated bidding leads the firm to build a reputation for tough rather
than soft bidding. He considers repeated second-price auctions with "common
values." Suppose, for instance, that two firms bid for oil leases. Each firm has
private information about the amount of oil in the tract. The value is common
in that one firm's information is of interest to its rival not only for the purpose
of predicting the first firm's behavior but also for the purpose of assessing the
value of the oil tracts. A well-known phenomenon for common values (see, e.g.,
Milgrom and Weber 1982) is the "winner's curse": The winning firm learns that
its rival, which submitted a lower bid, did not have a high valuation of the oil
tract, which is bad news. This leads firms to bid cautiously. The winner's curse
per se does not imply that a firm manipulates information in the first period. In
particular, if the valuations are distributed independently and identically and
there is no serial correlation, the bidders treat the two auctions as independent.
But assume now that firm 1 may or may not have an advantage over firm 2 in
drilling cost, and that this is private information to firm 1. In a single auction,
firm 2 bids less aggressively because of the possibility that firm 1 has a cost
advantage. The winner's curse is aggravated by the fact that if firm 2 wins and
firm 1 has a cost advantage, firm 1 must have had very pessimistic beliefs about
the amount of oil. This implies that in a repeated auction framework, firm 1
wants to develop a reputation for having a cost advantage (i.e., for being a
tough bidder). See Bikhchandani 1986 for more details. See Wilson 1985 for a
brief account of this and other reputation models.

20. See also Bulow et al. 1985, Fudenberg and Tirole 1986a, and Mailath
1985a,b. Riordan's model is actually phrased in terms of quantity competition;
however, as he notes, it also applies to price competition as long as one is
careful about the usual reversal of results between strategic substitutes and
complements.
cut price. (Technically, the model is one of signal jamming.)
In our symmetric equilibrium, each firm charges
price $\alpha$ in the first period. If the intercept is $a$, each firm $i$
learns $a$ perfectly by observing its first-period demand:

$$\hat a(p_j^A) = a + (p_j^A - \alpha). \quad (9.5)$$

formation, with intercept $\hat a(p_j^A)$, so it charges

$p_i^B = \hat a(p_j^A)$. Using equation 9.5, we obtain the coefficient


period profit with respect to $p_i^A$ is

$$p_i^B\,\frac{d\hat a}{dp_i^A} = p_i^B.$$

The expectation of this derivative with respect to $a$ is thus

$$a^e + \frac{p_i^A - \alpha}{2}.$$

In the first period, the expected present discounted profit
of firm $i$ is maximized. Thus, the sum of the first-period
marginal profit

$$\frac{d}{dp_i^A}\left[p_i^A(a^e - p_i^A + \alpha)\right]$$

and the (discounted) second-period marginal profit is 0:

$$a^e - 2p_i^A + \alpha + \delta\left(a^e + \frac{p_i^A - \alpha}{2}\right) = 0. \quad (9.6)$$

In equilibrium, p = a, which together with equation 9.6 
yields 


a=a*(1 +0). (9.7) 


Each firm charges a°(1 + 4) in the first period, rather than 
a® (which would, on average, be charged under complete 
information about a, and which would be charged under 
incomplete information in a single-period model).?? 
Mailath’s setup is closer than Riordan's to the Milgrom- 
Roberts model discussed in the next section, with observ- 
able prices and private information about cost. (The 
Mailath model is a signaling model.) The conclusions, 
however, are similar. This approach by no means justifies 
the static conjectural-variations approach, but it illustrates 
the positive effect of a firm’s price on its rival's price.²³ 
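
The first-period markup in equation 9.7 can be checked numerically. The following is a minimal sketch, assuming the linear demand q_i = a − p_i + p_j with zero marginal cost that underlies the reaction curves of note 21. The function names and the best-response iteration are illustrative choices made here, not part of Riordan's exposition.

```python
def best_response(alpha, a_e, delta):
    """Firm i's optimal first-period price when the rival expects it to charge alpha.

    Assumed payoffs (hypothetical reconstruction of the signal-jamming model):
      first period : p * (a_e - p + alpha)           (expected profit)
      second period: (a + (p - alpha) / 2) ** 2      (the rival charges a + p - alpha
                                                      and firm i best-responds)
    Taking expectations over a, the first-order condition is linear in p:
      a_e - 2p + alpha + delta * (a_e + (p - alpha) / 2) = 0
    """
    return (a_e * (1 + delta) + alpha * (1 - delta / 2)) / (2 - delta / 2)


def symmetric_first_period_price(a_e, delta, tol=1e-12, max_iter=10_000):
    """Iterate best responses until the conjectured price reproduces itself."""
    alpha = a_e  # start from the single-period price
    for _ in range(max_iter):
        nxt = best_response(alpha, a_e, delta)
        if abs(nxt - alpha) < tol:
            return nxt
        alpha = nxt
    raise RuntimeError("best-response iteration did not converge")


if __name__ == "__main__":
    a_e, delta = 10.0, 0.9
    print(symmetric_first_period_price(a_e, delta), a_e * (1 + delta))
```

The fixed point of the iteration coincides with a^e(1 + δ): the conjectured price α is an equilibrium exactly when it is a best response to itself.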


9.4 The Milgrom-Roberts Model of Limit Pricing 


It has often been alleged that an established firm can 
discourage entry by charging a low price. Bain’s (1949) 
concept of limit pricing was built around the idea that 
if there is a positive relationship between the pre-entry 


21. The reaction curves are $p_i^B = (a + p_j^B)/2$. By symmetry, $p_1^B = p_2^B = a$. 


22. Here the second-order condition is satisfied. When a has a compact 
support, the global second-order condition may be harder to satisfy. 
23. Under quantity competition, the conclusions are naturally reversed. For 
instance, in the Mailath model an increase in quantity signals a low cost and 
therefore a high quantity tomorrow. This induces the rival to curtail output 
tomorrow. As Riordan notes, one then gets negative conjectural variations. 
price and the speed or degree of entry, the established 
firm indeed has an incentive to cut its price.²⁴ 

Although Bain’s view prevailed for thirty years, many 
economists felt uncomfortable about applying it to anti- 
trust analysis. To condemn a firm for making consumers 


possibility is that the price of the established firm 
has commitment value. That is, the entrants expect the 
pre-entry price to prevail after entry. However, such a 
theory is not very convincing. Entry into many markets is 
a decision that covers a period of many months or years, 
whereas a price can often be changed within a few days 
or weeks. Consequently, any loss that a potential entrant 
may suffer from a low pre-entry price is negligible. Prices 
per se have only a short commitment value (Friedman 
1979). Now, it may be the case that a low pre-entry price 
is associated with a high pre-entry capacity. The capacity 
may have a higher commitment value than the attached 
price. The source of the entry deterrence is then the 
incumbent's capacity rather than its price (see chapter 8). 

In this section we will look at industries in which 
capacities do not necessarily have commitment value. As 
we will see, the incumbent firm may nevertheless lower 
its price when facing the threat of entry. The paper that 
is crucial to the reexamination of the doctrine of limit 
pricing (and, more generally, in drawing our attention to 
the implications of asymmetric information for antitrust 
analysis) is Milgrom and Roberts 1982a. A simplified 
version of the Milgrom-Roberts two-period model is given 
in the following subsection.²⁵ 


9.4.1 A Model 


There are two periods and two firms. Firm 1, the incumbent, 
is a monopoly at date 1 and chooses a first-period 
price $p_1$. Firm 2, the entrant, then decides to enter or stay 
out in the second period. If it enters, there is duopolistic 
competition in period 2. Otherwise, firm 1 remains a 
monopoly. 


As in the static model of section 9.1, firm 1's cost can 
be low (with probability x) or high (with probability 
1 − x). Let $M_1^t(p_1)$ denote the incumbent's monopoly 
profit when it charges $p_1$, where t = L or H (indicating 
low or high cost).²⁶ Let $p_m^L$ and $p_m^H$, respectively, denote 
the monopoly prices charged by the incumbent when its 
cost is low and when it is high. We know from chapter 1 
that $p_m^L < p_m^H$. Let $M_1^L$ and $M_1^H$ denote the incumbent's 
profit when it maximizes its short-run profit, depending 
on its type ($M_1^t = M_1^t(p_m^t)$). Assume that $M_1^t(p_1)$ is 
strictly concave in $p_1$. 

Firm 1 knows its cost from the start. Firm 2 does not 
know firm 1's cost. For simplicity (and following Milgrom 
and Roberts), assume that firm 2 learns firm 1's cost 
immediately after entering if it decides to enter; the 
second-period duopoly price competition, if any, is then 
independent of the price charged in period 1. Let $D_1^t$ and $D_2^t$ 
denote the two firms’ duopoly profits when firm 1 is 
of type t. ($D_2^t$ possibly includes entry costs.) To make 
things interesting, assume that firm 2's entry decision is 
influenced by its beliefs about firm 1's cost: 


$D_2^H > 0 > D_2^L$. (9.8) 


That is, under symmetric information, firm 2 would enter 
if and only if firm 1’s cost were high. The common 
discount factor is $\delta$. 

Because firm 1 prefers to be a monopoly ($M_1^t > D_1^t$ 
for t = L, H), that firm clearly wants to convey the 
information that it has low cost. The problem is that it has no 
direct means of doing so, even if it indeed has a low cost. 
The indirect way is to signal by charging a low price, $p_1^L$. 
In our example, firm 1 may want to charge $p_1^L$ even if it 
has a high cost. The loss in first-period monopoly profit 
may be offset by the second-period gain from remaining 
a monopoly. Does this mean that the potential entrant 
will stay out when observing $p_1^L$? It is not clear. A rational 
entrant, knowing that it is in the existing firm's 
self-interest to “lie” in this manner, will not necessarily infer 
that the incumbent has a low cost. In turn, the incumbent 
knows that the entrant knows about this incentive, and so 
on. As Milgrom and Roberts show, the correct way to 
analyze this dynamic game of incomplete information 


24. For a model positing an exogenous relationship between pre-entry price 
and entry, see Gaskins 1971. 


25. The analysis here follows section 6 of Fudenberg and Tirole 1986a. 
26. Thus $M_1^t(p_1) = (p_1 - c_1^t)\,D_1^m(p_1)$, where $D_1^m(\cdot)$ is the monopoly demand. 

is to solve for a “perfect Bayesian equilibrium” (see the 
Game Theory User's Manual). 

There are two kinds of potential equilibria in such a 
model.²⁷ In a separating equilibrium, the incumbent does 


probability x on the cost’s being low).²⁸ 


conditions: that the low-cost type does not 
pick the high-cost type’s equilibrium price, and 


rium 


would get at worst $M_1^L + \delta D_1^L$ (at worst, $p_m^L$ induces 
entry) and because in equilibrium he gets $M_1^L(p_1^L) + \delta M_1^L$, 
we must have 

$M_1^L - M_1^L(p_1^L) \le \delta(M_1^L - D_1^L)$. (9.10) 


To make things interesting, assume that there is no 
separating equilibrium in which each type behaves as in a 
full-information context; i.e., the high-cost type would 
wish to pool if $p_1^L$ were equal to $p_m^L$: 

$M_1^H - M_1^H(p_m^L) < \delta(M_1^H - D_1^H)$. (9.11) 


To characterize the set of $p_1^L$ satisfying equations 9.9 
and 9.10, we must make more specific assumptions on the 
demand and cost structures. Subsection 9.4.1.1 (which 
ought to be skipped in a first reading) does so. For our 
purpose it is sufficient to note that, under reasonable 
conditions, equations 9.9 and 9.10 define an interval 
$[\tilde{p}_1, \tilde{\tilde{p}}_1]$, where $\tilde{\tilde{p}}_1 < p_m^L$. Thus, to separate, the low-cost 
type must charge a price sufficiently below his monopoly 
price so as to make pooling very costly to the high-cost 
type. 
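
To make the interval of separating prices concrete, here is a minimal sketch under assumptions that are not in the text: monopoly demand is D(p) = 1 − p, so that M(p) = (p − c)(1 − p), and the duopoly profits are supplied as hypothetical numbers. Equation 9.9 is taken to be the high-cost type's no-mimicking condition, M^H − M^H(p) ≥ δ(M^H − D_1^H), and equation 9.10 is the low-cost type's condition stated above.

```python
import math


def monopoly_price(c):
    """Monopoly price for the assumed demand D(p) = 1 - p and unit cost c."""
    return (1.0 + c) / 2.0


def monopoly_profit(p, c):
    """Profit (p - c) * (1 - p) under the assumed linear demand."""
    return (p - c) * (1.0 - p)


def separating_interval(c_low, c_high, d1_low, d1_high, delta):
    """Prices at or below the low-cost monopoly price satisfying 9.9 and 9.10.

    With this demand, the forgone profit M^t - M^t(p) equals (p - p_m^t) ** 2,
    so each condition reduces to a bound on the distance from p_m^t.
    Assumes monopoly profit exceeds duopoly profit for both types.
    """
    pm_low, pm_high = monopoly_price(c_low), monopoly_price(c_high)
    gain_low = monopoly_profit(pm_low, c_low) - d1_low      # M^L - D_1^L
    gain_high = monopoly_profit(pm_high, c_high) - d1_high  # M^H - D_1^H
    lower = pm_low - math.sqrt(delta * gain_low)             # 9.10 binds here
    upper = min(pm_low, pm_high - math.sqrt(delta * gain_high))  # 9.9 binds here
    return lower, upper


if __name__ == "__main__":
    # Hypothetical parameter values chosen only for illustration.
    print(separating_interval(c_low=0.0, c_high=0.2,
                              d1_low=0.09, d1_high=0.04, delta=0.9))
```

In this parameterization the upper end of the interval lies strictly below the low-cost monopoly price, which is the case singled out by condition 9.11.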


9.4.1.1 Derivation of the Interval of Separating Prices 


The reason why it is more costly for the high-cost type 
to charge a low price is the so-called sorting or single- 
crossing condition (see the Game Theory User's Manual): 
$\dfrac{\partial [M_1^H(p_1) - M_1^L(p_1)]}{\partial p_1} > 0$. 

As we saw in section 9.2, this condition is satisfied here, 
because 

$\dfrac{\partial^2 [(p_1 - c_1) D_1^m(p_1)]}{\partial p_1\, \partial c_1} = -\dfrac{dD_1^m}{dp_1} > 0$. 


This condition ensures that the curves 

$y = M_1^L - M_1^L(p_1^L)$ 

and 

$y = M_1^H - M_1^H(p_1^L)$ 

cross once at most in the $\{p_1^L, y\}$ space. Next, under 
some assumptions, the lower the incumbent's cost, the 
more he benefits from remaining a monopolist. Letting 
$D_1(p_1, p_2)$ denote the duopoly demand for firm 1 (notice 
that $D_1(p_1, +\infty) = D_1^m(p_1)$), letting $M_1(c_1)$ and $D_1(c_1)$ 
denote the monopoly and duopoly profits for cost $c_1$, 
letting $p_1^d$ and $p_2^d$ denote the equilibrium duopoly prices 

27. Again, there may exist a third kind, in which the incumbent uses a mixed 
strategy. 


28. Bagwell (1985) analyzes whether the incumbent can signal its cost through 
alternate means (e.g., whether advertising can “crowd out” pricing as a 
signaling instrument). He finds that wasteful advertising is not used in 
equilibrium: Imposing a cost of $1 for the high type to mimic the low type costs the 
low type less than $1 when it lowers its price, but exactly $1 when it advertises. 
For further elaborations (in particular, where advertising is not wasteful), see 
Bagwell and Ramey 1987. 
(functions of $c_1$), and using the envelope theorem, we 
have 

$\dfrac{d[M_1(c_1) - D_1(c_1)]}{dc_1} = \dfrac{d}{dc_1}\Bigl(\max_{p_1}\,[(p_1 - c_1)\,D_1^m(p_1)] - \max_{p_1}\,[(p_1 - c_1)\,D_1(p_1, p_2^d)]\Bigr)$ 

$= -D_1^m(p_m) + D_1(p_1^d, p_2^d) - (p_1^d - c_1)\,\dfrac{\partial D_1}{\partial p_2}\,\dfrac{\partial p_2^d}{\partial c_1}$. 


The third term in this equality is presumably negative 
(because $p_1^d - c_1 > 0$, $\partial D_1/\partial p_2 > 0$, and, under mild 
conditions,²⁹ $\partial p_2^d/\partial c_1 > 0$). Thus, if the monopoly 
demand for firm 1 exceeds its duopoly demand, $M_1 - D_1$ 
decreases with $c_1$ and therefore 


$M_1^L - D_1^L > M_1^H - D_1^H$. 


This is the case for the unit-demand location model solved 
in chapter 7, where the monopoly demand was equal to 
the market size (i.e., was the maximum possible demand). 

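Using these derivations, we can obtain the interval 
$[\tilde{p}_1, \tilde{\tilde{p}}_1]$ from figure 9.2. $\tilde{\tilde{p}}_1$ is such that equation 9.9 is 
satisfied with equality; it is called the least-cost separating 
price because, of all potential separating prices, the 
low-cost type would prefer the one at $\tilde{\tilde{p}}_1$ (the closest to $p_m^L$). 

[Figure 9.2: the curves $M_1^L - M_1^L(p_1^L)$ and $M_1^H - M_1^H(p_1^L)$ plotted against $p_1^L$, together with the levels $\delta(M_1^L - D_1^L)$ and $\delta(M_1^H - D_1^H)$; the horizontal axis marks $\tilde{p}_1$, $\tilde{\tilde{p}}_1$, $p_m^L$, and $p_m^H$.] 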


9.4.1.2 Analysis of a Separating Equilibrium 


The necessary conditions studied in subsection 9.4.1.1 are 
also sufficient. Let the high-cost type choose $p_m^H$ and let 
the low-cost type choose $p_1^L$ in $[\tilde{p}_1, \tilde{\tilde{p}}_1]$. When a price 
that differs from these two prices is observed, beliefs are 
arbitrary (in an unexpected event, Bayes’ rule does not 
pin down firm 2's posterior beliefs). The easiest way to 
obtain equilibrium is to choose beliefs that induce entry 
(this way, the two types will be little tempted to deviate 
from their presumed equilibrium strategies), so let us 
specify that when $p_1$ does not belong to $\{p_m^H, p_1^L\}$ the 
posterior beliefs $x'$ are equal to 0 (firm 2 believes that firm 
1 has a high cost).³⁰ Now, let us check that no type wants 
to deviate. The high-cost type obtains his monopoly 
profit in the first period and thus is not willing to deviate 
to another price that induces entry. Nor (according to 
equation 9.9) does he deviate to $p_1^L$. And similarly for the 
low-cost type. Thus, we have obtained a continuum of 
separating equilibria.³¹ 

It is generally felt that only one of these separating 
equilibria, the least-cost one, is “reasonable.” Suppose 
that firm 1 charges $p_1$ in $[\tilde{p}_1, \tilde{\tilde{p}}_1)$. Such a price is dominated 
for the high-cost type; that is, the high-cost type is always 
better off charging $p_m^H$ rather than $p_1$, whatever its 
expectations about the effect of price on entry. At worst, 
$p_m^H$ triggers entry, and at best $p_1$ deters entry. Even if 
$p_1$ deters entry, the high-cost type makes more profit 
charging $p_m^H$ on the basis of equation 9.9 and the 
concavity of the profit function: 


$M_1^H + \delta D_1^H > M_1^H(p_1) + \delta M_1^H$. 


Thus, firm 2, when observing $p_1$, should not believe that 
firm 1 has a high cost. That is, $p_1$ should deter entry. This 
implies that the low-cost firm, to signal its type and deter 
entry, need not charge less than $\tilde{\tilde{p}}_1$. 


29. See chapter 8. 
30. Whether these beliefs (which are consistent with Bayes’ rule) are 
“reasonable” will be discussed below. 


31. This continuum of separating equilibria exists for any x < 1. For x = 1, 
however, the low-cost firm plays its monopoly price, $p_m^L$. Thus, a tiny change in 
the information structure may make a huge difference. A very small probability 
that the firm has a high cost may force the low-cost firm to lower its price 
discontinuously in order to signal its type. Games of incomplete information 
(which include games of complete information) are very sensitive to the 
specification of the information structure. 
To summarize this rather long analysis: There exists a 


entry is not affected. First-period welfare is generally 
increased because the low-cost type reduces its 
price. 


Though it would be premature to conclude that limit 
pricing always increases welfare (see below), such a 
separating equilibrium shatters any illusion that low pre-entry 
prices obviously reduce welfare. 


9.4.1.3 Pooling Equilibria 


The existence of pooling equilibria hinges on whether the 


condition 


$x D_2^L + (1 - x) D_2^H < 0$ (9.12) 


beliefs are the same as prior beliefs). This means that entry 
is not deterred, so that the two types cannot do better 
than to choose their static monopoly prices. As these 
prices differ, no pooling equilibrium can exist. 

Assume, therefore, that condition 9.12 is satisfied, so 
that a pooling price $p_1$ deters entry. A necessary 
condition for a price $p_1$ to be a pooling-equilibrium price is 
that none of the types wants to play its monopoly price. 
If one of them were to do so, it would at worst allow 
entry. Therefore, $p_1$ must satisfy condition 9.10 and the 
analogous condition for the high-cost type: 


$M_1^H - M_1^H(p_1) \le \delta(M_1^H - D_1^H)$. (9.13) 


Again, the set of prices $p_1$ that satisfy both condition 
9.10 and condition 9.13 depends on the cost and demand 
functions. Let us simply note that, from condition 9.11, 
there exists an interval of prices around $p_m^L$ that satisfy 
these two inequalities. 

Now it is easy to see that if $p_1$ satisfies conditions 9.10 
and 9.13, $p_1$ can be made part of a pooling equilibrium. 
Suppose that whenever firm 1 plays a price differing from 
$p_1$ (an off-the-equilibrium-path price), firm 2 believes that 
firm 1 has a high cost. Firm 2 then enters, and firm 1 
might as well play its monopoly price. Thus, from 
conditions 9.10 and 9.13, none of the types would want to 
deviate from $p_1$.³² 
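
The pooling conditions can be illustrated in the same way. This is a sketch only, assuming monopoly demand D(p) = 1 − p (so the forgone profit of type t at price p is (p − p_m^t)²) and hypothetical duopoly profits; the function name and parameter names are introduced here for illustration.

```python
import math


def pooling_price_range(c_low, c_high, d1_low, d1_high,
                        d2_low, d2_high, x, delta):
    """Range of prices satisfying conditions 9.10 and 9.13, or None.

    Returns None when condition 9.12 fails (entry is not deterred under
    prior beliefs) or when the two types' acceptable ranges do not overlap.
    Assumes linear demand D(p) = 1 - p and monopoly profit above duopoly profit.
    """
    # Condition 9.12: the entrant's expected profit under prior beliefs is negative.
    if x * d2_low + (1.0 - x) * d2_high >= 0.0:
        return None
    ranges = []
    for cost, d1 in ((c_low, d1_low), (c_high, d1_high)):
        p_m = (1.0 + cost) / 2.0                   # monopoly price of this type
        gain = ((1.0 - cost) / 2.0) ** 2 - d1      # M^t - D_1^t
        half_width = math.sqrt(delta * gain)       # (p - p_m)^2 <= delta * gain
        ranges.append((p_m - half_width, p_m + half_width))
    lower = max(r[0] for r in ranges)
    upper = min(r[1] for r in ranges)
    return (lower, upper) if lower <= upper else None


if __name__ == "__main__":
    # Hypothetical numbers; d2_low < 0 < d2_high mirrors assumption 9.8.
    print(pooling_price_range(0.0, 0.2, 0.09, 0.04, -0.02, 0.05, x=0.8, delta=0.9))
    print(pooling_price_range(0.0, 0.2, 0.09, 0.04, -0.02, 0.05, x=0.5, delta=0.9))
```

With a high prior probability of low cost the range contains the low-cost monopoly price; with a lower prior, condition 9.12 fails and no pooling price is returned.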

Let us consider the pooling equilibrium in which both 
types charge the low-cost monopoly price $p_m^L$. It is easy to 
see that some of our conclusions for the separating 
equilibrium are reversed: 


Conclusion 1' The incumbent manipulates its price in a 
way that does not reveal cost information. There is less 
entry than under symmetric information (entry is always 
deterred, and not only with probability x). 


Conclusion 2' The low-cost type charges its monopoly 
price. The high-cost type engages in limit pricing to deter 


entry. 


Conclusion 3’ The consequences of asymmetric informa- 
tion for welfare are ambiguous. First-period welfare is, in 
general, increased, because the high-cost type lowers its 
price. But there is less entry, which, in general, lowers 
second-period welfare. 


32. We will not consider here how restrictions on beliefs can eliminate some 
(or, possibly, all) equilibria. (One way to eliminate equilibria is to use the 
Cho-Kreps criterion; see subsection 11.6.2.) For more detail in the context of 
the limit-pricing game, see Cho 1986. 
9.4.2 Discussion 


The Milgrom-Roberts model shows that limit pricing 


ent charges more than the monopoly price. 
incumbent is unsure whether he will deter or 
accommodate entry (for instance, because he does not 
know the entry cost), it is not clear whether he will charge 
more or less than the monopoly price. Third, the previous 
model assumed that the incumbent's cost was unrelated 
to the entrant’s. Consider the other polar case, in which 
the entrant’s cost $c_2$ is equal to the incumbent's cost $c_1$ 
but the entrant does not know it before entering. Firm 
2's duopoly profit is then a function $D_2(c_1)$. For many 
specifications of second-period competition, $D_2$ is a 
decreasing function of $c_1$. (A symmetric decline in cost tends 
to increase both firms’ profits.) Thus, to deter entry, the 
incumbent must signal a high cost and therefore charge 
a price above the monopoly price in the first period 
(Harrington 1985). 

As in chapter 8, the principles established for the deter- 
rence of entry also apply to the inducement of exit ( preda- 


in order to convey that it has bad news about 
the entrant’s future profitability.³³ Thus, the model of 


firms compete in period 1, and that at the end of period 1 
firm 2 (still called the “entrant”) decides whether to exit 
on the basis of its observation of firm 1's price. (The only 
difference with the limit-pricing model is that the entrant 
is in at date 1.) In a separating equilibrium, the entrant 
is not fooled, and lower prices improve the first-period 
welfare. We already noticed that welfare may go down if 
the equilibrium is a pooling one; here we have another 
reason to be circumspect about the benefits of limit pric- 
ing. Introduce another period (period 0), in which the 
entrant decides whether or not to sink an entry cost. Even 
if the equilibrium is separating, so that the entrant will not 
be fooled in period 1, the incumbent's price is lower than 
under symmetric information, which reduces the entrant's 
expected profit. Thus, in period 0, the prospective entrant 
is less likely to enter.³⁴ Predation may be ex ante 
detrimental, even though it is welfare improving when it takes 
place (in period 1). 

More generally, the Milgrom-Roberts model and its 
variants offer a framework for discussing tests of 
predatory behavior.³⁵ As in the case of “barriers to entry,” it is 
difficult to come up with a satisfactory definition of “pre- 
dation.” The intuitive notion refers to low prices (or high 
advertising levels) that are meant to induce a rival's exit 
so as to assure the remaining firm of higher profits in the 
future. As in the taxonomy of chapter 8, we can decom- 
pose the effect of a predator's action into two compo- 
nents. The first indicates how this action directly affects 
the predator's profit, if we take the rival's behavior (in 
particular, the exit decision) as fixed. The second reflects 
the influence of this action on the rival's behavior (a 
strategic effect). The predator, for instance, realizes that 
a low price may depress the rival's assessment of profit- 
ability in this market and may thus induce exit. One may 
define predation as the reflection of this strategic effect on 
the predator's behavior. These theoretical considerations, 
however, leave two issues open. The first is semantic: As 
we have seen, predation in the above sense is not neces- 
sarily socially harmful. As the word usually carries a nega- 
tive connotation, this raises the issue of whether we want 
the incumbent's strategy and is not fooled. An example of signal-jamming 
predation is given in example 4 in section 11.5. 


34. The entrant stays out for a larger range of entry costs. 
35. See Ordover and Saloner 1987 for a good discussion of these tests. 
to use it in cases where the strategic behavior increases 
welfare. The second issue is operational: As in chapter 8, 
it may be hard to disentangle the two components of the 
effect of the action on the predator's profit. A low price 
or a high level of advertising may simply be a 
competitive (“innocent”) behavior intended to maximize profit 


st may not be related to predatory pricing. As 
Klevorick (1979, pp. 219-220) have insisted, 
ortant is the trade-off between the sacrifice of 


ator's current sacrifice of profits. 

Although predatory behavior can be demonstrated 
within an abstract model, deriving empirical tests for anti- 
trust analysis is difficult indeed. How can one measure 
predatory intent if the predatory price bears no 
systematic relation to cost? 


One possibility is to look at the incumbent's 
intertemporal price path. For instance, suppose that the 
incumbent firm cuts its price when entry occurs. This might 
indicate predatory behavior. However, the incumbent's 
residual demand curve falls when a competitor enters, 
which may call for a lower price independent of predatory 
intent (Williamson 1977). Joskow and Klevorick call this 
labeling of a truly competitive price cut as predatory a 
“type-I error.” Conversely, suppose that the incumbent 
does not lower his price when entry occurs. This does 
not mean that the incumbent does not engage in anti- 
competitive behavior. The incumbent may have been 
practicing limit pricing (charging a low price) before 
entry, and—having been unsuccessful in deterring entry 
—may have felt no need to cut his price further. Thus, 
the time-series analysis of the incumbent's price may lead 
to a “type-II error,” in which predatory behavior goes 
undetected. More generally, any test will have to face 
these two types of errors. 

One drawback of games of incomplete information³⁶ 
is the multiplicity of equilibria. In the limit-pricing game, 
what should firm 2 believe when it observes a price $p_1$ 
that is not an equilibrium price? Because such an event 
has zero probability, Bayes’ rule cannot be applied. In 
contrast with the well-defined beliefs that characterize 
equilibrium strategies, any posterior beliefs are consistent 
with Bayesian updating.³⁷ Now, this engenders a large 
multiplicity of beliefs. Suppose that we do not want $p_1$ 
to be chosen by some type of firm on the equilibrium 
path. If observing $p_1$ is indeed an off-the-equilibrium-path 
event, we can choose posterior beliefs $x'(p_1)$ following $p_1$ 
as we like; so let us assume that $x'(p_1)$ is very unfavorable 
to the incumbent. (Here $x'(p_1) = 0$, so $p_1$ triggers entry.) 
Then, depending on $p_1$, it is quite possible that no type 
wants to choose $p_1$. This fulfills our earlier assumption 
that $p_1$ was off the equilibrium path. One way to reduce 
the multiplicity—a way that has been explored intensely 
in game theory in recent years—is to impose some 
“reasonable” restrictions on beliefs following a zero- 
probability event. (We used such a restriction earlier to 


36. See Milgrom and Roberts 1987 for a fuller discussion of various limits of 
this approach. 


37. Suppose that the low type chooses $p_1$ with probability $\alpha^L$ and that the high 
type does so with probability $\alpha^H$. The posterior $x'$ is obtained 
through the formula 

$x'[x\alpha^L + (1 - x)\alpha^H] = x\alpha^L$. 

For a positive-probability event ($\alpha^L + \alpha^H > 0$), $x'$ is uniquely defined. For a 
zero-probability event ($\alpha^L = \alpha^H = 0$), any $x'$ in [0, 1] is admissible. 
single out a unique equilibrium among separating 
equilibria; for further “refinements,” see the Game Theory User's 
Manual and the references therein.) Another route, taken 
by Matthews and Mirman (1983) and Saloner (1981), is 
to assume that the incumbent's first-period price³⁸ is 
observed by the entrant with some noise. For instance, the 


some history-dependent threshold. The entrant may want 
to wait to obtain more precise information, and therefore 
entry is in general delayed relative to the full-information 
case. However, as in the Milgrom-Roberts model, the 
limit-pricing behavior does not bias the delay in a clear 
direction, as it is anticipated by the entrant. (The delay is 
due to the noise and the entrant's willingness to learn.) 


9.5 Predation for Merger 


The purpose of this section and the next is to apply the 
logic of section 9.4 to predatory behavior. As we saw in 
that section, the limit-pricing model can be reinterpreted 
as a predation model. However, several economists close 
to the Chicago School, namely McGee (1958, 1980), Telser 
(1966), and Bork (1978), have argued that it cannot be 
rational to engage in predatory pricing in order to induce 
exit. Their argument is that merging with the rival (if 
legal) is a superior way to realize monopoly power. Com- 


38. The authors cited actually use quantity rather than price. 




come. Hence, episodes of price cutting must be attributed 
to other, more innocent considerations, such as a fluctua- 
tion in demand or cost or a normal reaction to a decline 
in the residual demand curve due to entry. 

Before games of asymmetric information became stan- 
dard practice, Yamey (1972) criticized the above argu- 
ment on two grounds. First, the predator may face poten- 
tial entry in the market in question or in other markets in 
which it operates. If it does not fight in the market in 
question but prefers to take over its rival, it may be 
perceived as a weakling (for instance, a high-cost firm that 
is afraid to engage in a price war). This may encourage 
other firms to enter (possibly in the other markets), be- 
cause they anticipate that, rather than being preyed upon, 
they will make profits in those markets or else will be 
bought out at advantageous prices. Second, the buyout 
price of the prey may not be independent of the 
predator's pre-merger market behavior. Clearly, these two 
objections are linked to private information and reputation. 
Under complete information, predatory pricing would 
not affect the perceived profitability of the prey or of 
other potential entrants.³⁹ 

We will take up Yamey’s first argument in section 9.6. 
We will now examine his second argument in light of 
Saloner’s (1987) work. But before doing so, let us see 
how these two arguments can be motivated with exam- 
ples taken from the great merger wave of 1887-1904 (a 
time in which there were few impediments to horizontal 
mergers—see Scherer 1980, pp. 121-122). 

Between 1891 and 1906, the American Tobacco Com- 
pany and two affiliated corporations acquired 43 of their 
competitors. American Tobacco engaged in predatory pric- 
ing (sometimes of the signal-jamming variety—secretly 
controlled subsidiaries, known as “bogus independents,” 
secretly cut prices, so that the rivals would attribute the 
decline in their profits to intensified and enduring competi- 
tion or to declining demand). Burns (1986) offers evidence 
that predatory pricing considerably reduced American 
Tobacco's cost of acquiring its competitors (both those 
that were victims of predation and the competitors not 
preyed upon yet, who sold peacefully). Another famous 
case is that of the Standard Oil Company of New Jersey. 


39. “A bout of price warfare initiated by the aggressor, or a threat of such 
activity, might serve to cause the rival to revise its expectations, and hence alter 
its terms of sale to an acceptable level.” (Yamey 1972, p. 130) 
prices may not have followed predatory intents.) 

The rationale behind predation for mergers was recently 
formalized by Saloner (1987). To understand his insight, 
consider the following bare-bones model. Its structure 
is the limit-pricing model, except that after the entrant 
enters and before second-period price competition takes 
place the incumbent can make a merger offer to the 
entrant.⁴⁰ If the merger price is turned down, duopoly 
competition ensues in the second period, as was described 
above. If the merger is accepted, firm 1 acquires firm 
2's assets and technology and remains a monopoly. For 
simplicity, let us assume that the entrant learns the 
incumbent's cost after turning down the merger offer and 
before waging price competition. Let us alter equation 9.8 


0. (9.14) 


were not bought out. Let us also assume that a monopoly 
makes more profit than a duopoly.⁴¹ 


-cost type engages in limit pricing; it charges 
$p_1^L < p_m^L$, where $p_1^L$ is defined by 


$M_1^H - M_1^H(p_1^L) = \delta(D_2^H - D_2^L)$. (9.15) 


After observing $p_m^H$ the entrant accepts being bought out 
if and only if the merger price exceeds $D_2^H$; after observing 
$p_1^L$, he accepts merger prices above $D_2^L$. The incumbent 
offers $D_2^H$ following $p_m^H$ and $D_2^L$ following $p_1^L$. 
Equation 9.15 says that the high-cost type has no 
incentive to masquerade as the low-cost type. By doing so, it 
saves $D_2^H - D_2^L$ on the merger price, but it incurs a 
first-period profit loss of $M_1^H - M_1^H(p_1^L)$. 
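
Equation 9.15 pins down the limit price implicitly. The sketch below solves it by bisection for any profit function that is increasing below its maximizer; the linear-demand profit (p − 0.2)(1 − p) and the duopoly profits used in the example are hypothetical values, and `limit_price` is a name introduced here for illustration.

```python
import math


def limit_price(profit, p_monopoly, target, p_floor=0.0, tol=1e-10):
    """Price below p_monopoly at which the forgone profit equals target.

    profit     : the high-cost type's first-period profit function M^H(p),
                 assumed increasing on [p_floor, p_monopoly]
    p_monopoly : its maximizer p_m^H
    target     : delta * (D_2^H - D_2^L), the discounted saving on the merger price
    """
    full = profit(p_monopoly)

    def loss(p):
        return full - profit(p)  # M^H - M^H(p), decreasing in p below p_monopoly

    if target < 0 or loss(p_floor) < target:
        raise ValueError("target loss is not attainable on [p_floor, p_monopoly]")
    lo, hi = p_floor, p_monopoly  # invariant: loss(lo) >= target >= loss(hi)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if loss(mid) >= target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    delta, d2_high, d2_low = 0.9, 0.05, 0.01   # hypothetical values
    p = limit_price(lambda q: (q - 0.2) * (1.0 - q), 0.6, delta * (d2_high - d2_low))
    print(p, 0.6 - math.sqrt(delta * (d2_high - d2_low)))
```

For the quadratic profit used in the example the solution has the closed form p_m^H − sqrt(δ(D_2^H − D_2^L)), which the bisection reproduces; the numerical routine is only useful when no closed form is available.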

Would the low-cost type not find it profitable to charge 
its monopoly price? As in the limit-pricing game, assume 
that prices above $p_1^L$ lead to beliefs that the cost is high. 
By charging his monopoly price, the incumbent gains in 
the first period⁴² 


$M_1^L - M_1^L(p_1^L) < M_1^H - M_1^H(p_1^L)$. 


But he raises the merger price by $D_2^H - D_2^L$. Thus, the 
low-cost type does not charge its monopoly price (and, 
a fortiori, does not mimic the high-cost type), under the 
assumption that it wants to merge if it is perceived as 
having a high cost. For simplicity, we make that 
assumption. (For this, it suffices that the low-cost type gains 
more from a merger than the high-cost type. On this, 
see the discussion in subsection 9.4.1.1 of how the value 
of remaining a monopolist varies with the incumbent's 
cost. Saloner analyzes quantity competition in the second 
period, making the more satisfactory assumption that 
the incumbent's cost is not revealed to the entrant imme- 
diately before period-2 market competition and showing 
that under some conditions a merger is indeed more valu- 
able to a low-cost type.) As in the limit-pricing model, 
there exist pooling equilibria. 

An interesting feature of the separating equilibrium is 
that limit (predatory) pricing improves welfare relative to 
the symmetric-information case. The merger occurs in any 
case, and the only issue is the sell-out price. The incum- 
bent's behavior resembles that of an aggressive army that 
attacks only in order to be in a superior bargaining 
position at the armistice talks. Fortunately, the welfare 
consequences may be different. Here, asymmetric information 
and the concomitant impulse to signal bad news to the 
entrant improve welfare through lower prices.⁴³ 

In this model, firms always merge. In Saloner’s variant 
there is a third firm, which would be ready to enter if the 
merger were to occur and if it believed the incumbent to 


40. Saloner analyzes the slightly more complicated model with first-period 
competition between the two firms. The fact that the same effect arises with 
first-period competition is important because it more clearly addresses the 
question of eliminating an existing rival. 


41. Formally, let $\tilde{M}_1^t$ denote firm 1's monopoly profit when it owns firm 2's 
assets or technology, for t = L, H. ($\tilde{M}_1^t \ge M_1^t$; for perfect substitutes and if 
$c_2 > c_1^t$, $\tilde{M}_1^t = M_1^t$: firm 1 then only acquires the right to prevent its rival from 
producing.) The assumption can be written $D_1^t + D_2^t < \tilde{M}_1^t$. 


42. Recall that $M_1^H > M_1^H(p_m^L)$, that $p_1^L < p_m^L$, and that 
$\dfrac{\partial}{\partial p_1}[M_1^H(p_1) - M_1^L(p_1)] > 0$. 


43. Whether mergers should be allowed is not clear cut. A merger may 
have the advantage of eliminating an inefficient competitor. However, it badly 
affects the consumer surplus by raising the market price(s) in the second period. 
See Saloner 1987 for a comprehensive discussion of the welfare trade-offs in 
this kind of model. 
may want to cut its price and build a reputation for 
toughness in order to induce its rival in its market to stay 
out, to exit, or to sell out at a low price. Naturally, a 
multimarket incumbent may adopt the same strategy in 
one market to improve its position in another market. By 
a rough analogy, the second-period market competition 
of sections 9.4 and 9.5 can be reinterpreted as 
competition in a second market. For instance, suppose that an 
entrant enters (at some cost) into market 1. The 
incumbent, who is still a monopolist in market 2, may have an 
incentive to prey on the market-1 entrant to signal that 
his costs are low. Even if such a strategy does not induce 
exit (and therefore loses money) in market 1, it may 
prevent entry by another entrant (possibly the same firm) 
in market 2. In order for this strategy to be viable, the 
incumbent’s costs must be strongly positively correlated 
between markets. One may, for instance, think of markets 
as geographical units, and of the incumbent's good in 
each market as being the “same” and thus being produced 
at the same cost (except for transportation costs).⁴⁴ 

Let us consider a few alleged examples of such multimarket
predatory behavior. Recall first Burns’ (1986) demonstration
that American Tobacco succeeded in reducing
acquisition costs by preying on some rivals. One remarkable
finding is that the gains applied not only to the
acquisition of the prey but also to that of the competitors
who sold out before being preyed upon. This strategy was


much money on its overall product line (that is, 
it could practice discrimination to limit predatory losses). 


44. Posner (1979, pp. 939-940), following Yamey, argues that predation by a
multimarket firm is “a plausible policy for a profit-maximizing seller to follow.”
He suggests that such a strategy may have been adopted by Standard Oil


45. See Scherer 1980 (pp. 335-340) and Schmalensee 1979 for further
examples.


46. See Easterbrook 1981. This example is worthy of meditation in the light of
the incomplete-information approach.
Another example (Scherer 1980, p. 338) is that of General
Foods, the producer of Maxwell House Coffee, which
controlled approximately 45 percent of regular-coffee sales
in the eastern United States. A rival brand, Folger’s, popular
in the western states, tried to expand eastward in the
1970s. General Foods counterattacked with sharp price
decreases where Folger’s had entered, and succeeded in
stopping further entry into the northeastern states (though
it did not induce exit from the already-penetrated cities).45
In a less successful case of multimarket predatory behavior
in the 1970s, the Empire Gas Corporation, which sold
liquefied petroleum gas in many geographical markets
while most of its competitors sold in only one, tried to
signal from one market to another by charging less than
the wholesale price in some markets. This was meant to
induce the competitors to raise their prices well above
this wholesale price. Although this led to either exit or
higher prices in some markets, the rivals’ price increases
eventually collapsed; furthermore, even where rivals had
exited, other firms entered. (The barriers to entry were
limited.) Empire lost money in the whole process.46

Multimarket predation was first formalized by Kreps
and Wilson (1982) and Milgrom and Roberts (1982b).47
Elsewhere in this book we have applied these authors’
reputation model to the reputation for being a high-quality
producer (section 2.6) and to the reputation for cooperative
pricing behavior (section 6.5). There is clearly no
point to developing an argument that is basically the
same here,48 so their model of multimarket behavior will
simply be summarized.

Kreps, Milgrom, Roberts, and Wilson look at a simpli- 
fied version of the previous predation game, which they 
then extend to several markets. In a given market, the 
entrant decides to enter or not. In case of entry, the 
incumbent has two possible courses of action: predation 
or acquiescence. The entrant's profit (including the entry 
cost) is positive if the incumbent acquiesces, and negative 
if he preys. One can think of preying (acquiescing) as 


47. See also Easley et al. 1985. This line of research was stimulated by Selten’s 
(1978) chain-store paradox, according to which, under symmetric information, 
no predation occurs, no matter in how many markets the incumbent faces 
potential entry. 

48. A two-market version is studied as example 1 in section 5 of the Game
Theory User's Manual.
charging a low (high) price. The incumbent can be either
“strong” or “weak.” If he is strong, he enjoys preying and
he always preys. If he is weak, preying is costly and can
be worthwhile only if it raises profits in another market.
Only the incumbent knows whether he is weak or strong.
(Strength can be thought of as a rough approximation for
having a low production cost.)

The two-market version of the game is as follows: The
incumbent is in both markets. In the first stage, the market-1
entrant decides whether to enter; the incumbent then
decides whether to prey (if entry has occurred). In the
second stage, another entrant, who has observed the outcome
in market 1, decides whether to enter market 2; if
he enters, the incumbent decides whether to prey. (The
n-market version of the game is similar.) In such a context,
it may be worthwhile for a weak incumbent to prey if
entry occurs in market 1. If it were not worthwhile, predation
in market 1 would then reveal a strong incumbent;
thus, after predation, entry would not occur in market 2.
If the gain from monopolizing market 2 exceeds the predation
cost in market 1, it cannot be an equilibrium for the
weak incumbent not to prey in market 1. (Clearly, if entry
occurs in market 2, the weak incumbent then has no
incentive to prey, because there is no future, i.e., no
reputation to maintain.)
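The deviation argument in the two-market game can be checked with a few lines of arithmetic. The sketch below is only illustrative: the three per-market payoff levels (monopoly, duopoly under acquiescence, and profit while preying) and the function name are hypothetical, and discounting between the two stages is ignored. It tests whether "the weak incumbent never preys" can survive as an equilibrium.

```python
def weak_never_preys_is_equilibrium(monopoly, duopoly, preying):
    """Check the candidate equilibrium in which the weak incumbent acquiesces.

    Hypothetical per-market payoffs of the weak incumbent:
      monopoly : profit when no entrant is present
      duopoly  : profit when he acquiesces to entry
      preying  : profit when he preys (preying < duopoly, so preying is costly)
    In the candidate equilibrium, predation is read as proof of strength,
    so a deviation to predation in market 1 deters entry in market 2.
    """
    cost_in_market_1 = duopoly - preying    # what predation costs today
    gain_in_market_2 = monopoly - duopoly   # what deterrence is worth tomorrow
    # The candidate survives only if the deviation does not pay.
    return gain_in_market_2 <= cost_in_market_1


# Deterrence is worth 6, predation costs 3: acquiescing cannot be an equilibrium.
print(weak_never_preys_is_equilibrium(monopoly=10.0, duopoly=4.0, preying=1.0))
# Deterrence is worth 1, predation costs 3: acquiescing can be sustained.
print(weak_never_preys_is_equilibrium(monopoly=5.0, duopoly=4.0, preying=1.0))
```

The comparison is the one stated in the text: the candidate breaks down exactly when the gain from monopolizing market 2 exceeds the predation cost in market 1.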

In the T-market or T-stage version of the multimarket
predation model, the equilibrium has the following features
(if the probability of being strong is not too small):
In the first markets, entry does not occur. If a firm entered
by mistake, it would be preyed upon with probability 1
(i.e., by the strong and the weak types). Because the
number of markets to defend shrinks over time, concerns
about reputation lessen; this encourages entrants to enter,
possibly stochastically (the weak incumbent preys with
probability less than 1). Another point of this reputation story
is that, for T large, the probability that the incumbent is
strong can be very small and still create an incentive for
the weak type to prey. For a similar result, see exercise
29, 


9.7. The “Long Purse” Story 


In the theories of predatory pricing sketched in sections 
9.4 through 9.6, the predator tries to convey bad news to 
its rivals about their profitability in its market(s). The
predation does not affect the rivals’ real prospects, only 
the perception of these prospects. Under rational expecta- 
tions, predation may or may not succeed in driving (or 
keeping) rivals out of the market. 

An alternative and popular theory of predatory pricing 
holds that cutthroat competition may affect the rivals’ 
prospects, as the rivals cannot raise enough resources to 
carry on. According to this theory, a firm with substantial 
financial resources—the “long purse” or “deep pocket” 
—can prey on a weaker rival. Because the strong or big 
firm can sustain losses for a longer period of time, it 
can drive the weak or small firm out of the market.49 If
this theory is right, predatory pricing has serious con- 
sequences. First, predation is more likely to be successful 
than in the signaling model; therefore, it more often leads 
to the monopolization of the industry. Second, the re- 
maining firm need not be the most efficient one; rather, it 
is the strongest financially. To monopoly pricing may be
added cost inefficiency.

The “long purse” story lacked theoretical foundations, 
and it slowly fell from grace. Scherer writes that “per- 
haps because it enjoyed such frail intellectual support, the 
... theory received much less emphasis in conglomerate 
merger cases in the 1970s” (1980, p. 560). Telser’s influential
1966 article raised suspicion about the traditional
“long purse” story. In that article Telser assumed that the 
ability of one of the firms to raise capital is limited by 
some upper bound. By charging a low price, that firm's 
rival can impose a loss until the upper bound is reached 
and the financially constrained firm has to quit. But, as 
Telser notes, predatory pricing will not even occur in equi- 
librium. The financially constrained firm, realizing that it 
will be preyed upon and lose money, has an incentive to 
quit as early as possible (or not to enter).50


49. "An enterprise that is big in this sense obtains from its bigness a special
kind of power, based upon the fact that it can spend money in large amounts.
If such a concern finds itself matching expenditures or losses, dollar for dollar,
with a substantially smaller firm, the length of its purse assures it of victory. . . .
The large company is in a position to hurt without being hurt.” (Edwards 1955,
pp. 334-335) See also Scherer 1980, pp. 214-215, 335-336, 560.
50. Benoit’s (1983, 1984) game-theoretic treatment of Telser’s point shows that
the absence of observed predation is due to the complete-information structure
of the game. If the strong firm does not know whether its rival is financially
constrained, it may pay for the latter to bluff when it indeed is financially
constrained and pretend that it is not (such a strategy resembles that of the
reputation model of Kreps, Milgrom, Roberts, and Wilson, studied in section
9.6).
One problem with this approach is that it is not clear
why the prey faces a financial constraint. Suppose that,
instead of limiting their credit, the creditors give an infinite
credit line to the prey. Then there is no scope for
predation. The predator acquiesces, and the “prey” makes
profit each period. So it is in the interest of the creditors
not to impose financial constraints. Recognizing this,
Fudenberg and Tirole (1985, 1986b) argue that imperfections
in the capital market are central to a reconsideration
of the “long purse” story. Let us sketch how one may go
about formalizing these imperfections.

Let us first consider an entrepreneur (he will later be
interpreted as firm 2) who must finance a project through
debt. Let K denote the size of the investment, and let
E denote the entrepreneur's wealth (equity). Thus, the
entrepreneur borrows an amount $D = K - E$ from a bank.
The project yields a random profit $\tilde{\Pi}$.
If r denotes the rate of interest charged by
the bank, the entrepreneur must reimburse $D(1 + r)$. If
$\tilde{\Pi} \geq D(1 + r)$, he does so and retains $\tilde{\Pi} - D(1 + r) \geq 0$.
If $\tilde{\Pi} < D(1 + r)$, his firm goes bankrupt and retains nothing.
Bankruptcy costs (legal and administrative costs, or
losses in the product market due to disruption) are equal
to B, so the bank, which has priority in the bankruptcy
process, retains $\tilde{\Pi} - B$. If F denotes the cumulative distribution
of $\tilde{\Pi}$ (with density f), the firm's expected profit is

$$U(D, r) = \int_{D(1+r)}^{\bar{\Pi}} \left(\tilde{\Pi} - D(1 + r)\right) f(\tilde{\Pi})\, d\tilde{\Pi}.$$

The bank's expected profit is

$$V(D, r) = D(1 + r)\left(1 - F(D(1 + r))\right) + \int_{0}^{D(1+r)} (\tilde{\Pi} - B) f(\tilde{\Pi})\, d\tilde{\Pi}. \tag{9.16}$$


Suppose that banks are competitive, and that the cost of
their funds is $1 + r_0$. The bank's zero-profit condition can
be written as

$$V(D, r) = (1 + r_0)D. \tag{9.17}$$

Assume that equation 9.17 defines a unique rate of interest,
$r(D)$, and that $dr/dD > 0$.51

Will the entrepreneur invest in the project? Only if his
expected profit exceeds $(1 + r_0)E$, the opportunity cost
of his equity. Where W denotes the entrepreneur's net
benefit from the project, this project is undertaken if

$$W \equiv \int_{(K-E)(1+r)}^{\bar{\Pi}} \left(\tilde{\Pi} - (1 + r)(K - E)\right) f(\tilde{\Pi})\, d\tilde{\Pi} - (1 + r_0)E > 0.$$

Using equation 9.17 and letting $E\tilde{\Pi} = \int \tilde{\Pi} f(\tilde{\Pi})\, d\tilde{\Pi}$ denote
the expectation of $\tilde{\Pi}$, we can rewrite W in a simpler
manner:

$$W = [E\tilde{\Pi} - (1 + r_0)K] - [B\,F((1 + r)(K - E))], \tag{9.18}$$

where the first term is equal to the project's net value in
a perfect financial world and the second term represents
the expected bankruptcy cost. (The firm goes bankrupt if
$\tilde{\Pi} < (1 + r)(K - E)$.)

It is now easy to see that a higher wealth or equity
makes it more likely that the project is undertaken: A
higher equity lowers debt $(K - E)$ as well as the rate of
interest, so $dW/dE > 0$. We can interpret this result as
follows: A higher equity lowers the probability of bankruptcy
and therefore reduces bankruptcy costs. From the
zero-profit condition, these cost savings are passed on to
the entrepreneur (see equation 9.18) and make the project
look more attractive.52
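A small numerical sketch may help fix ideas about equations 9.17 and 9.18. It rests on assumptions that are not in the text: profit is taken to be uniformly distributed on [0, pi_max], and the parameter values and function names are purely illustrative. Under the uniform assumption, the zero-profit condition is a quadratic in the repayment R = D(1 + r), and the smaller root is the competitive one.

```python
import math


def zero_profit_repayment(D, r0, B, pi_max):
    """Smallest repayment R = D(1 + r) at which the bank breaks even.

    Assumes profit is uniform on [0, pi_max], so F(x) = x / pi_max.
    Returns None when no rate of interest lets the bank break even.
    """
    if D == 0:
        return 0.0
    a = 1.0 - B / pi_max
    disc = a * a - 2.0 * (1.0 + r0) * D / pi_max
    if disc < 0:
        return None
    return pi_max * (a - math.sqrt(disc))


def bank_profit(R, B, pi_max):
    """V = R(1 - F(R)) + integral from 0 to R of (x - B) f(x) dx, uniform case."""
    return R * (1.0 - R / pi_max) + (R * R / 2.0 - B * R) / pi_max


def net_benefit(E, K, r0, B, pi_max):
    """W from equation 9.18: net project value minus expected bankruptcy cost."""
    R = zero_profit_repayment(K - E, r0, B, pi_max)
    if R is None:
        return None
    return pi_max / 2.0 - (1.0 + r0) * K - B * R / pi_max


# Illustrative parameters (hypothetical): K = 0.8, r0 = 5%, B = 0.2, pi_max = 2.
for equity in (0.2, 0.5, 0.8):
    print(equity, net_benefit(equity, K=0.8, r0=0.05, B=0.2, pi_max=2.0))
```

With these numbers W rises with E and reaches the perfect-capital-market value once the project is fully equity financed, in line with the comparative statics dW/dE > 0 discussed above.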

The foundations for an understanding of the optimality 
of a debt contract between the bank and the entrepreneur 
have been laid by Diamond (1984), Gale and Hellwig 


51. The rate of interest r defined by equation 9.17 exceeds $r_0$ because the bank
gets a return r when $\tilde{\Pi}$ is high, and less than r in case of bankruptcy.

For a given D, there may not exist a rate of interest that satisfies the
zero-profit constraint. The bank may make negative profit for all r. The problem
is that, by raising r, the bank raises the probability of bankruptcy and therefore
raises the bankruptcy costs, so its profit may decrease. The derivative of V with
respect to r is proportional to
$$1 - F(D(1 + r)) - Bf(D(1 + r)).$$
The derivative of
$$V(D, r) - (1 + r)D$$
with respect to D is negative if, e.g., B is “small.” Thus, if the sum to be reimbursed,
$D(1 + r)$, is close to the maximum profit, $\bar{\Pi}$, the profit decreases with r
(assuming that f is bounded away from zero). We ignore this issue and assume
in the following that $1 - F - Bf$ is positive in the relevant range and that there
exists a zero-profit rate of interest (sufficient conditions for this are easy to
find).


52. Alternatively, the entrepreneur may not be able to find a bank that is 


willing to finance the investment, if we relax our assumption that there exists a 
rate of interest at which the bank breaks even. 
and Townsend (1979). These authors assume that
the bank cannot observe the firm's profit without inspecting
the firm's accounts. The cost of doing so is B. When ...
(i.e., to reimburse $D(1 + r)$). Asymmetric information thus
yields a debt contract as the optimal contract.


rm 2's equity after the first period depends on 


riod profit and therefore reduces its second- 
ity. Firm 2 must borrow more and finds con- 


et.5 And the prey exits the market either 
(because the rate of interest charged by the 


bank and firm 2 have an incentive to write, before period 2, a
long-term contract stating that firm 2 will always get financing (i.e., will borrow
$K - E$ whatever E is)? Doing so eliminates firm 1's incentive to prey and thus
raises firm 2's profit. Thus, it may pay to sign such a contract if it will be
1 and if the bank and the firm can commit to abide by this


preys in the first period, so that financing the second-period project is no longer
profitable for firm 2 (given the bank's zero-profit condition). Then firm 2 and
save their profits by renegotiating the contract and not going
through with the project. Thus, the long-term contract is not credible unless it
is the same as the short-term contract in the text (according to which the
project will be financed only if $W(E) > 0$).
Bolton and Scharfstein (1987) also consider the link between long-term
financial contracts and predation. Their model differs from that of Fudenberg
and Tirole in two respects. First, a firm's incentive to repay a loan is linked not
bank is too high relative to the opportunity value of the
firm’s assets) or else because it cannot find financing at all
(there is no rate of interest at which the bank can break
even). This model addresses the effect of financial constraints
on expansion, as well as the effect on exit. Exit,
with or without bankruptcy,54 is not necessarily the most
common outcome of such a predation (see Scherer 1980,
p. 214). As in our model, the prey can stop expanding or
modernizing rather than run the risk of going bankrupt.
Also, the prey can sell out its assets to the predator.55 As
Saloner (1987) notes, the “failing-firm defense” (a rarely
used provision in the U.S. merger law that allows the
acquisition of a company that is unlikely to survive as
a viable competitor in the absence of a merger; see
Scherer 1980, p. 555) may well encourage predatory behavior.
(This reasoning assumes that courts have difficulty
assessing predatory behavior. Otherwise, as long as predation
is an offense, the prey can sue for treble damages,
and this may be a better deal than it can get in a merger.)

The “long purse” story relies on the presumption that 
outside financing is more costly than inside financing 
(retained earnings). Some asymmetric information between 
the borrower and the lenders must underlie this cost dif- 
ferential. As Roberts (1987) notes, one institution that 
alleviates the financing problem for new firms (which are 
particularly likely to face financial constraints) is venture 
capitalism. Venture capitalists are often very involved in 
the day-to-day operations of a firm, which reduces the 
asymmetry of information between the creditors and the 
firm and thereby lowers the cost of financing new projects.
Equity, however, has its own costs.56 Much work
remains to be done to determine the exact link between 
financial markets and industry structure. 


with the threat of bankruptcy but with that of no refinancing in subsequent
periods. Second, they consider debt contracts that are either observable or
unobservable by the predator, whereas Fudenberg and Tirole look only at
unobservable debt contracts. Bolton and Scharfstein highlight a trade-off
between incentives to perform (i.e., repay the debt) and vulnerability to
predatory behavior. Creditors can only respond to the threat of predation by
worsening the incentive problem (i.e., raising the probability of refinancing).


54. It would not be difficult to introduce an equilibrium probability of bank- 
ruptcy in this model. 


55. Or to a financially solid rival (which also has the effect of increasing 
industry concentration). 


56. See section 3 of Holmstrom and Tirole 1987 for a review of the pros and
cons of debt and equity financing. 
9.9 Supplementary Section:
Darwinian Selection in an Industry 


In this section, we take up the model of the war of attrition
(see chapter 8) to develop an example of “bilateral
predation.” We consider an industry in which there is
initially a surplus number of firms, so that the departure of
some is a necessary condition for the survival of the
others. Predatory behavior is passive, in that each firm
waits for the other firm(s) to exit first.

The war of attrition studied here, unlike that in chapter
8, has asymmetric information about payoffs. Each firm is
uncertain about its rivals’ profit or cost in the market.
Behind the waiting strategy lies the secret hope of each
firm that the war of attrition will prove very costly for its
rivals. If this does not happen and a firm’s rivals insist
upon staying in the industry, the firm must eventually
leave the industry. As we shall see, a war of attrition
selects the healthiest or the most motivated firms (that is,
those whose actual losses are smallest or whose future
profits are highest). Beyond this Darwinian flavor,57 the
asymmetric-information war of attrition has the advantage
of allowing firms to make positive expected profits.58,59

The idea of the war of attrition was pioneered by 
theoretical biologists (see, e.g., Maynard Smith 1974 and 
Bishop et al. 1978). Riley (1980) and Kreps and Wilson 
(1982) introduced an asymmetric-information version to 
the realm of economics. Kreps and Wilson demonstrated 
that although a war of attrition resembles a “war of en- 
trenchment” and although predatory pricing behavior of 
the sort discussed in section 9.6 above resembles more a 
“war of extermination,” the two behaviors are closely 
related in a formal sense: They both are part of the same 
signaling methodology.60

Consider a simple example of a war of attrition in
an industry with increasing returns (i.e., a stripped-down
version of the Fudenberg-Tirole [1986c] model). There


57. There also exist competitive (as opposed to strategic) models of selection
in industries. The idea that the unfit are swept away is developed in Alchian


no expected profit: For a firm to be willing to fight, it must be the case that its
rival will give up with positive probability. From this, we concluded that each
firm is indifferent between staying and exiting at each instant, which means
that both ex ante expect no profit. See Milgrom and Weber 1986
for a discussion of the link between complete-information and incomplete-
information wars of attrition.


59. A further advantage is that under some conditions the equilibrium is unique 
(see below); there exist several equilibria under complete information. 


60. To see the analogy, note that if a multimarket incumbent (see section 9.6) 
acquiesces to entry in a market and reveals that predation is costly to him, he 
gives up the potential benefits of future entry deterrence. Similarly, a firm that 
exits the market abandons the potential monopoly profit on this market. 
are two firms, i = 1, 2. At time 0, both firms are in the


that it does so at moment T. Firm 2 becomes a
monopolist from that moment on. (We do not allow
although the equilibrium we derive remains an 


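$$V_1 = \int_0^T (\Pi^d - f_1)\, e^{-rt}\, dt$$

and

$$V_2 = \int_0^T (\Pi^d - f_2)\, e^{-rt}\, dt + \int_T^{\infty} (\Pi^m - f_2)\, e^{-rt}\, dt.$$

$V_1$ and $V_2$ are defined in an analogous way when firm 2 is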


its rival, it will earn a negative profit in the industry. At 
the same time, a firm that has fixed costs smaller than 
the duopoly profit will enter and will always stay in the
industry.

We have in mind here an industry in which the prob- 
ability that the fixed costs are greater than the duopoly 
profit is high (i.e., $G_i(\Pi^d)$ is close to zero).61 Thus, the
industry is similar to a natural monopoly, in that each firm 
may be capable of surviving as a monopoly but each loses 
money in a duopoly. In the war of attrition that follows, 
the firm that is first to leave is the loser. 

Each firm's strategy is simple: It amounts to a stopping
time $T_i$ at which firm i leaves the industry if firm j
has not done so before. Of course, this stopping time
depends on the fixed cost of the firm: $T_i(f_i)$. Therefore,
let $\{T_1(f_1), T_2(f_2)\}$ be the two stopping times conditional
on the fixed costs. (Since fixed costs are private informa- 
tion, each firm does not know the exact stopping time of 
its rival; rather, each has only a probability distribution 
over this variable derived from its probability distribution 
over the rival's fixed cost.) 

Suppose that firm 1, with fixed cost $f_1$, chooses stopping
time T. Then the present discounted value of its
expected profit over time is equal to

$$\text{Prob}[T_2(f_2) \geq T] \int_0^T (\Pi^d - f_1)\, e^{-rt}\, dt
+ \int_{\{f_2 \mid T_2(f_2) < T\}} \left( \int_0^{T_2(f_2)} (\Pi^d - f_1)\, e^{-rt}\, dt
+ \int_{T_2(f_2)}^{\infty} (\Pi^m - f_1)\, e^{-rt}\, dt \right) g_2(f_2)\, df_2.$$

We seek the $T = T_1(f_1)$ that maximizes the above expression.
(The maximization exercise for firm 2 is analogous.)
Thus, we obtain a Nash equilibrium (a perfect Bayesian 
equilibrium, in the language of the Game Theory User's 
Manual). In spite of the complexity of the firms’ objective 
functions, the solution turns out to be quite simple. 

From the profit expression, it is easy to show that if
$f_i > f_i'$, then

$$T_i(f_i) \leq T_i(f_i').62$$


61. $G_i(\Pi^d) > 0$ is realistic. It also yields uniqueness of the solution.
See exercise 9.2 for an example of multiplicity in the pure natural-monopoly
case, i.e., the case in which $G_i(\Pi^d) = 0$.
62. To obtain this, write the two “incentive-compatibility constraints”: that the
firm prefers $T_i(f_i)$ to $T_i(f_i')$ when its cost is $f_i$ and the reverse when its cost is $f_i'$.
Adding these two constraints yields the desired result.
In other words, the higher firm i’s fixed cost, the sooner
that firm leaves (a Darwinian characteristic of this selection).
Moreover, the function $T_i$ can be shown to be
strictly decreasing and thus differentiable almost everywhere.63
We define

$$F_i(t) \equiv T_i^{-1}(t)$$

to be firm i’s level of fixed cost such that firm i exits at
date t.


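We will now derive the conditions governing the functions
$F_i$ (or, in an equivalent manner, the functions $T_i$). To
do this, we consider firm i at the moment (t) when its
fixed cost is such that it decides to leave, that is, when
$f_i = F_i(t)$. Now suppose that firm i decides to stay in the
industry until $t + dt$ and to leave at that moment if firm j
has not left. We calculate the probability that firm j leaves
during this time interval conditional on its not having
done so before t. This last condition means that
$f_j \leq F_j(t)$, so the conditional probability is

$$\frac{-g_j(F_j(t))\, F_j'(t)}{G_j(F_j(t))}\, dt,$$

where $g_j$ is the density of $G_j$. If firm j leaves, firm i earns the
monopoly profit from then on, which is worth $(\Pi^m - F_i(t))/r$.
Since firm i loses $F_i(t) - \Pi^d$ per unit of time, it must be
the case that

$$F_i(t) - \Pi^d = \left( \frac{-g_j(F_j(t))\, F_j'(t)}{G_j(F_j(t))} \right) \left( \frac{\Pi^m - F_i(t)}{r} \right). \tag{9.19}$$

Otherwise, with fixed cost equal to $F_i(t)$, firm i could increase
its expected profit by moving its departure date
forward or backward.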

Equation 9.19 and its analogue for firm j form a system
of differential equations. To the latter are added the following
“boundary conditions”:

$$F_i(0) = \Pi^m \quad \text{and} \quad \lim_{t \to \infty} F_i(t) = \Pi^d \quad \text{for all } i.$$

The first of these conditions comes from the property that
a firm will enter only if it can survive as a monopoly (and
if it does enter, it at least waits to see whether its rival
gives up immediately64). The second condition is related
to the fact that the instantaneous probability that a rival
will leave tends toward zero as time tends toward infinity
and to the fact that a firm that is willing to fight must
sustain negligible duopoly losses.

Figure 9.3
Selection in a war of attrition. (a) Stationary payoffs: infinite selection.
(b) Nonstationary payoffs: possibility of finite selection.
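To see what the exit schedule looks like, one can invert the symmetric version of equation 9.19, read here as $F(t) - \Pi^d = [-g(F)F'/G(F)](\Pi^m - F)/r$. Writing T for the inverse of F and using $T(\Pi^m) = 0$ gives $T(f) = \frac{1}{r}\int_f^{\Pi^m} \frac{g(x)}{G(x)} \frac{\Pi^m - x}{x - \Pi^d}\, dx$. The sketch below evaluates this integral under an assumption that is not in the text: costs are uniform on $[0, \bar f]$ with $\bar f \geq \Pi^m$, so that $g/G = 1/x$. Parameter values and function names are illustrative only.

```python
import math


def stopping_time(f, pi_d, pi_m, r, steps=2000):
    """Exit date T(f) of a firm with fixed cost f, pi_d < f <= pi_m.

    Midpoint-rule evaluation of
        T(f) = (1/r) * integral from f to pi_m of (pi_m - x) / (x (x - pi_d)) dx,
    which assumes a symmetric equilibrium and uniformly distributed costs
    (hazard rate g/G = 1/x). Both are assumptions made for this sketch.
    """
    if not (pi_d < f <= pi_m):
        raise ValueError("need pi_d < f <= pi_m")
    h = (pi_m - f) / steps
    total = 0.0
    for k in range(steps):
        x = f + (k + 0.5) * h
        total += (pi_m - x) / (x * (x - pi_d))
    return total * h / r


def stopping_time_closed_form(f, pi_d, pi_m, r):
    """Same integral via partial fractions (requires pi_d > 0)."""
    a = (pi_m - pi_d) / pi_d * math.log((pi_m - pi_d) / (f - pi_d))
    b = pi_m / pi_d * math.log(pi_m / f)
    return (a - b) / r


# Illustrative values (hypothetical): duopoly profit 1, monopoly profit 3, r = 10%.
for cost in (2.5, 2.0, 1.5, 1.1):
    print(cost, stopping_time(cost, pi_d=1.0, pi_m=3.0, r=0.1))
```

The computed exit dates rise as the fixed cost falls and grow without bound as f approaches $\Pi^d$, which is the numerical counterpart of the two boundary conditions.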

In equilibrium, selection may take a long time (in the
sense that there is a positive probability that, at any moment,
the industry will still be shared by the two firms and
a positive probability that one of the firms will have
dropped out; see figure 9.3a). A different result can be
obtained if we generalize the model by allowing the profit
functions to vary over time. Fudenberg and Tirole
(1986c) allow learning by doing as well as time-varying
demands (i.e., a growing or declining industry). In the
case of the generalized model, the selection process may
stop in finite time, as a firm that is not viable as a duopoly
at date 0 may become so after demand has grown or
production costs have decreased. Figure 9.3b represents a
finite selection process in the case of growing profits. If
both firms are still in at date $t_0$, they will remain a duopoly
forever. Thus, at $t_0$ firm i must be indifferent between
leaving and staying in a duopoly forever when it has cost
$F_i(t_0)$. This replaces the previous boundary conditions,

$$\lim_{t \to \infty} F_i(t) = \Pi^d,$$

by

$$F_i(t_0) = (A\Pi_i^d)(t_0) \quad \text{for } i = 1, 2,$$

where $(A\Pi_i^d)(t)$ denotes firm i’s average (discounted)
duopoly profit from date t on; i.e.,

$$(A\Pi_i^d)(t) = \int_t^{\infty} \Pi_i^d(s)\, r e^{-r(s-t)}\, ds,$$


63. See Fudenberg and Tirole 1986c for a full characterization of equilibrium.
64. This has a positive probability as long as $G_j(\Pi^m) < 1$.
and $\Pi_i^d(s)$ is firm i’s gross duopoly profit at date s.65
Fudenberg and Tirole also show that the perfect Bayesian
equilibrium of this game exists and is unique.

Comparative-statics results from this model are meager.
However, consider the symmetric case ($G_1 = G_2 = G$).
It can be shown that when the cost distribution shifts
toward higher costs, in the sense of a higher hazard rate
$g(f)/G(f)$ for all f, selection takes longer, in that $T_i(f_i)$
increases for all $f_i < \Pi^m$.66 (Of course, to obtain the time
distribution of exit, one must also take the change in the
distribution of costs into account.)

The welfare results, as in many signaling models of predation,
are ambiguous. On the one hand, if the duopolists
succeed in maintaining price collusion, the social value of
competition is low and a social planner would like to
hasten the exit process. The monopoly rent is dissipated
through duplication of fixed costs during the war of attrition,
rather than through low prices. In contrast, if the
price under duopoly competition is close to the marginal
cost, firms may exit too soon from a social point of view,
as they do not internalize the gain in consumer surplus
associated with competition.

There are two meanings that can be given to the Darwinian
notion that competition selects the fittest firm.
The first is that a given firm exits later when its cost is
lower. We noted that this property always holds in wars
of attrition. The second meaning compares the two firms
and states that the remaining firm is more efficient than
the exiting firm. This property is satisfied only for symmetric
profit functions and cost distributions. It is easily
seen that if the two firms’ costs are drawn from different
distributions, competition may select the wrong firm (the
one with the higher cost).

Fudenberg and Kreps (1987) ask: What is a firm's willingness
to wage wars of attrition when it fights in several
markets and its private information is correlated across
markets? To make things concrete, suppose that firm i = 0
is in N geographically distinct markets. In market j, it
faces firm j (j = 1, ..., N). Its fixed cost per market, $f_0$, is
the same in all markets, and is distributed according to the
distribution $G_0(f_0)$ from the viewpoint of its rivals. Firm
j's fixed cost, $f_j$, is drawn from the distribution $G(f_j)$. The
N rivals’ costs are drawn independently. The markets are
identical and (except for $f_0$) independent.

Suppose first that the behavior in one market is not
observed in the other markets. Then firm 0 wages N independent
and identical wars of attrition, each described as
above. The exit behavior follows the differential equation
9.19.

Assume next that there is an informational leakage.
Firm j observes what happens in market $j' \neq j$. The N
wars of attrition are simultaneous (i.e., they start at the
same time). Suppose that reentry is prohibitively costly,
so that when a firm exits its rival has conquered the
market forever. Fudenberg and Kreps show that the equilibrium
is unchanged. That is, playing wars of attrition
against N opponents is equivalent to playing against one
opponent (except that payoffs are multiplied by N); information
leakages do not matter. To see this, note that
the differential equation for each firm j (j = 1, ..., N) is
unchanged. Suppose that the incumbent, if it exits one
market, exits all remaining duopoly markets simultaneously
(it obviously does not exit monopolized markets, as
reentry is not feasible). If $N - k$ rivals have not yet conceded
at some date t, then the cost of staying per unit of
time is

$$(N - k)(f_0 - \Pi^d).$$

The expected gain from staying is also multiplied by
$N - k$; it is equal to

$$(N - k) \left( \frac{-g(F(t))\, F'(t)}{G(F(t))} \right) \left( \frac{\Pi^m - f_0}{r} \right),$$

where F(t) describes the symmetric behavior of each of its
rivals. Thus, the differential equation for the incumbent
is unchanged, and for that reason the equilibrium is unchanged.
Fudenberg and Kreps then investigate the role
of the no-reentry condition and show that, under some
assumptions about the nature of equilibrium when the
incumbent has revealed its type, the information leakage
between markets may benefit or hurt the multimarket
firm.


The following exercises pertain to incomplete-informa- 
tion wars of attrition. 


65. If $\Pi_i^d(s)$ is increasing, $(A\Pi_i^d)(t)$ is also increasing.
66. Equation 9.19 provides the intuition for this. The higher g/G suggests a
lower slope $|F'(t)|$ for the exit function.
Exercise 9.2** Riley (1980) considers a war of attrition
in which two animals of the same species both desire the
same food source or mate. Both use up valuable time to
fight for the prize. Eventually, at some date t, one of the
contestants departs, and has payoff $-t$. The winner has
payoff $v_i - t$. Animal i's valuation for the prize, $v_i$, is
private information and is drawn from a distribution $G(\cdot)$
with density $g(\cdot)$. ($G(0) = 0$, $G(+\infty) = 1$.) Let $V_i(t)$ denote
the valuation such that animal i drops out at date t.

(i) Use an intuitive argument to show that the differential
equations for equilibrium are

$$V_i(t)\, V_j'(t)\, g(V_j(t)) = 1 - G(V_j(t)).$$

(ii) Suppose that $G(v) = 1 - e^{-v}$. Show that there exists
a continuum of equilibria indexed by K in $(0, +\infty)$ such

Exercise 9.3*** Kreps and Wilson (1982) consider the
following war of attrition: There are two players, i = 1, 2.


, the game ends. Each player can be either “strong”
(with probability p for player 1 and q for player 2) or
“weak” (with probabilities 1 − p and 1 − q). A strong
player enjoys fighting and therefore never concedes. A
weak player 1 loses 1 per unit of time while fighting and
makes a > 0 per unit of time when its rival has conceded;
a weak player 2 loses 1 per unit of time while fighting and
makes b > 0 per unit of time when its rival has conceded.
Thus, a weak player 1 has payoff a(1 − t) − t when it
wins at t and payoff −t when it concedes at t; similarly
for player 2. There is no discounting.

(i) Show that from time $0^+$ on, the posterior beliefs $p_t$
and $q_t$ of each player about the other must belong to the
curve $q = p^{b/a}$.

(ii) Show that one of the weak types exits with positive
probability at date 0 exactly (that is, a player's cumulative
probability distribution of exit times exhibits an atom at
t = 0). How are the weak types’ payoffs affected by a, b,
p, and q?


Answers and Hints


Exercise 9.1 


(i) No information sharing: Let $p_2^e$ denote firm 2's expected
price.

$$\max_{p_1}\,\bigl[(1 - p_1 + p_2^e)(p_1 - c_1)\bigr]$$

yields

$$p_1 = (1 + p_2^e + c_1)/2.$$

Thus,

$$p_1^e = E\,p_1 = (1 + p_2^e + c^e)/2.$$

By symmetry, $p_1^e = p_2^e = 1 + c^e$. Firm 1's expected profit
conditional on $c_1$ is thus

$$E\,\Pi^1 = (2 + c^e - c_1)^2/4.$$

Taking the expectation of this expression with respect to
$c_1$ yields the result.
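
A quick numerical check of part (i) can be sketched as follows. The sketch assumes the demand $1 - p_1 + p_2$ and the best response $p_1 = (1 + p_2^e + c_1)/2$ used in the answer, with $c^e$ standing for the expected cost; the function names and the sample values are illustrative, not part of the exercise.

```python
def expected_price(c_e, iters=200):
    """Iterate the expected best response p^e -> (1 + p^e + c^e)/2 to its fixed point."""
    p_e = 0.0
    for _ in range(iters):
        p_e = (1.0 + p_e + c_e) / 2.0
    return p_e


def expected_profit(c_1, c_e):
    """Firm 1's expected profit given its own cost c_1 and the rival's expected price."""
    p_2e = expected_price(c_e)
    p_1 = (1.0 + p_2e + c_1) / 2.0      # best response to the expected rival price
    return (1.0 - p_1 + p_2e) * (p_1 - c_1)
```

With, say, $c^e = 0.3$ the fixed point should be $1 + c^e$, and the profit of a firm with $c_1 = 0.1$ should agree with $(2 + c^e - c_1)^2/4$.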

(ii) Information sharing: c, and c, are common knowl- 
edge at the date of price competition. See chapter 7. 

(iii) Information sharing has two effects. First, efficiency 
is increased, because producers have better information 
about costs and demand. More efficient price or output 
decisions result from a finer information structure. Sec- 
ond, information sharing affects the nature of product- 
market competition, and makes it more or less “collusive.” 
In general, the private (and social) desirability of informa- 
tion sharing depends on the model specification. 

Most of the literature (like this exercise) considers linear
demand functions. Clarke (1983) and Gal-Or (1985) show
that information exchange about demand does not take 
place under quantity competition. Vives (1984) considers 
differentiated products and shows that whether informa- 
tion is shared depends on whether goods are comple- 
ments or substitutes and on whether firms compete in 
price or in quantities. Shapiro (1986) considers quantity 
competition and allows correlation between the costs; he 
shows that consumer surplus falls but profit and welfare 
go up when information is shared. 


Exercise 9.2 


(i) At time t, animal 1 (say) loses dt by waiting dt
longer. But it wins a prize $v_1$ with conditional probability

$$\frac{[1 - G(V_2(t))] - [1 - G(V_2(t + dt))]}{1 - G(V_2(t))} = \frac{g(V_2(t))\,V_2'(t)\,dt}{1 - G(V_2(t))}.$$

By definition, the loss must equal the expected benefit for
$v_1 = V_1(t)$. This yields the differential equation.
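
For the exponential case $G(v) = 1 - e^{-v}$, the differential equations can be checked numerically. The sketch below assumes the candidate family $V_1(t) = \sqrt{2Kt}$, $V_2(t) = \sqrt{2t/K}$ with $K > 0$; this family is worked out here for illustration and is not quoted from Riley. It evaluates the residual of $V_i(t)V_j'(t)g(V_j(t)) = 1 - G(V_j(t))$ with a central difference. All function names are hypothetical.

```python
import math


def g(v):
    """Density of G(v) = 1 - exp(-v)."""
    return math.exp(-v)


def G(v):
    """Exponential distribution function."""
    return 1.0 - math.exp(-v)


def candidate(K):
    """Assumed candidate strategies V_1(t) = sqrt(2Kt), V_2(t) = sqrt(2t/K), K > 0."""
    return (lambda t: math.sqrt(2.0 * K * t),
            lambda t: math.sqrt(2.0 * t / K))


def residuals(K, t, eps=1e-6):
    """Residuals of V_i V_j' g(V_j) - [1 - G(V_j)] for (i, j) = (1, 2) and (2, 1)."""
    V1, V2 = candidate(K)

    def slope(V):
        # Central-difference approximation of V'(t).
        return (V(t + eps) - V(t - eps)) / (2.0 * eps)

    r12 = V1(t) * slope(V2) * g(V2(t)) - (1.0 - G(V2(t)))
    r21 = V2(t) * slope(V1) * g(V1(t)) - (1.0 - G(V1(t)))
    return r12, r21
```

Both residuals should be numerically zero for any $K > 0$ and $t > 0$, because $V_i(t)V_j'(t) = 1$ for this family and $g/(1 - G) = 1$ for the exponential distribution.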


is that the Riley “economy” is a natural monopoly
with probability 1. That is, it is common knowledge that
both members are not viable as a duopoly (this would
correspond to $G_i(\Pi^d) = 0$ for all i, in the text). This engenders
a technical difference: In the text, $G_i(F(t))$ tends to


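$$1 = [(1 - q_t)\mu_t]\,a(1 - t),$$

where $\mu_t$ denotes the rate at which a weak player 2 concedes at date t.
But, from Bayes’ rule,⁶⁷

$$\dot{q}_t = q_t(1 - q_t)\mu_t.$$

These two equations yield

$$\dot{q}_t = q_t/[a(1 - t)]$$

and, by symmetry,

$$\dot{p}_t = p_t/[b(1 - t)].$$

Dividing, we obtain

$$\frac{\dot{q}_t}{q_t} = \frac{b}{a}\,\frac{\dot{p}_t}{p_t}$$

or

$$q_t = k\,p_t^{b/a}.$$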


It is easily seen that k = 1. Suppose, for instance, that
k < 1. $p_t$ would reach 1 at date $t_0 < 1$, while $q_{t_0} = k < 1$.
Player 2, convinced that player 1 is strong, drops out
immediately with probability $(1 - q_{t_0}) > 0$. Thus, a weak
player 1 should wait at $t_0 - \varepsilon$.

(ii) At date 0, the players are not on the curve $q = p^{b/a}$.
Thus, one of them must drop out with strictly positive
probability so as to reach the curve and move along
this curve from then on. For more details see Kreps and
Wilson 1982.
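
The curve $q = p^{b/a}$ can be illustrated numerically. The sketch below assumes the belief dynamics $\dot{q}_t = q_t/[a(1 - t)]$ and $\dot{p}_t = p_t/[b(1 - t)]$ from part (i) and integrates them with a simple Euler scheme, starting from a point on the curve. The parameter values and the function name are illustrative assumptions.

```python
def simulate_beliefs(p0, a, b, t_end, steps=100000):
    """Euler integration of dp/dt = p/[b(1-t)] and dq/dt = q/[a(1-t)].

    The starting point is placed on the curve q = p**(b/a).
    """
    p, q = p0, p0 ** (b / a)
    dt = t_end / steps
    for n in range(steps):
        t = n * dt
        p_new = p + dt * p / (b * (1.0 - t))
        q_new = q + dt * q / (a * (1.0 - t))
        p, q = p_new, q_new
    return p, q
```

Starting from $p_0 = 0.3$ with $a = 2$ and $b = 1$, the beliefs at $t = 0.5$ should still satisfy $q = p^{b/a}$ up to discretization error, since both equations integrate to powers of $(1 - t)$.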


67. Derive this by taking the limit of Bayes’ rule in discrete time
(with $\mu_t\,dt$ the probability that a weak player 2 concedes between t and t + dt):

$$q_{t+dt} = \frac{q_t}{q_t + (1 - q_t)(1 - \mu_t\,dt)}.$$


10


Research and Development and the Adoption of New Technologies


This chapter has two distinct parts. The first part (sections 
10.1 through 10.4) analyzes the private and social incen- 
tives to bring about technological innovations; the second 
is concerned with how already existing innovations are 
adopted in the marketplace. 

Research and development (R&D) is crucial, not only in 
the analysis of an individual industry, but also from an 
economy-wide viewpoint. Solow's (1957) discovery that 
only a small fraction of per-capita growth (10 percent for 
the U.S. nonfarm sector over the period 1909-1949) was 
associated with an increase in the ratio of capital to labor 
called economists’ attention to the role of technological 
progress in improving welfare.¹ This central role implies
that careful attention should be given to the firms’ incen- 
tives to innovate and to adopt new technologies. 

It is customary to distinguish three stages of research: 
basic research aimed at deriving fundamental knowledge 
(pursued mainly at universities and government agencies); 
applied research associated with engineering; and devel- 
opment, which brings products and processes into com- 
mercial use. Then there is the post-research stage, during 
which the innovation diffuses through the industry via 
licensing, imitation of patented innovations, or adoption 
of unpatented innovations. 

It is also usual to distinguish between product innova- 
tions and process innovations. Product innovations create 
new goods and services; process innovations reduce the 
cost of producing existing products. Of course, it is not 
always possible to draw a clear line between the two 
types of innovation. One firm's new product may lead to 
a new process for another firm. Also, a product innova- 
tion can generally be regarded as a process innovation— 
imagine that the new product existed prior to the innova- 
tion, and that the innovation simply reduced its produc- 
tion cost. 


1. Certainly a non-negligible fraction of the residual is due to factors other than
technical progress (see, e.g., Denison’s [1962] high estimate of the role of an
improved education of labor). But there is no denying that technical progress is
a major component of the residual.


In sections 10.1 and 10.2 we will consider Schumpeter’s
thesis about the link between market structure and


?; we will focus on the
second argument, which accords to innovation the status
of a public good, the supply of which must be encouraged


matic


imitation.) The dilemma of the patent system is that, in
encouraging R&D, it prevents the diffusion of innovation
and consequently creates a noncompetitive situation.

The private benefits associated with an innovation protected
by a patent are considered in section 10.1. Section
10.2 introduces the cost side (the R&D production
process) and describes patent races. These two sections
emphasize the relationship between these problems and
those of product selection and investment (which were
considered in chapters 2, 7, and 8). After all, innovating
amounts to creating new products. Therefore, the issue of
the market's incentive to engage in R&D is similar to
competitive product diversity. On the cost side,


cai fan it Spracputa paicct2G. aeacharms al eran monopolist 
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tions, the public-good aspect of unpatentable ones). The 
welfare effects of patent protection are analyzed in section
10.3. Section 10.4 discusses alternative, more centralized
methods of encouraging R&D: the prize system and
the contractual mechanism. The supplementary section 
introduces the possibility of licensing an innovation and 
suggests how this can affect the value of the innovation 
and the race for a patent. 

Inventing new products and processes is not sufficient
for economic progress. The innovations must then be
properly exploited and diffused through licensing, imita- 
tion, or simple adoption. (The difference between imita- 
tion and adoption is that an imitator must pay for reverse 
engineering, e.g., figuring out another firm’s technology.)
The second part of this chapter is particularly concerned
with the adoption process. We assume that a new
technology is available to all firms at some cost. After a 
brief introduction, we analyze the preemptive aspects of 
the adoption of new technologies. In section 10.5 we 
consider whether, in oligopoly, the timing of adoption is 
likely to involve “bunching” (almost simultaneous adop- 
tion by all firms) or “spacing of adoption” between firms. 
Section 10.6 reviews the recent literature on standard- 
ization and network externalities. Network externalities 
exist when the consumers of a given good enjoy this 
good all the more when it is consumed by many other 
consumers (as in the case of telephone networks or videocassette
recorders). Such consumption externalities introduce
interesting features to the adoption process.


10.1 Incentives as a Function of the Market 
Structure: The Value of Innovation 


In his pioneering 1962 article, Arrow asked: What is the 
gain from innovation to a firm that is the only one to 
undertake R&D, given that its innovation is protected by 
a patent of unlimited duration?*:* Here we will attempt to 
isolate the “pure” incentive to innovate, i.e., that which is


does not have competitors ready to imitate his innovation or to circumvent an 
existing patent on this innovation 


3. The analysis here also follows Dasgupta and Stiglitz 1980. This section,
as well as sections 10.2 and 10.8, draws from a survey by Guesnerie and Tirole
(1985).


4. The more realistic case of a patent of limited duration or an innovation likely
to eventually become obsolete or be circumvented is treated similarly.

production cost of a particular good from an initial
level $\bar{c}$ to a level $c < \bar{c}$. Our thought experiment


market as well as with regard to R&D. Let $\Pi^m$ be the monopoly profit per
unit of time. From the envelope theorem, we have


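$$\frac{d\Pi^m}{dc} = \frac{\partial \Pi^m}{\partial p}\,\frac{dp^m}{dc} + \frac{\partial \Pi^m}{\partial c} = \frac{\partial \Pi^m}{\partial c} = -D(p^m(c)),$$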


where p™(c) is the monopoly price as a function of cost 
c. Therefore, the monopolist’s incentive to innovate is 
given by 


$$V^m = \frac{1}{r}\,[\Pi^m(c) - \Pi^m(\bar{c})] = \frac{1}{r}\int_c^{\bar{c}} D(p^m(c))\,dc.$$


Since $p^m(c) > c$ for any $c$, we see that $V^m < V^s$. This
is easy to understand, because monopoly pricing at any 
cost level yields underproduction as compared with the 
social optimum. Therefore, the monopolist’s cost reduc- 
tions pertain to a smaller number of units. This result is 
identical to that obtained in chapter 2, where it was 
demonstrated that, socially, a monopolist has too low an 
incentive to introduce a new product, because he cannot 
fully appropriate the social surplus (unless he can price- 
discriminate perfectly). 


10.1.3 Competition 


Finally, consider a situation that is competitive initially. A
large number of firms produce a homogeneous good with
a technology exhibiting marginal cost $\bar{c}$. These firms are
initially involved in Bertrand price competition, so that
the market price is $\bar{c}$ and all the firms are earning zero
profit. The firm that obtains the new technology exhibiting
cost $c$ is awarded a patent. Let $p^m(c)$ be the monopoly
price. There are two possible cases: $p^m(c) > \bar{c}$ and $p^m(c) \le \bar{c}$.
In the second case, the innovating firm sets its monopoly
price and the other, less efficient firms produce
nothing; then the innovation is called drastic or major.
In the first case, the innovator is constrained to charge⁵


5. Actually, he will charge $\bar{c} - \varepsilon$ in order to corner the entire market. We
assume here that the monopolist's objective function is strictly concave, so that
he wants to be as close as possible to his unconstrained optimum.

$p = \bar{c}$ because there is a competitive supply from the


$$V^m = \frac{1}{r}\int_c^{\bar{c}} D(p^m(c))\,dc < \frac{1}{r}\int_c^{\bar{c}} D(\bar{c})\,dc = V^c.$$


On the other hand, $D(\bar{c}) < D(c)$ for all $c < \bar{c}$, and therefore
$V^c < V^s$.


Exercise 10.1* Show that the value of a drastic process
innovation for a competitive firm exceeds the value of
this innovation for a monopolist but is still lower than
that for a social planner.


Thus, in both cases,

$$V^m < V^c < V^s.$$
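
The ranking can be checked for a specific demand curve. The sketch below assumes the linear demand $D(p) = 1 - p$ (so that $p^m(c) = (1 + c)/2$), which is an illustrative choice and not taken from the text. It computes $V^s = (1/r)\int_c^{\bar{c}} D(x)\,dx$, $V^m = (1/r)[\Pi^m(c) - \Pi^m(\bar{c})]$, and $V^c$ for a nondrastic or a drastic innovation. Function names are hypothetical.

```python
def demand(p):
    """Assumed linear demand D(p) = 1 - p."""
    return max(0.0, 1.0 - p)


def monopoly_price(c):
    """Monopoly price for D(p) = 1 - p and unit cost c."""
    return (1.0 + c) / 2.0


def monopoly_profit(c):
    p = monopoly_price(c)
    return (p - c) * demand(p)


def integrate(f, lo, hi, n=10000):
    """Midpoint rule, exact for the linear integrand used here."""
    step = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * step) for i in range(n)) * step


def innovation_values(c_bar, c, r):
    """Return (V_s, V_m, V_c) for a cost reduction from c_bar to c."""
    v_s = integrate(demand, c, c_bar) / r
    v_m = (monopoly_profit(c) - monopoly_profit(c_bar)) / r
    if monopoly_price(c) <= c_bar:
        v_c = monopoly_profit(c) / r               # drastic: unconstrained monopoly price
    else:
        v_c = (c_bar - c) * demand(c_bar) / r      # nondrastic: price held at c_bar
    return v_s, v_m, v_c
```

For example, a reduction from 0.5 to 0.4 with $r = 0.1$ is nondrastic, while a reduction from 0.8 to 0.2 is drastic; in both cases the three values should be ordered as in the text.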

Let us summarize this analysis. Even with a patent of
infinite duration, the problem of appropriating the social
surplus arises (here the surplus is generated by the introduction
of a new technique rather than a new product).
Also, aside from any strategic considerations, the monopolist
gains less from innovating than does a competitive

Exercise 10.2* $V^s$, $V^m$, and $V^c$ provide measures of the


structures. However, they are not adequate measures
if we wish to consider, say, the effectiveness of
research in the monopoly and competitive
cases. Let $W^m$ denote the increase in welfare brought
about by innovation when this innovation is exploited by
the monopolist (i.e., when marginal-cost pricing does not
hold). Thus, $W^m$ is equal to $V^m$ plus the change in consumer
surplus. And similarly for $W^c$.

(i) Show that $W^m > V^m$ and $W^c > V^c$.

(ii) Show that for a drastic innovation $W^m > W^c$. What
about a nondrastic innovation? (Hint: Consider the case of
a linear demand D(p) = a − bp, and compute how $W^m$
and $W^c$ change with $c$ for $c$ in the neighborhood of $\bar{c}$.)


Exercise 10.3* Consider an industry with n firms, linear
demand q = a − bp, and symmetric cost $\bar{c}$. Compute the
private value of a drastic innovation that reduces cost
from $\bar{c}$ to $c$, assuming Cournot competition. How does
this value change with n?


Exercise 10.4** An industry is initially competitive. The
price is equal to the firms’ marginal cost, $\bar{c}$. Demand is
linear: q = 1 − bp. One of the firms (and only one)
has access to a cost-reducing technology. It can obtain a
technology that allows production at marginal cost $c$ by
spending $\phi(c) = K(\bar{c} - c)^2/2$. K is sufficiently large that
the process innovation remains nondrastic. The process
innovation is made at date 0, and the patent lasts T units
of time (after which the technology $c$ can be used freely
by all firms). The firms wage Bertrand competition, and
the rate of interest is r. Following Nordhaus (1969), compute
the socially optimal patent length T*. How does T*
vary with b? With K? Interpret. (Hint: Use the envelope
theorem to compute the change in the profit of the innovating
firm that occurs with a change in the length of
the patent.)


10.1.4 Monopoly Threatened by Entry 


Consider the case of two firms in an output market. Before
innovation, firm 1 is a monopolist and produces at
unit cost $\bar{c}$. Firm 2, the potential “entrant,” has a very high
(infinite) unit cost and currently leaves the entire market
to firm 1, which makes a profit of $\Pi^m(\bar{c})$.

If the monopolist is the only one who can acquire a
new technology that reduces the unit cost to $c$, we have
the situation considered in subsection 10.1.2. In this case,
the incentive to innovate for the monopolist is $V^m$.

If the potential entrant is the only one who can acquire
the new technology, we have the situation of subsection
10.1.3 so long as Bertrand-type price competition prevails.
Then the monopolist plays the role of a competitive
fringe at price $\bar{c}$, and the value of innovation for the
entrant is $V^c$.

Comparing these two cases suggests that innovation is
more valuable for the entrant than for the monopolist,

assume that neither firm has an acquisitional
over the innovation. For example, think of a


et, generates the innovation and puts it up for
between the two producing firms. (A more plau-


technology first; see section 10.2.) Here, the concept
value of the innovation is slightly different from


innovation is more valuable for the entrant than for the
monopolist

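Let $\Pi^d(\bar{c}, c)$ and $\Pi^d(c, \bar{c})$ be the profits per unit of time


is still $\bar{c}$. For the entrant and the monopolist, the innovation
values $V^c$ and $V^m$ can be written as follows:

$$V^c = \frac{\Pi^d(c, \bar{c})}{r} \quad\text{and}\quad V^m = \frac{\Pi^m(c) - \Pi^d(\bar{c}, c)}{r}.$$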


It is reasonable to assume that in a homogeneous-good
industry a monopolist does not make less profit than two
noncolluding duopolists:

$$\Pi^m(c) \ge \Pi^d(\bar{c}, c) + \Pi^d(c, \bar{c}). \tag{10.1}$$

This property (called the efficiency effect⁶) must be verified
for each competitive model, but it is rather intuitive: if he
wishes, the monopolist can always duplicate the situation
of the noncolluding duopolists. Note that in the present
case the monopolist does not necessarily do strictly
better. To see this point, consider the situation in which
the innovation is drastic, so that $p^m(c) \le \bar{c}$. If the entrant
innovates, he completely eliminates the monopolist from
the market, and the potential industry profit is not dissipated
by competition. In this case, $\Pi^d(\bar{c}, c) = 0$ and
$\Pi^d(c, \bar{c}) = \Pi^m(c)$, so equation 10.1 is satisfied with equality.
Equation 10.1 more generally implies that $V^m \ge V^c$
(with equality for a drastic innovation). We conclude that
because competition reduces profits, the monopolist's incentive
to remain a monopolist is greater than the entrant's incentive
to become a duopolist. Thus, if the monopolist and the
entrant were to bid for the innovation, the monopolist
would bid $\Pi^d(c, \bar{c})$, obtain the property right on it, and
remain a monopolist (Gilbert and Newbery 1982).
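
The efficiency effect can be illustrated with two textbook duopoly models. The sketch below assumes the linear demand $D(p) = 1 - p$ and takes the bidding values to be $V^m = [\Pi^m(c) - \Pi^d(\bar{c}, c)]/r$ and $V^c = \Pi^d(c, \bar{c})/r$; the demand curve, the parameter values, and all function names are illustrative assumptions. The Cournot formula presumes an interior equilibrium in which both firms produce.

```python
def monopoly_profit(c):
    """Monopoly flow profit for D(p) = 1 - p and unit cost c < 1."""
    return (1.0 - c) ** 2 / 4.0


def bertrand_duopoly(c_own, c_rival):
    """Flow profit under Bertrand competition with a homogeneous good."""
    if c_own >= c_rival:
        return 0.0                               # the high-cost firm sells nothing
    p = min((1.0 + c_own) / 2.0, c_rival)        # monopoly price if drastic, else rival's cost
    return (p - c_own) * (1.0 - p)


def cournot_duopoly(c_own, c_rival):
    """Flow profit under Cournot competition (interior equilibrium assumed)."""
    q = (1.0 - 2.0 * c_own + c_rival) / 3.0
    return max(q, 0.0) ** 2


def bids(duopoly, c_bar, c, r):
    """Return (V_m, V_c): the monopolist's and the entrant's willingness to pay."""
    v_m = (monopoly_profit(c) - duopoly(c_bar, c)) / r
    v_c = duopoly(c, c_bar) / r
    return v_m, v_c
```

With $\bar{c} = 0.5$, $c = 0.4$, and $r = 0.1$, the monopolist's value exceeds the entrant's under both models; with a drastic reduction from 0.8 to 0.2 under Bertrand competition the two values coincide, as equation 10.1 holding with equality would suggest.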

As we shall see, this effect does not necessarily mean 
that in a race for a patent the monopolist will always 
innovate sooner than the entrant. We must also take into 
account the replacement effect developed above (i.e., the 
fact that the existing monopolist already earns monopoly 
profit before the innovation and, therefore, is in less of a 
hurry to innovate than an entrant who is “starting from 
scratch”). 

As Gilbert and Newbery (1982) note, the monopolist 
may want to obtain property rights on an innovation 
even though he will make no use of it. This occurs, for 
instance, if the patent relates to a production technology 
that is not superior to that of the monopolist. The only 
purpose of patenting then is to prevent the entrant from
competing. A similar situation may occur when a product
innovation is not sufficiently differentiated from the
monopolist’s product to warrant his incurring the costs of
introducing the new product; the monopolist may, however,
acquire the property right on the product innovation
in order to avoid competition. Thus, the efficiency
effect may be an explanation for patent “shelving.”⁷

6. We encountered this effect when discussing the persistence of monopoly in
subsection 8.6.2.


10.2 Introduction to Patent Races


Except in subsection 10.1.4, we have considered only the
pure incentive to innovate, i.e., the gain from innovating
in a hypothetical situation where a firm has a monopoly
over R&D activities. However, a firm does not usually
have such monopoly power. Rather, R&D competition
can be likened to a race for a patent. In this situation, each
firm may wish to accelerate its research program at the
cost of incurring additional expenses. This is reminiscent
of the rent-dissipation postulate enunciated in chapter 1;


10.2.1 A Model


A simple model of a patent race is the “memoryless”
or “Poisson” patent race associated with Dasgupta and
Stiglitz (1980), Lee and Wilde (1980), Loury (1979), and
Reinganum (1979, 1982).⁹ The research technology is
characterized by the assumption that a firm's probability
of making a discovery and obtaining a patent at a point
of time depends only on this firm's current R&D expenditure
and not on its past R&D experience. This assumption


We can use this model to study the so-called persistence
of monopoly. The issue, which was addressed by
Gilbert and Newbery (1982) and Reinganum (1983),¹⁰ is
whether a monopolist in the product market is more likely
to innovate than an entrant. In order to connect our
conclusions to some previously discussed points, we shall
consider the case of a monopolist producing at marginal
cost $\bar{c}$ and a process innovation leading to cost $c$. Two
firms, the monopoly (firm 1) and an entrant (firm 2), are
competing in R&D activities. The first firm to innovate
obtains and exploits a patent.¹¹ For simplicity, we assume
that the patent has an infinitely long life. Following the
notation of subsection 10.1.4, we denote as $\Pi^m(\bar{c})$ the
profit per unit of time earned by the monopolist before
the innovation. The potential entrant initially earns no
profit in this industry. After the innovation, the profits of
the monopolist and the potential entrant are $\Pi^m(c)$ and 0
respectively if the monopolist obtains the patent, and
$\Pi^d(\bar{c}, c)$ and $\Pi^d(c, \bar{c})$ respectively if the entrant obtains
the patent. As before, we assume that

$$\Pi^m(c) \ge \Pi^d(\bar{c}, c) + \Pi^d(c, \bar{c}).$$


We assume that if firm i spends $x_i\,dt$ between time t and
time t + dt, its probability of making a discovery during
this interval is $h(x_i)\,dt$, where h is a concave, increasing
function and h'(0) is “very large.” As was mentioned
above, a firm’s probability of making a discovery at time
t depends only on the flow of its expenditures at this time
and not on its past expenses (experience, memory). Nor
does the probability of a discovery here depend on the
date or on the competitor's past research program. (One
could obviously think of situations in which these last


7. Gilbert and Newbery mention the antitrust case in which the SCM
Corporation sought more than $500 million in damages alleging that the Xerox


taining innovations that were neither used nor licensed to others. 
‘1 and 452 of Scherer 1980 for further examples. For more on the 


renewal fee that goes up dramatically over the years. No such provisions
U.S. patent law, although compulsory licensing is sometimes used


focus on the strategic aspects of the innovation process. There is a
literature on the decision-theoretic approach (see Kamien and Schwartz
1982 for an extensive and clear review of this approach as well as for a good
overview of the empirical literature). This literature assumes either that there
is only one firm doing R&D or, more generally, that a firm's environment
(including the rivals’ R&D expenditures) is taken as exogenous (so that, in the
multifirm case, each firm assumes that its R&D spending has no influence on its
rivals’ spending). Grossman and Shapiro 1986a contains a useful analysis of the
optimal time path of R&D expenditures by a monopolist. (There also exists a
search-theoretic literature that puts less emphasis on research expenditures and
focuses on optimal search procedures and stopping time; see, e.g., Weitzman
1979 and Roberts and Weitzman 1981.)


9. See Reinganum 1984 for a review of memoryless patent races. 
10. See also Gilbert and Newbery 1984, Salant 1984, and Baldwin 1987. 


11. This “big bang” assumption is, of course, very debatable. Patents can be
circumvented, new innovations can come along, and so on. Also, a patent may
not be exploited immediately, because of uncertainty about the demand or the
technology.

two elements might provide information on the probability
of discovery to the firm in question, either through
general technological progress or through involuntary
diffusion of knowledge between firms.)

In general, R&D competition between two firms


starting at that moment is identical to the initial game.

In other words, the absence of experience implies that

Since the R&D process is of the Poisson type, the
probability that at time t none of the firms has made a
discovery is $e^{-[h(x_1) + h(x_2)]t}$. In the
absence of innovation before t, the monopolist realizes a
profit of

$$[\Pi^m(\bar{c}) - x_1]\,dt$$

between t and t + dt. Moreover, with probability $h(x_1)\,dt$,
he is first to innovate, and he earns, starting from this moment,
a stream of profit over time with a discounted
value of

$$\Pi^m(c)/r.$$

With probability $h(x_2)\,dt$ the entrant is the first to innovate,
and starting from this moment the monopolist’s
discounted value of profits over time is

$$\Pi^d(\bar{c}, c)/r.$$

Letting r denote the interest rate, we can write $V_1$ as


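$$V_1(x_1, x_2) = \int_0^\infty e^{-rt}\,e^{-[h(x_1) + h(x_2)]t}\left[\Pi^m(\bar{c}) - x_1 + h(x_1)\frac{\Pi^m(c)}{r} + h(x_2)\frac{\Pi^d(\bar{c}, c)}{r}\right]dt$$

$$= \frac{\Pi^m(\bar{c}) - x_1 + h(x_1)[\Pi^m(c)/r] + h(x_2)[\Pi^d(\bar{c}, c)/r]}{r + h(x_1) + h(x_2)}.$$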


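In a similar manner, $V_2$ can be written as

$$V_2(x_1, x_2) = \int_0^\infty e^{-rt}\,e^{-[h(x_1) + h(x_2)]t}\left[h(x_2)\frac{\Pi^d(c, \bar{c})}{r} - x_2\right]dt$$

$$= \frac{h(x_2)[\Pi^d(c, \bar{c})/r] - x_2}{r + h(x_1) + h(x_2)}.$$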


A Nash equilibrium is a set of research intensities $(x_1^*, x_2^*)$
such that $x_i^*$ maximizes $V_i$ given $x_j^*$ (for any i).

Which of the two firms spends more on research (or,
equivalently, which one is more likely to be the first to
innovate) depends on the two effects identified in the
preceding section. The efficiency effect

$$\Pi^m(c) - \Pi^d(\bar{c}, c) \ge \Pi^d(c, \bar{c}),$$

reflected in the numerators of $V_1$ and $V_2$, suggests that
the monopolist has more incentives to innovate and,
therefore, spends more on R&D. The monopolist gains a
net flow profit of

$$\Pi^m(c) - \Pi^d(\bar{c}, c)$$

by preempting the entrant, while the entrant gains only
$\Pi^d(c, \bar{c})$ by being first. The replacement effect leads to
the observation that the marginal productivity of R&D
expenditure for the monopolist decreases with his initial
profit:

$$\frac{\partial}{\partial[\Pi^m(\bar{c})]}\left(\frac{\partial V_1}{\partial x_1}\right) < 0.$$

This follows from the observation that, by increasing $x_1$,
the monopolist moves the discovery date forward (on
average) and therefore hastens his own replacement. In
contrast, the entrant does not forgo a flow profit when
discovering.

Either of the two effects may dominate. To see this, 
consider two extreme situations: 


• First, consider the case of a drastic innovation. Since the
entrant becomes a de facto monopolist in the case of
innovation, there is no dissipation of monopoly rent, i.e.,
no efficiency effect. Then the replacement effect must
dominate, i.e., $x_1^* < x_2^*$. Reinganum (1983) derives this result
from the first-order conditions associated with the
Nash equilibrium:

Therefore, we conclude that in the case of a drastic innovation
there is a tendency (in a probabilistic sense) toward
“entry” into the output market.¹²

• In order to eliminate the replacement effect, it is
sufficient to choose an R&D technology in which the
amounts committed per unit of time are considerable, so
that the probability of discovery per unit of time is high.
In this case, innovation is achieved early, and the monopolist
is much more concerned with the possibility of
innovation by the entrant than with the date of his own
“replacement.” Therefore, we have $x_1^* > x_2^*$ (consider, for
example, the family of technologies $\{\lambda h(x/\lambda)\}$, where $\lambda$
tends toward infinity¹³; see Fudenberg and Tirole 1986).
Consequently, for a nondrastic innovation, there is a tendency
for the monopoly to persist, because the established
firm has a higher probability of obtaining the patent.
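
Both effects can be seen in a small numerical experiment. The sketch below assumes the value functions $V_1 = \{\Pi^m(\bar{c}) - x_1 + h(x_1)\Pi^m(c)/r + h(x_2)\Pi^d(\bar{c}, c)/r\}/\{r + h(x_1) + h(x_2)\}$ and $V_2 = \{h(x_2)\Pi^d(c, \bar{c})/r - x_2\}/\{r + h(x_1) + h(x_2)\}$, the hypothetical technology $h(x) = 2\sqrt{x}$, and made-up profit numbers. It finds a Nash equilibrium by iterating best responses, each computed by ternary search (for this $h$, each value function is single-peaked in the firm's own spending). The second scenario is only one parameterization in which the efficiency effect happens to dominate; it is not the limiting argument with $\lambda \to \infty$.

```python
import math


def h(x):
    """Hypothetical discovery hazard: concave, increasing, h'(0) infinite."""
    return 2.0 * math.sqrt(x)


def values(x1, x2, r, pm_old, pm_new, pd_mono, pd_entr):
    """Return (V_1, V_2) for the monopolist (firm 1) and the entrant (firm 2)."""
    denom = r + h(x1) + h(x2)
    v1 = (pm_old - x1 + h(x1) * pm_new / r + h(x2) * pd_mono / r) / denom
    v2 = (h(x2) * pd_entr / r - x2) / denom
    return v1, v2


def argmax(f, lo=0.0, hi=1000.0, iters=100):
    """Ternary search for the maximizer of a single-peaked function on [lo, hi]."""
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if f(a) < f(b):
            lo = a
        else:
            hi = b
    return 0.5 * (lo + hi)


def nash(r, pm_old, pm_new, pd_mono, pd_entr, rounds=60):
    """Iterate best responses; returns the approximate equilibrium (x1, x2)."""
    x1 = x2 = 1.0
    for _ in range(rounds):
        x1 = argmax(lambda x: values(x, x2, r, pm_old, pm_new, pd_mono, pd_entr)[0])
        x2 = argmax(lambda x: values(x1, x, r, pm_old, pm_new, pd_mono, pd_entr)[1])
    return x1, x2
```

For instance, `nash(0.1, 1.0, 1.5, 0.0, 1.5)` describes a drastic innovation (the entrant earns the full monopoly profit and the incumbent earns nothing), and `nash(0.1, 1.0, 1.5, 0.0, 0.8)` a nondrastic one in which competition dissipates part of the industry profit.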


Exercise 10.5** Consider a symmetric patent race involving
n firms. The firms initially make no profit and have


hie) +r 


h(u) + (n 


Compute the firm's “reaction curve” y = R(x), where y
denotes the firm's best research intensity given that its
rivals choose x. Show that the reaction curve is upward
sloping. Interpret.

(ii) Solve for the symmetric Nash equilibrium. How
does the equilibrium research intensity vary with n? Show
that if V is also the social value of the patent, there is
socially too much R&D.


10.2.2 Discussion 


Two refinements of the basic patent-race model have 
been developed recently. First, R&D expenditures are 
only one dimension of the invention process. In the basic
model, firms hasten the date of discovery by spending 
more on R&D. In practice, they also must choose between
different technologies. They may choose technologies
that are more or less risky (e.g., technology A
may give a fairly deterministic date of discovery, whereas 
technology B may lead to faster discovery or turn out 
to be a dead end). Also, they may choose technologies 
that are more or less correlated with each other. Sec- 
ond, the basic patent-race model makes the unrealistic 
assumption that firms do not learn in the R&D process. 
This rules out strategic behavior of the kind studied 
for other investments in chapter 8 (e.g., first-mover 
advantages). 


10.2.2.1 Choice of Technology 


How does the race for a patent affect the choice of R&D 
technologies? 

Let us begin with the risk dimension. Dasgupta and
Maskin (1986), Judd (1985), and Klette and de Meza 
(1986) have taken up the issue of risk (which was 
pioneered by Dasgupta and Stiglitz [1980]) and have 
written models of patent races that, under some assump- 
tions, yield a market choice of excessively risky R&D 
technologies. That is, R&D competitors pick technologies 
that involve more “variance”¹⁴ than is socially optimal.
This is not entirely surprising. To the extent that a patent 
race resembles a “winner-take-all game,” what matters is 
to be first, not how far behind one finishes in the patent
race. Because the payoff of discovery becomes zero after 
a given point in time (equal to the rivals’ discovery date), 
a firm’s objective function is convex in its own discovery 
date, and this induces firms to choose risky technologies. 
But a firm, by choosing a risky technology, exerts a 
negative externality on its rivals. It increases its own 
chance of preempting the rivals when the latter would 
have discovered early and thus received a high payoff. 


12. Here entry does not mean competition, as one monopolist replaces another.


13. When $\lambda$ tends to infinity, the cost of R&D becomes almost linear in
research intensity: $\lambda h(x/\lambda) \approx x h'(0)$. It can be shown that this leads to very fast
innovation. (Recall that firms did not innovate quickly because of decreasing
returns in the discovery function h.)


14. A technically more correct definition of risk is that of increasing risk or
mean-preserving spreads. See exercise 10.6.

Following Dasgupta and Maskin and Klette and de Meza,
suppose that both firms choose a “safe” technology that
yields a discovery time distributed uniformly between 1
and 2 years. (Here we abstract from R&D expenditures to
focus on technology choice.) The discovery will be super-


time distributed uniformly between 0 and 3 years. Now
an R&D monopolist would be indifferent between the
two technologies, each of which would yield an expected
profit of 2.5 (because the expected discovery date is 1.5).
However, if these two technologies are equally costly,
rivals in a patent race will choose the risky one (see
exercise 10.6). The negative externality is then easy to
point out: When both firms choose the safe technology,
each firm is almost assured of winning when its discovery
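
The comparison between the two technologies can be reproduced numerically. The sketch below assumes that a discovery at date t is worth 4 − t to the firm that makes it first; this payoff is an inference from the figure 2.5 = 4 − 1.5 quoted above, since the sentence defining it is incomplete in this copy. The discovery dates are uniform on [1, 2] (safe) and [0, 3] (risky), and the rivals' draws are assumed independent. Function names are hypothetical.

```python
def uniform(lo, hi):
    """Return (density, survival function) of a uniform discovery date on [lo, hi]."""
    def pdf(t):
        return 1.0 / (hi - lo) if lo <= t <= hi else 0.0

    def survival(t):
        if t <= lo:
            return 1.0
        if t >= hi:
            return 0.0
        return (hi - t) / (hi - lo)

    return pdf, survival


SAFE = uniform(1.0, 2.0)
RISKY = uniform(0.0, 3.0)
HORIZON = 4.0  # assumed: payoff 4 - t, consistent with 4 - 1.5 = 2.5


def race_payoff(own, rival, n=30000):
    """E[(HORIZON - t_own) * 1{t_own < t_rival}] by the midpoint rule on [0, 3]."""
    pdf, _ = own
    _, rival_survival = rival
    step = 3.0 / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * step
        total += (HORIZON - t) * pdf(t) * rival_survival(t)
    return total * step


def monopoly_payoff(own, n=30000):
    """Expected payoff without a rival (the 'rival' never discovers)."""
    never = (lambda t: 0.0, lambda t: 1.0)
    return race_payoff(own, never, n)
```

Under these assumptions the R&D monopolist is indifferent between the two technologies, the risky technology is the better reply to either choice of the rival, and a firm that stays safe is hurt when its rival switches to the risky technology.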


Exercise 10.6** Two firms are engaged in a patent race.
At date 0, each chooses a technology from a family of
technologies indexed by a parameter p. F(t, p) denotes
the probability that a firm discovers before date t; the
associated density is f(t, p). The rate of interest is 0, and
the invention, which yields a flow profit v, becomes obsolete
at date T (so the payoff is v(T − t)). For simplicity,
assume that F(T, p) = 1 for all p. The cost of the technology
p is C(p), and p is a parameter of increasing risk
(see Rothschild and Stiglitz 1970). Recall that an increasing


$$\frac{\partial}{\partial p}\left(\int_0^T t f(t, p)\,dt\right) = -\frac{\partial}{\partial p}\left(\int_0^T F(t, p)\,dt\right) = 0,$$

and that

$$\frac{\partial}{\partial p}\left(\int_0^x F(t, p)\,dt\right) \ge 0$$

for all x in (0, T].

(i) Starting from arbitrary technology choices $\{p_1, p_2\}$,
show that an increase in $p_2$ hurts firm 1 as long as

$$f(t, p) - (T - t) f_t(t, p) \ge 0$$

(as is the case for the uniform distribution). (Hint: Integrate
by parts.)

(ii) Conclude that, starting from a symmetric equilib- 
rium, both firms will be better off if they symmetrically 
reduce p slightly. More generally, show that the coop- 
erative symmetric choice of p is lower than the compe- 
titive one (which, if v coincides with the social value of 
invention—say, because of perfect price discrimination 
—shows that there is socially an excessive risk in patent 
races). (Hint: Write the two cooperative and competitive 
necessary conditions and add them up; then use the 
answer to question i.) 
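
The safe-versus-risky comparison that precedes exercise 10.6 can be checked numerically. The sketch below is only an illustration under assumptions that are not all visible in the text: a flow profit of 1 and an obsolescence date of T = 4 (values chosen because they reproduce the monopolist's expected profit of 2.5 = 4 − 1.5), a zero interest rate, and a winner-take-all race. All function and variable names are hypothetical.

```python
# Illustrative sketch: expected payoffs in a two-firm patent race in which each
# firm picks a "safe" (uniform on [1, 2]) or a "risky" (uniform on [0, 3])
# discovery-time distribution.
# ASSUMPTIONS (not all stated in the text): flow profit v = 1, obsolescence
# date T = 4, zero interest rate, and only the first discoverer is paid.

HORIZON = 4.0   # assumed obsolescence date T
FLOW = 1.0      # assumed flow profit v


def uniform(low, high):
    """Return (density, cdf) functions of a uniform distribution on [low, high]."""
    def density(t):
        return 1.0 / (high - low) if low <= t <= high else 0.0

    def cdf(t):
        return min(1.0, max(0.0, (t - low) / (high - low)))

    return density, cdf


SAFE = uniform(1.0, 2.0)
RISKY = uniform(0.0, 3.0)


def monopoly_payoff(own, steps=4000):
    """Expected value of v (T - t) for a firm without a rival (midpoint rule)."""
    density, _ = own
    dt = HORIZON / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        total += density(t) * FLOW * (HORIZON - t) * dt
    return total


def race_payoff(own, rival, steps=4000):
    """Expected payoff of a firm that earns v (T - t) only if it discovers first."""
    density, _ = own
    _, rival_cdf = rival
    dt = HORIZON / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        # Probability that the rival has not yet discovered at date t.
        rival_still_searching = 1.0 - rival_cdf(t)
        total += density(t) * FLOW * (HORIZON - t) * rival_still_searching * dt
    return total


if __name__ == "__main__":
    print("monopolist, safe :", round(monopoly_payoff(SAFE), 4))
    print("monopolist, risky:", round(monopoly_payoff(RISKY), 4))
    for own_name, own in (("safe", SAFE), ("risky", RISKY)):
        for rival_name, rival in (("safe", SAFE), ("risky", RISKY)):
            value = race_payoff(own, rival)
            print(f"own {own_name:5s} vs rival {rival_name:5s}: {value:.4f}")
```

Under these assumptions, a firm that switches from safe to risky while its rival stays safe raises its own expected payoff from about 1.33 to about 1.61 and lowers the rival's from about 1.33 to about 1.28, which is the negative externality described in the text.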


Now let us consider the issue of correlation. If firms 
choose similar projects, duplication of success (i.e., almost 
simultaneous discoveries) can be expected to occur more 
often than if they chose radically different R&D technologies.
There has recently been a lot of research on the
issue of whether competing firms have an incentive to 
choose similar technologies. Bhattacharya and Mookherjee 
(1986) and Dasgupta and Maskin (1986) have analyzed 
the correlation bias when the innovation is patented, 
and Glazer (1986) has performed a similar analysis for 
a nonpatented but proprietary process innovation by 
a product-market duopoly. Dasgupta and Maskin have 
shown that, under some assumptions, the equilibrium 
involves socially too much correlation. The intuition 
for this is that, when a firm moves away from its rival 
in the space of research projects, the first firm increases 
the probability that when it is unsuccessful its rival 
will be successful—which is socially desirable. Hence, 


15. Under discounting, even an R&D monopolist would enjoy mean-preserving
spreads in the discovery date, because the objective function, which is proportional
to exp(−rt), is convex in the date of discovery t.


16. This discussion assumes that firms maximize profits. Managers, however,
may have different attitudes toward risk, because their objective function
generally diverges from profit maximization (see “The Theory of the Firm”).
Indeed, it is often asserted that managers are too cautious in their investment
choices; see Holmström 1983, Holmström and Ricart i Costa 1986, and
Lambert 1986.

there may be too much similarity in the choice of project
characteristics.¹⁷


10.2.2.2 Experience in Patent Races


is an increasing, convex function. Thus, at each 
f the patent race, the game can be summarized 
by the vector of experiences for all firms.¹⁸

Experience is an instance of capital, and it is therefore
not surprising that themes encountered in chapter 8 (such
as natural monopoly and first-mover advantages) appear
here. Let us start with a useful but not necessarily appealing
result, that of “ε-preemption.” Consider a patent race
between two firms in which both firms make negative
expected profits if neither of them drops out of the race,
but each firm is viable as an R&D monopoly. This is a
natural-monopoly situation, and one would expect one
of the firms to drop out.¹⁹ It seems plausible that the
firm that drops out is the one with less experience (the
follower) if the firms have access to the same R&D
technology and have the same value for the patent. One
might further conjecture that this firm might as well drop
out at the beginning of the race. It does not seem reasonable
to accumulate experience and then to drop out when
the probability of discovery is higher. Fudenberg et al.
(1983), using a model in which x_i(t) equals either 0 (in
which case firm i exits the race) or 1 (firm i stays in) at
each instant, showed that the leader indeed obtains a
monopoly on R&D, even if it enters the race only a short
period of time before the follower (hence the term
“ε-preemption”). Harris and Vickers (1985) independently
obtained the same result under considerably weaker conditions;
they allowed x_i(t) to be completely arbitrary (i.e.,
they used a variable-intensity model).

Although the ε-preemption result shows that competition
in R&D may be strongly restricted by first-mover
advantages and experience effects, competition is often 
observed at the R&D stage. To explain this, we ought to 
relax some of the assumptions of the previous model. 
Fudenberg et al. (1983) show that the key to understand- 
ing competition at the R&D stage is the possibility of 
the follower's leapfrogging the leader (i.e., accumulating
more experience and jumping ahead in the race). They
offer two ways in which leapfrogging can occur. 

The first way is via information lags (i.e., firms observe
their rivals’ R&D efforts with some delay). To
understand this, let us come back to the intuition behind
ε-preemption. Consider a foot race between two athletes.
Assume that it is common knowledge that the two ath- 
letes are equally good, and that they prefer to reserve 
themselves (run at a slow pace) rather than exhaust them- 
selves by running at a fast pace. Suppose further that the 
leader has eyes in the back of his head and can monitor 
whether the follower is catching up. Because the leader 
can keep his lead by speeding up if the rival does so, there 
is no point for the rival in even engaging in the race. The 
leader can thus proceed at a slow pace without fear of 
being leapfrogged. But the picture changes dramatically if 
the two athletes run on tracks separated by a wall. Sup- 
pose that the wall has holes, so that from time to time 
each athlete can check his relative position. Now the 
leader can no longer run at the slow pace; if he did, the 
follower could run fast, leapfrog the leader without his 
noticing it, and force him to drop out of the race at the 
next hole. Thus, lags in information (or in reaction) en- 
gender competition. The same holds for patent races. To 


which the probability of discovery is contingent on the environment and on 
time or is not monotonic (no discovery after a certain amount of research might 
be bad news).


19. In a memoryless race with a constant patent value and a constant research
technology, the set of firms at the date of discovery is the same as the one at
the beginning of the race; no firm ever drops out.

formalize this, Fudenberg et al. consider a discrete-time,


id cy > 2c,. Thus, absent impatience, an R&D 
prefers to accumulate one unit of experience 
|. In the equilibrium of the patent race between 
s, competition is very vigorous when the 
firms are tied (have the same experience). Each firm
accumulates two units.²⁰ When the leader is one unit
of experience ahead, the leader randomizes between 1
and 2 and the follower randomizes between 0 and 2;
thus, there is competition (with some probability). When
the leader’s lead is at least 2, the follower drops out
of the race and the leader proceeds at the monopoly
pace (1).
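
The equilibrium behavior just summarized can be written as a small lookup, which makes it easy to see where the lead can move from one period to the next. This is only a sketch: the mixing probabilities are not given in the text, so the code records just the support of each firm's strategy, and (as the accompanying footnote warns) it presumes that both firms already have sufficient experience. The function names are hypothetical.

```python
# Illustrative sketch of the strategies described above for the discrete-time
# information-lag race. Only the SUPPORT of each (possibly mixed) strategy is
# encoded, because the text does not report the mixing probabilities.

def equilibrium_support(lead):
    """Return (leader_efforts, follower_efforts) for a lead measured in
    units of experience (lead = 0 means the firms are tied)."""
    if lead < 0:
        raise ValueError("measure the lead from the leader's side (lead >= 0)")
    if lead == 0:
        return [2], [2]          # tied firms both accumulate two units
    if lead == 1:
        return [1, 2], [0, 2]    # both firms randomize
    return [1], [0]              # follower drops out, leader at monopoly pace


def reachable_leads(lead):
    """Leads that can occur next period with positive probability."""
    leader_efforts, follower_efforts = equilibrium_support(lead)
    return sorted({lead + a - b for a in leader_efforts for b in follower_efforts})


if __name__ == "__main__":
    for lead in range(4):
        print(lead, "->", reachable_leads(lead))
```

With a one-unit lead the follower can draw even (the leader accumulates 1 while the follower accumulates 2), whereas with a lead of 2 or more the gap only widens.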

The second possibility for leapfrogging offered by
Fudenberg et al. involves a multistage patent race, which
allows for jumps in the experience variable (instead of
continuous variation, as above). They offer a two-stage,
fixed-intensity (x_i(t) = 0 or 1) patent race. A firm must
first make an intermediate discovery (the first stage) before
trying to obtain the patent (the second stage); alternatively,
one could think of the two stages as research
and development. In such a race, the first-stage follower
can leapfrog the leader by making the intermediate discovery
first. Again, the possibility of leapfrogging is
shown to generate competition. Grossman and Shapiro
(1987), Harris and Vickers (1987), and Judd (1985) have
extended this model considerably, most notably by
allowing variable R&D intensities.²¹ Their conclusions
are similar to those of the information-lag model. Competition
is most intense when firms are even. When the lagging
firm draws even, both firms intensify their research
effort. The leader tends to invest more in R&D than the
follower.


10.3 Welfare Analysis of Patent Protection 


The recent research on innovation has focused more on 
the positive aspects of R&D (Is there competition? Who 
invests more in R&D? etc.) than on the normative side of 
the patent system. To be certain, the welfare analysis is 
relatively complex, and more work is necessary before 
clear and applicable conclusions will be within reach. 
Some very incomplete notes concerning the issue of 
whether the market faces enough incentive to innovate 
will follow, the focus of which will be product innovations.
(Very similar remarks could be made about process
innovations.) 

A good way to begin is to think of product innovation 
as product selection (with a random introduction process).
We know from chapters 2 and 7 that the market may
offer too little or too much diversity. The same must
undoubtedly hold for product innovations. With infi- 
nitely long-lived patents, say, a firm may have too little 
or too much incentive to engage in R&D. The appropri- 
ability effect, according to which the private surplus from 
innovation is lower than the social surplus (in the absence 
of perfect price discrimination), leads to too little innova- 
tion (see section 10.1). In contrast, the business-stealing 
effect, according to which a firm that introduces a new 
product does not internalize the loss of profit suffered 
by its rivals on the product market, suggests too much 
innovation. Actually, another business-stealing effect 
(analyzed in section 10.2) arises in patent races: By in- 
creasing its R&D effort, a firm reduces the probability of 
its rivals’ obtaining the patent, and a typical result is that 
firms engaged in a patent race overinvest in R&D (if we 
assume away the appropriability effect)²² and thus duplicate
too much of the research effort.²³

The current theories are much too rudimentary to be 
realistic. They generally presume a single patented inno- 
vation, brought about by profit-maximizing firms with 
a single R&D technology. We have already noted some 


20. As long as they already have sufficient experience. Otherwise, the strategies
are more complex. See Fudenberg et al. 1983 and Lippman and McCardle 1987. 


21. Grossman and Shapiro also allow for the possibility of sharing intermediate
results through licensing, and for that of forming a joint venture at the first
stage of r 


22. See Lee and Wilde 1980 and exercise 10.5 above.

23. Of course, the existence of several independent research programs is not
bad per se, because “two chances are better than one.” (Indeed, a single firm may
set up several research teams. Kamien and Schwartz [1982, p. 45] mention that
the Upjohn Corporation once had six research teams pursuing six different
approaches to the development of a commercial process for the synthesis of
cortisone.) Rather, the overinvestment is due to the fact that firms do not
internalize the loss of patent revenues incurred by their preempted rivals.

extensions; e.g., the firms may choose among several
R&D technologies (involving different techniques and
different amounts of risk), or they may be run by managers
and therefore may not quite maximize profits (because
of the separation of ownership and control). It
would also be desirable to introduce various degrees of
effectiveness of the patent system. In practice, patents are
invented around or inventions are imitated (often with a
lag). To the best of our knowledge, very little has been
done concerning the optimal degree of patent protection,
ue in patent law. But, on the issue of imitation,
have long recognized that the possibility of 


reciting the public-good argument. The point, however,
is that the public-good problem is particularly acute (i.e.,
the incentives to do R&D are low) precisely when the
positive externalities on other firms (the spillovers) are
large.²⁵ Government subsidization of R&D activity is
likely to be a good substitute for a failing patent system
in industries with large spillovers (Spence 1984).

Now let us depart from the single-innovation paradigm.
Even in the absence of spillovers, the loser of a
patent race does not always lose everything; sometimes
it comes up with a patent for another product (or else
with more experience for the next patent race). Furthermore,
monopolies created by patents are temporary, even
with strict patent protection. New technologies are
continuously invented to replace old ones. Schumpeter
(1943) referred to this as a “process of creative destruction.”
It would thus be desirable to formalize successive
patent races.²⁶

Even if we are successful in determining whether firms 
engage in too little or too much R&D, the optimal way to 
encourage or discourage R&D remains to be determined. 
The theory has focused on the optimal patent length (see, 
e.g., Nordhaus 1969). However, R&D incentives can be
altered in a variety of ways. At the input level, R&D 
expenditures depend on subsidies. (For instance, the 1981 
U.S. Economic Recovery Tax Act added tax credits for
R&D investment; similarly, the Research Development 
Limited Partnership Act offered fiscal advantages to joint 
ventures.) At the output level, the payoff for innovation 
depends on the length of the patent (in the United States 
it is 17 years), on the scope of enforcement of patent 
protection, and on other factors. Little attention has been 
paid to the optimal package of these instruments to en- 
courage an adequate amount of R&D. 


10.4 Alternative Inducements to R&D 


Patents are not necessary for producing appropriability,
and therefore they are not necessary to induce R&D.
In general, unpatented innovations still yield gains to their
inventors, at least for a short period of time. Imitators
may observe an innovation with a lag, or may not have
the know-how to copy it immediately. Indeed, patents
play a minor role in some industries (e.g., computers).
Still, many economists agree with Schumpeter that patents,
and the concomitant static inefficiency associated
with monopoly power, are required to give firms proper
incentives to innovate, and that patents promote dynamic
efficiency (although these economists offer little information
about the optimal patent length and the optimal
amount of protection). But there are other methods of
encouraging innovation, such as the award system and
the contractual mechanism.


24. The following informal analysis assumes away the possibility of licensing
(see section 1). for an analysis of licensing). In practice, the patent holder may
find it profitable to license his patent (rather than let other firms inefficiently
imitate) and thus obtain some of the joint surplus from licensing. It should be
noted, however, that the existence of spillovers raises the licensees’ status-quo
(no licensing) payoff and thus strengthens their bargaining positions. Hence,
even if licensing occurs, the threat of imitation reduces the licensor’s payoff.


25. A mitigating factor, noted by many authors, is that the loser of a patent
race can better imitate the winner's discovery if he has kept abreast of the recent
developments in the field. In particular, R&D enables firms to more easily
assimilate their rivals’ spillovers. One must then view “absorptive capacity” and
innovation as joint products (Cohen and Levinthal 1987). This argument is
sometimes offered to explain the high level of R&D in the semiconductor
industry.


26. For a start on this, see Dasgupta and Stiglitz 1981, Futia 1980, Nelson and
Winter 1982 (chapters 12 and 13), and Reinganum 1985.

The award system, in its extreme form, consists in


, this method has very ancient origins; however, it 
much less frequently than the patent system. An 


bodies 


to transmit technological information (which 
icky when the technological know-how acquired 


atomic energy, and relates some examples of holdups 
created by the Atomic Energy Commission's Patent Compensation
Board.²⁷

A more serious rival to the patent system is a centralized
solution known as the procurement or contractual
mechanism. Although somewhat similar to the award system,
the contractual mechanism differs in that the government
controls access to the research market. More precisely,
the government chooses a certain number of firms²⁸
and signs a contract with these firms. The contract usually
contains more details than are specified when an award is
offered. For instance, it often specifies that a certain portion
of the research costs will be borne by the government.²⁹
Contracts of this sort may prevent excessive duplication
of research costs. However, there are incentive problems
linked to limited yardstick competition. The compromise
sought between these two factors depends on the research
technology and the ease with which the contracting
firms can be controlled.³⁰ As with the award system,
the government must know the value of the innovation.
Obviously, this is facilitated when the main customer for
the innovation is the agency. This explains why the procurement
system is often used in connection with space
and defense projects.


10.5 Strategic Adoption of New Technologies 


Technological progress depends on adoption of new 
technologies as much as on their invention. Whereas the
previous section focused on incentives and on the process 
of bringing about inventions, this section and the next 
look at the speed of diffusion of innovations. 

Few innovations are adopted instantaneously.³¹ Two
reasons for this are that firms may expect an increase in 
demand, and may be hesitant to sink the adoption cost 
before there is enough demand, and that they may expect 
a decrease in the adoption cost or in the uncertainty 
attached to the technology 


30. See Ponssard 1981 for a discussion of how French government agencies
trade off these two factors.


31. For instance, the basic oxygen furnace used to produce steel was developed
in Austria in 1949. In 1960 only 3.7 percent of the U.S. steel capacity (56.9
percent in 1970, 85 percent in 1980) used the process (Oster 1982). Similarly,
the technology to generate electricity using steam turbines was well known at
the time of its adoption.

erage, one would expect diffusion paths to be
S-shaped³² (indicating that a few firms adopt the invention
early, that the adoption process accelerates as other
firms learn about the invention, and that the process decelerates
when most firms have already adopted). Mansfield
(1968) and his followers have indeed demonstrated
that the S curve does well econometrically.³³ The precise
pattern of adoption in a given industry must, as we will
see, be studied in more detail, because of the strategic
aspects of adoption.
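
The S-shaped diffusion path described above can be illustrated with a logistic curve. This is a minimal sketch under an assumption: the logistic form and the parameter values below are chosen only for illustration and are not taken from the text, which does not commit to a particular functional form.

```python
import math

# Illustrative sketch of an S-shaped diffusion path.
# ASSUMPTION: a logistic curve with hypothetical parameters; the text only
# says that the cumulative share of adopters is S-shaped over time.


def adoption_share(t, midpoint=10.0, speed=0.5):
    """Cumulative share of firms that have adopted by date t."""
    return 1.0 / (1.0 + math.exp(-speed * (t - midpoint)))


def adoption_rate(t, midpoint=10.0, speed=0.5):
    """Flow of new adopters at date t (time derivative of the share)."""
    share = adoption_share(t, midpoint, speed)
    return speed * share * (1.0 - share)


if __name__ == "__main__":
    for date in range(0, 21, 5):
        print(date, round(adoption_share(date), 3), round(adoption_rate(date), 3))
```

The adoption rate is low when few firms have adopted, peaks at the midpoint, and falls again once most firms have adopted, which is the acceleration and deceleration pattern the text describes.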

In this section we will look at the diffusion process of
an unpatented innovation in a concentrated industry. We
will assume that an innovation made at date 0 can be
adopted (in a nonproprietary fashion) by any firm at any
future date t for cost C(t). Adoption is a once-and-for-all
process: C(t) is a sunk cost. We assume that C′(t) < 0
and C″(t) > 0 (the adoption cost decreases over time,
but at a decreasing rate) and that C(0) is “large” (no firm
wants to adopt at date 0). We formalize strategic adoption
by a duopoly. Each firm must choose a time (possibly
infinity) of adoption. We assume that information lags are
negligible, in that firms can observe (and respond to) their
rival’s action without a delay. As we will see, this implies
that the firms may be tempted to adopt the technology
early in order to delay or prevent adoption by their


which involves two firms and a process in- 


adopting second is never profitable. The first


monopoly rent associated with the adoption 
dissipated by the costly early adoption. In 


triggers immediate imitation. One would then expect 
little incentive to adopt. Adoption is delayed and occurs 
at the same time for both firms (actually, it never occurs 
in our extreme example). Thus, a key to the timing of 
adoption in a concentrated industry is the speed with 
which adoption is imitated. 


10.5.1 Imitation-Deterring Innovation: 
Preemption and Diffusion 


To illustrate our first outcome, consider a homogeneous-good
Bertrand duopoly in which two firms initially have
constant unit cost c̄. Neither firm makes a profit. Adopting
the innovation reduces the unit cost to c < c̄. Let
V = (c̄ − c)/r. When only one firm has adopted, this firm
makes Bertrand profit c̄ − c per unit of time, if c̄ does not
exceed the monopoly price at cost c (“nondrastic” innovation).
Imitation never occurs in this extreme model,
because Bertrand competition with identical costs yields
zero flow profit. Figure 10.1 depicts the payoff of the
first adopter (the leader) and that of the preempted firm
(the follower). The equilibrium time of adoption, t^c, is
given by V = C(t^c). If firm 1 plans to adopt after t^c,


Figure 10.1
Preemption and diffusion (vertical axis: leader's and follower's payoffs; horizontal axis: t). The leader's payoff is L(t) = [V − C(t)]e^{−rt};
the follower's is F(t) = 0.


32. This is a reference to the shape of the cumulative distribution of adoptions
as a function of time.


33. Kamien and Schwartz (1982), reviewing the empirical evidence, also note
that diffusion tends to be faster in nonconcentrated industries. However, see
Hannan and McDowell (1984) on the adoption of automatic teller machines in
various local banking markets.

34. The following is drawn from Fudenberg and Tirole 1985, 1987. Their
analysis owes much to the earlier work of Scherer (1967) and Reinganum
(1981a,b), who assumed that the firms choose their adoption dates irrevocably
at date 0 (see Quirmbach 1986 for further results obtained by this approach).
Because firms cannot revise their plans on the basis of observed adoption by
their rivals, the issue of strategic preemption analyzed in this section does not
arise in the analyses by Scherer and Reinganum.

firm 2 will do better to adopt just slightly earlier. Thus,


This zero-profit equilibrium satisfies Posner's two postulates
(see chapter 1). First, the monopoly rent is totally
dissipated; the adoption cost of the first adopter is equal
to the subsequent monopoly profit. Second, this dissipation
is completely wasteful from a social viewpoint. The


first-order condition is

$$r\,[V - C(t^m)] = |C'(t^m)|;$$

the interest on the delayed net profit (V − C(t^m)) is equal
to the cost savings associated with waiting. Note in particular
that V > C(t^m), which implies that t^m > t^c. The
innovation is adopted later when it is proprietary.

Of course, the absence of imitation by the follower
in the nonproprietary case is an artifact of our extreme
model. For instance, with product differentiation, the
preempted firm will end up adopting in the long run if the
cost C(t) falls sufficiently. A long diffusion lag between
the two firms would often be a more realistic description
of the adoption process.
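
The two adoption dates of this subsection, the preemption date at which V = C(t) and the proprietary date at which r[V − C(t)] = |C′(t)|, can be computed for a concrete cost function. The sketch below assumes an exponentially declining adoption cost and hypothetical parameter values; neither is taken from the text, which only requires C to be decreasing and convex with C(0) large.

```python
import math

# Illustrative parameters (all hypothetical, chosen only for the example).
R = 0.05        # interest rate r
V = 20.0        # V = (unit-cost saving) / r, here a flow saving of 1
K = 100.0       # adoption cost at date 0, "large" relative to V
DECAY = 0.10    # rate at which the adoption cost falls over time


def cost(t):
    """Assumed adoption cost C(t) = K exp(-DECAY t): decreasing and convex."""
    return K * math.exp(-DECAY * t)


def cost_slope(t):
    """Derivative C'(t) of the assumed cost function (negative)."""
    return -DECAY * K * math.exp(-DECAY * t)


def leader_payoff(t):
    """L(t) = [V - C(t)] exp(-r t): payoff from being the first adopter at t."""
    return (V - cost(t)) * math.exp(-R * t)


def bisect(fn, low, high, tol=1e-10):
    """Root of fn on [low, high], assuming fn changes sign exactly once there."""
    sign_low = fn(low) > 0
    while high - low > tol:
        mid = 0.5 * (low + high)
        if (fn(mid) > 0) == sign_low:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


# Nonproprietary (preemption) date: the rent is fully dissipated, V = C(t_c).
t_c = bisect(lambda t: V - cost(t), 0.0, 200.0)

# Proprietary date: first-order condition r [V - C(t_m)] = |C'(t_m)|,
# written as r [V - C(t)] + C'(t) = 0 because C'(t) < 0.
t_m = bisect(lambda t: R * (V - cost(t)) + cost_slope(t), 0.0, 200.0)

if __name__ == "__main__":
    print("preemption date  t_c:", round(t_c, 2))
    print("proprietary date t_m:", round(t_m, 2))
    print("leader payoff at t_c:", round(leader_payoff(t_c), 6))
    print("leader payoff at t_m:", round(leader_payoff(t_m), 3))
```

With these illustrative numbers the preemption date is about 16.1 and the proprietary date about 27.1, so the innovation is adopted later when it is proprietary, and the leader's payoff at the preemption date is zero.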


10.5.2 Quick Imitation and Delayed Joint Adoption


Consider, as a simple example of delayed adoption, the
following discrete-time model: Each duopolist initially
makes profit Π > 1 per period. The current cost of adopting
a new technology, C, is constant over time (this
violates our previous assumptions, but in an irrelevant
way), with

$$1 < C < \frac{1+r}{r},$$

where r is the per-period rate of interest. If only one firm
(the “leader”) has adopted the technology, its flow rent is
Π + 1 and its rival (the “follower”) has flow rent Π − 1.
If both firms have adopted, their flow rent is Π again.
Thus, the innovation serves merely to transfer profits 
from one firm to the other. The innovation can dissipate 
rents, but it cannot increase aggregate profit. At each 
date, each firm chooses whether to adopt (if it has not 
adopted yet) on the basis of history. 

There are several perfect equilibria in this game. We
will focus on the Pareto-inferior and Pareto-superior ones.
Note first that reaction is always immediate: If one firm
adopts at t, then the other adopts at t + 1, because the
flow profit associated with adoption, which is equal to
1, exceeds the interest on the adoption cost, which is
Cr/(1 + r). The Pareto-inferior (preemptive) equilibrium has
each firm adopt at each date (conditional on its not
having adopted before), independent of whether its rival
has yet adopted. In this equilibrium, each firm adopts at
date 0 and has a payoff of

$$\left(\frac{1+r}{r}\right)\Pi - C < \left(\frac{1+r}{r}\right)\Pi.$$

Firms are made worse off by the introduction of the
innovation (this could be called “super” rent dissipation).
The Pareto-superior (delayed-adoption) equilibrium has each
firm adopt only if its rival has adopted before. Thus,
adoption never takes place. Each firm's payoff is

$$\left(\frac{1+r}{r}\right)\Pi.$$

This is an equilibrium because C > 1. There is no super
rent dissipation.³⁶,³⁷
Although coordination mistakes are possible (each firm 


may suspect its rival and preempt before being pre- 
empted), one would expect both firms to coordinate on 


35. The equilibrium turns out to require mixed strategies of a special kind,
which are developed in Fudenberg and Tirole 1985.


36. If the firms never observed their rival's date of adoption, or if they were to
commit themselves at date 0 to an adoption date, the unique equilibrium
outcome would be the preemptive outcome described above: Given that the
rival's adoption date cannot be influenced, it is a dominant strategy to adopt as
early as possible, because the extra flow profit exceeds the interest saved by
delayed adoption by one period.


37. This late-joint-adoption equilibrium may depict the delayed introduction of
compact cars by the U.S. automobile industry in the 1950s. White (1971)
asserts that the three industry leaders recognized that if one of them introduced
a small car, they would all do so; consequently, they waited until the market
was large enough to accommodate all three of them, and then introduced their
compacts simultaneously in 1959.

the Pareto-superior delayed-adoption equilibrium. (The
argument for this equilibrium is stronger, the shorter the
lag between periods. For small lags, each firm gains little
by preempting and loses a large payoff if its rival decided
to wait.) Because the duopoly never adopts the new technology,
it automatically adopts more slowly than it
would if the technology were proprietary. In the example
developed here, the innovation is adopted immediately
when it is proprietary. Thus, the relationship between
property rights on an invention and the speed of its
adoption is highly dependent on the kind of rent brought
forth by the invention.
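
The payoffs in the discrete-time example of this subsection can be tabulated for given numbers. The sketch below is only an illustration: the parameter values are hypothetical, flows are assumed to accrue from the current period on (so a constant flow x is worth x(1 + r)/r), and the function and key names are made up for the example.

```python
# Illustrative sketch of the discrete-time adoption game described above.
# ASSUMPTION: flows are received from the current period on, so a perpetual
# flow x has present value x (1 + r) / r, matching the payoffs in the text.


def perpetuity(flow, r):
    """Present value of `flow` received every period, starting now."""
    return flow * (1.0 + r) / r


def adoption_game_payoffs(pi, cost, r):
    """Payoffs of one firm under the strategies discussed in the text."""
    if not 1.0 < cost < (1.0 + r) / r:
        raise ValueError("the example requires 1 < C < (1 + r)/r")
    status_quo = perpetuity(pi, r)
    return {
        # Neither firm ever adopts.
        "delayed_adoption": status_quo,
        # Both firms adopt at date 0.
        "preemptive": status_quo - cost,
        # Adopt today although the rival waits; the rival imitates next period.
        "deviate_from_delay": status_quo + 1.0 - cost,
        # Rival adopts today; wait one period (flow pi - 1), then adopt.
        "wait_when_rival_adopts": (pi - 1.0) + (status_quo - cost) / (1.0 + r),
        # Net gain from immediate adoption if the innovation were proprietary.
        "proprietary_gain": perpetuity(1.0, r) - cost,
    }


if __name__ == "__main__":
    for name, value in adoption_game_payoffs(pi=3.0, cost=5.0, r=0.1).items():
        print(f"{name:24s} {value:8.3f}")
```

In this illustration, deviating from delayed adoption is unprofitable because C > 1, waiting when the rival adopts is unprofitable because 1 > Cr/(1 + r), and a proprietary owner gains from adopting at once because C < (1 + r)/r.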

Can we build interesting models of markets that yield
a payoff structure similar to the one above? The following
exercise takes up chapter 7’s model of product differentiation
along a line and introduces a product innovation
that, if adopted by a firm, raises its customers’ valuation
for its good by a constant. The product innovation does
not increase demand (the number of customers is fixed,
and they have unit demands), and the industry profit is
not affected by adoption by one of the two firms as long
as the innovation is “small.”
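
The claim that a small innovation leaves industry profit essentially unchanged can be illustrated with the standard linear-city pricing equilibrium. The sketch below is not taken from the text: it assumes a city of length 1 with a unit mass of consumers, zero production cost, a covered market, and price competition between firms located at the two ends, in which case a firm with quality advantage ΔS earns (t + ΔS/3)²/(2t) and its rival (t − ΔS/3)²/(2t). The function name is hypothetical.

```python
# Illustrative sketch: flow profits in a linear-city duopoly when one firm's
# good is valued quality_gap more than its rival's.
# ASSUMPTIONS: city of length 1, unit mass of consumers with unit demands,
# zero production cost, covered market, linear transport cost.


def hotelling_profits(quality_gap, transport_cost):
    """Return equilibrium flow profits (advantaged firm, disadvantaged firm)."""
    if not 0.0 <= quality_gap < 3.0 * transport_cost:
        raise ValueError("need 0 <= quality_gap < 3 * transport_cost")
    high = (transport_cost + quality_gap / 3.0) ** 2 / (2.0 * transport_cost)
    low = (transport_cost - quality_gap / 3.0) ** 2 / (2.0 * transport_cost)
    return high, low


if __name__ == "__main__":
    t = 1.0
    for gap in (0.0, 0.1, 0.3):
        high, low = hotelling_profits(gap, t)
        print(f"gap={gap:.1f}  adopter={high:.4f}  rival={low:.4f}  "
              f"industry={high + low:.4f}")
```

Under these assumptions industry profit equals t + ΔS²/(9t), so it differs from the no-innovation level t only by a term of second order in ΔS, while the adopter gains and the rival loses roughly ΔS/3 each.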


Exercise 10.7* Two firms are located at the two extremes
of the city; consumers are uniformly distributed along the
segment, and transportation costs are linear. The firms
can introduce a product innovation that raises the quality
of their good and thereby raises the consumers’ valuation
for the good by ΔS. The (fixed) cost of adopting the
is C per firm. Let t denote the transportation
se that ΔS/3 < t and that C < ΔS(1 + r)/3r


hat when ΔS is small, this industry profit is
independent, to the second order in ΔS, of the number of


late-adoption paradigm considered above.
that the innovation, if it were proprietary,
lopted instantaneously. (In the whole exercise,


Exercise 10.8*** Consider a duopoly. The incumbent
(entrant) makes flow profit Π_0^m (respectively, 0) before
adopting a process innovation. If only the entrant has
adopted, firms 1 and 2 make flow profits Π_1^d and Π_2^d. If
only the incumbent has adopted, it makes flow profit
Π_1^m > Π_0^m. If both firms have adopted, each makes a
flow profit of Π^d. Suppose that Π_1^d + Π_2^d < Π_0^m and
Π_1^d < Π^d < Π_2^d. Time is continuous, and the rate of interest
is r. The cost of adoption, C(t), is decreasing and
convex, is “high” at date 0 (no one wants to adopt initially),
and eventually becomes “low” (both firms end up
adopting).

(i) Interpret the assumptions about the flow profits. 

(ii) Show that the entrant is a “faster second,” in 
the sense that it reacts earlier to preemption than the 
incumbent. 

(iii) Draw the leader and follower curves (as in figure 
10.1) for the two firms. 

(iv) Show that the incumbent preempts the entrant. 

(v) Can you think of an alternative specification of
payoffs that leads to adoption of a nonproprietary tech- 
nology by the entrant first? 


Katz and Shapiro (1984) look at a development game 
in which development leads to a property right on the 
innovation (contrary to the case considered here). Thus, 
the innovation in their model is deterministic and pa- 
tented. As in the adoption game, each firm chooses an 
optimal date of innovation. The Katz-Shapiro model 
introduces two interesting possibilities: licensing and 
imitation. While patenting leads to obvious preemptive 
motives, the possibility of imitation makes the innovation 
more of a public good and introduces reverse incentives. 
Each firm then benefits when the other firm moves first. 
Whether the development game resembles a preemption 
game or a war of attrition depends on how cheap and 
lengthy imitation is. For instance, a reduction in the cost 
of imitation tends to delay development. Licensing gives 
rise to even more complex incentives. On the one hand, 
it may enhance the value of the patent (see the supple- 
mentary section) and thus favor preemption and early 
development; on the other hand, it may also enhance the 
value of being second, as the follower can, at a cost, enjoy 
the patent. 


10.6 Network Externalities, Standardization, and 
Compatibility 


This section examines adoption in an industry charac- 
terized by complementarity in consumption or produc- 
tion, and studies the influence of the existence of network 
externalities and compatibility decisions on the diffusion 
path.

Positive network externalities arise when a good is
more valuable to a user the more users adopt the same
good or compatible ones. The externality can be direct (a


products can be supplied—and at a lower 
hen the network grows (more programs are 


t the size of the relevant network is either 
ific (as is often the case with automobiles) or 
wide (an extreme example is that of phono- 
‘ause of the standardization of records). 

st issue with network externalities lies on the 


nology to coordinate on. These two considera- 
to two potential inefficiencies: excess inertia 


(users wait to adopt a new technology or to choose 
among several technologies) and excess momentum (consumers 
rush to an inferior technology for fear of getting 


cond issue relates to the supply side, and the 
ologies are chosen and promoted. In the pres- 


standard, even when retraining costs are taken into account. However, it would 
be foolish for a firm to build this alternative keyboard and for secretaries to 


are usually standardized. The standards for stereo tele- 
visions in the United States were developed by an 
industry committee and were reinforced by the Federal 
Communications Commission through protection from 
interference by other users. One advantage of standard- 
ization is that it avoids excess inertia.** Another is that 
it reduces the users’ search and coordination costs (see 
Carlton and Klamer 1983). Standardization may, however, 
be difficult. Rapidly changing technologies may 
impose the adoption of an inefficient approach. Also, 
standardization may reduce diversity.*” A non-negligible 
amount of standardization is actually left to the market- 
place. Technologies are then mostly sponsored by firms. 
A dominant firm often sets the standard. (This was the 
case with AT&T before its breakup. IBM has always 
played an important role in imposing standards in the 
computer industry. Home Box Office, the largest cable 
television service, led in defining a signal-scrambling 
standard.) But the adoption of a standard need not be 
associated with a dominant firm. The QWERTY type- 
writer keyboard became standard for typewriters without 
such an association.*” 

When technology choices are left to the marketplace, 
firms sponsoring incompatible technologies have an in- 
centive to develop an “installed base” in order to obtain a 
competitive advantage over their rivals or to prevent the 
rivals from acquiring an advantage. To this purpose, they 
can start with low (penetration) prices and announce pro- 
ducts before they are commercialized. Another important 
element in this strategy is the compatibility decision. 
For strategic purposes, firms can choose to keep their 
products incompatible, thus reducing the size of their 
network. Or they can achieve compatibility, either indi- 
vidually (through the choice of technology or by building 
adapters) or by reaching agreements with their com- 
petitors through committees.*? 

The implications of network externalities for industrial 
organization have been better understood in recent years 


switch to it individually. See David 1985 for the history of the typewriter 
industry. 
41. In general, the firms have the choice between individual adoption and 
it. They may, in particular, use the threat of “going on their own” 
to improve their bargaining position. An interesting contrast arises between 
noncooperative adoption (which, because of conflicting preferences on the 
choice of technology, resembles a preemption game) and cooperative adoption 
(which resembles more a war of attrition, with each firm waiting for its rivals to 
concede). Farrell and Saloner (1987) elaborate on these two themes. 
because of the work of Farrell, Katz, Saloner, and Shapiro. 
Farrell and Saloner have concentrated on the demand 
side; Katz and Shapiro have focused on the supply side. 
Their work will be reviewed in the next two subsections. 


in the broadcasting and telecommunication-services 
industries, respectively.) 

The new literature on network externalities is deeply 
rooted in traditional microeconomic and industrial-organization 
theories. On the demand side, the externalities 
give rise to a multiplicity of equilibria, to inefficiencies, 
and to a need for coordination even when the 
technologies are supplied competitively (these are familiar 
themes in the literature on public finance and, more recently, 
in the macroeconomics literature). On the supply 
side, the choice of technologies relates to the problem of 
product diversity. On both sides, decisions by firms and 
users to adopt certain technologies give rise to games of 
timing, the two polar cases of which are wars of attrition 
and preemption games. 


10.6.1 The Demand Side: Users’ Coordination 
of Expectations 


Let us begin with a simple illustration of the coordination 


their decision, whatever this decision is. (If these condi- 


tions did not hold, the users would always prefer one 
technology to the other and the coordination problem 
would not arise.) 

Suppose that the two users choose simultaneously 
whether to switch to the new technology. It is easy to see 
that there are two pure-strategy equilibria*?: Both users 
stick with the old technology, or both users adopt the 
new one. Using a terminology due to Farrell and Saloner, 
we can illustrate the possibilities of excess inertia and 
excess momentum. Excess inertia arises when v(2) > u(2) 
but the two users stick to the old technology. Coordinat- 
ing on the new technology would be Pareto superior, but 
each user is afraid of moving alone. Excess momentum 
arises when u(2) > v(2) but the two users switch to the 
new technology for fear of getting stranded with the old 
one. 
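The two pure-strategy equilibria can be checked by brute force. The sketch below is only an illustration: the payoff numbers are hypothetical (the text gives none), with u[q] standing for the value of the old technology and v[q] for the value of the new one when q users are on the same network. The helper names are likewise assumptions made for this example.

```python
from itertools import product

# Hypothetical payoffs: u[q] = old technology, v[q] = new technology,
# where q is the number of users on the same network (1 or 2).
u_example = {1: 1.0, 2: 3.0}
v_example = {1: 0.5, 2: 4.0}


def payoff(own, other, u, v):
    """Payoff to a user choosing `own` when the other user chooses `other`."""
    network_size = 2 if own == other else 1
    return u[network_size] if own == "old" else v[network_size]


def pure_equilibria(u, v):
    """List the pure-strategy Nash equilibria of the simultaneous-move game."""
    flip = {"old": "new", "new": "old"}
    found = []
    for a, b in product(("old", "new"), repeat=2):
        a_stays = payoff(a, b, u, v) >= payoff(flip[a], b, u, v)
        b_stays = payoff(b, a, u, v) >= payoff(flip[b], a, u, v)
        if a_stays and b_stays:
            found.append((a, b))
    return found


equilibria = pure_equilibria(u_example, v_example)
# With these numbers v(2) > u(2), so the ("old", "old") equilibrium
# is an instance of excess inertia in the sense used in the text.
excess_inertia_possible = (
    ("old", "old") in equilibria and v_example[2] > u_example[2]
)
print(equilibria, excess_inertia_possible)
```

Reversing the ranking of u[2] and v[2] in the example turns the ("new", "new") equilibrium into an instance of excess momentum.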

Naturally, the inefficiency associated with excess mo- 
mentum or inertia is an artifact of the simultaneity of the 
model. For instance, to avoid excess inertia, one user 
could (in real time) adopt the new technology and entice 
the other user to follow suit. Indeed, Farrell and Saloner 
(1985) argue in a more general framework that excess 
inertia is not really an issue with such congruent prefer- 
ences. Similarly, excess momentum is not really an issue if 
a user can react very quickly to the other user's decision 
to switch. A user sticking to the old technology will not 
get stranded for very long if other users switch.** Thus, 
excess inertia and excess momentum may not be prob- 
lematic except in two circumstances: when the information 
or reaction lags are long** and when the users may have 
conflicting preferences as to the choice of standard (but 
may still enjoy network externalities). 

Farrell and Saloner (1985) consider a model involving 
potentially conflicting technological preferences. Suppose 
that in the previous model, a user’s taste for the technologies 
is indexed by and varies continuously with a 
parameter $\theta$ in the interval [0, 1]; say, $u_\theta(q)$ and $v_\theta(q)$. 
We will interpret $\theta$ as a parameter of preference for the 
new technology, in the sense that $v_\theta(2) - u_\theta(1)$ increases 
with $\theta$; that is, the user's willingness to join the other 


42. Other relevant papers include Adams and Brock 1982, Arthur 1985, 
Dybvig and Spatt 1983, and Rohlfs 1974. 


43. There also exists a mixed-strategy one, which gives a lower expected 
utility to both users than the two pure-strategy ones. 


44. In particular, with immediate response to the other user's switch it is a 
weakly dominant strategy not to initiate the switching process. 


45. In this circumstance, coordination is particularly important; still, one would 
expect that firms with congruent preferences (such as those discussed above) 
could communicate or could agree on a standard through a committee. 
user and adopt the new technology increases with $\theta$. 
Suppose further that $v_1(1) > u_1(2)$ and $v_0(2) < u_0(1)$. 
That is, a user with $\theta$ close to 1 prefers the new technology, 
independent of the other user's behavior. (Switching 
is a dominant strategy for such a user.) Similarly, a user 
with $\theta$ close to 0 always prefers the old technology. The 


iori (independently) drawn from the uniform 
on (0, 1). The timing of the game has two 


if they have not yet done so. Switching is 
. and payoffs accrue at the end of date 2 (so 


switched at date 1 (i.e., to jump on the bandwagon); and 
(3) to switch at date 1 (possibly initiating the bandwagon). 
As is easily seen, the fourth possible strategy 
(switching at date 2 even if the other user has not 
switched) is dominated; the user would then be better off 
switching at date 1, which would increase the probability 
that the other user would also switch. 

Intuitively, each user ought to follow strategy 1 if it 
has a low $\theta$, strategy 2 if it has an intermediate $\theta$, and 
strategy 3 if it has a high $\theta$. Indeed, an equilibrium behavior 
must take this form (this is just an “incentive-compatibility” 
condition). Figure 10.2 depicts a symmetric 
equilibrium (a perfect Bayesian equilibrium, in the terminology 
of the User’s Manual). The values of the parameters 
$\theta^*$ and $\theta^{**}$ such that the user is indifferent between 
strategies 1 and 2 and between strategies 2 and 3 are 
given by the following equations: 


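$$u_{\theta^*}(1) = v_{\theta^*}(2), \tag{10.2}$$

$$v_{\theta^{**}}(2)\,(1 - \theta^*) + v_{\theta^{**}}(1)\,\theta^* = v_{\theta^{**}}(2)\,(1 - \theta^{**}) + u_{\theta^{**}}(2)\,\theta^{**}. \tag{10.3}$$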


To see this, first consider the behavior of a user with 
parameter $\theta^*$. Neither strategy 1 nor strategy 2 prescribes 
switching at date 1. The choice between the two strategies 
hinges on whether the user jumps on the bandwagon 
in period 2 if the other user has switched at date 
1. (Because the user has not switched at date 1, the other 
user will not switch at date 2 if he has not done so at 
date 1.) Equation 10.2 simply expresses the indifference 
between sticking with the old technology and following 
suit. 

[Figure 10.2: A bandwagon situation in network externalities. 
The interval [0, 1] of $\theta$ is divided at $\theta^*$ and $\theta^{**}$: do not 
switch (below $\theta^*$), jump on the bandwagon at date 2 (between 
$\theta^*$ and $\theta^{**}$), switch at date 1 (above $\theta^{**}$).] 

The derivation of equation 10.3 is slightly more complex. 
It expresses the indifference between strategies 2 
and 3. If a user switches at date 1, the other user switches 
at date 1 or follows suit at date 2 if and only if his 
parameter exceeds $\theta^*$, which has probability $1 - \theta^*$. The 
first user then gets $v_{\theta^{**}}(2)$. With probability $\theta^*$, however, 
switching elicits no response and the user gets $v_{\theta^{**}}(1)$. If 
the user adopts strategy 2 instead, he switches only if the 
other user has done so in the first period, which has 
probability $1 - \theta^{**}$. He then gets $v_{\theta^{**}}(2)$; otherwise, he 
gets $u_{\theta^{**}}(2)$. Equation 10.3 and the fact that $\theta^{**} > \theta^*$ imply 
that $u_{\theta^{**}}(2) > v_{\theta^{**}}(1)$; that is, the user with parameter $\theta^{**}$ 
prefers not to switch alone. Equation 10.3 also implies 
that $v_{\theta^{**}}(2) > u_{\theta^{**}}(2)$. 

It is easily seen that this equilibrium exhibits excess 
inertia. For instance, if both users have parameters $\theta_1$ and 
$\theta_2$ just below $\theta^{**}$, they would like to coordinate on the 
new technology (as $v_{\theta^{**}}(2) > u_{\theta^{**}}(2)$), but in equilibrium 
they do not. Each user is happy to jump on the band- 
wagon, but neither one is sufficiently eager to set it roll- 
ing himself, as he might be adopting the new technology 
alone. This behavior is reminiscent of the asymmetric- 
information war of attrition discussed in section 9.9. 
Switching first is like supplying a public good, as each 
user would like to make sure that the other user will also 
switch. (Similarly, exiting a natural-monopoly industry is 
a “public good” from the point of view of the concerned 
firms, which do not want to move first.) 
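The two cutoffs and the excess-inertia region can be computed numerically. The sketch below solves the indifference conditions 10.2 and 10.3 by bisection. The linear payoff functions are purely hypothetical, chosen only so that the stated assumptions hold (the new technology is dominant at the top of the taste range, the old one at the bottom, and the gain from following suit rises with the taste parameter). The function names are also assumptions for this example.

```python
def u(theta, q):
    """Old technology: value depends only on network size q (assumed form)."""
    return float(q)


def v(theta, q):
    """New technology: network value plus a taste term rising in theta (assumed form)."""
    return q + 3.0 * theta - 1.5


def bisect(f, lo, hi, tol=1e-12):
    """Root of f on [lo, hi] by bisection; f must change sign on the interval."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Equation 10.2: indifference between staying and following suit.
theta_star = bisect(lambda t: v(t, 2) - u(t, 1), 0.0, 1.0)


def gap_10_3(t):
    """Payoff from switching at date 1 minus payoff from waiting (equation 10.3)."""
    lead = v(t, 2) * (1.0 - theta_star) + v(t, 1) * theta_star
    wait = v(t, 2) * (1.0 - t) + u(t, 2) * t
    return lead - wait


theta_2star = bisect(gap_10_3, theta_star, 1.0)

# Users above this taste level prefer joint adoption of the new technology.
theta_eff = bisect(lambda t: v(t, 2) - u(t, 2), 0.0, 1.0)

print(theta_star, theta_2star, theta_eff)
```

Under these assumed payoffs the efficiency threshold lies strictly below the upper cutoff, so two users whose parameters both fall between the two would gain from coordinating on the new technology yet both wait: the excess inertia described above.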

There are several ways to fight excess inertia. First, as 
Farrell and Saloner (1985) show, communication between 
the users alleviates the problem of (symmetric) excess 
inertia. Second, the users may be able to sign contracts (or 
reach agreements in committees*®) that enable them to 
coordinate.*” Third, as Postrell (who extends the Farrell-Saloner 
model to n users) shows, government subsidies 
ing users also alleviates the problem of excess 


10.6.2 The Supply Side: Sponsorship and 
Strategic Behavior 


The study of users’ expectations is a building block for 
the study of product rivalry. Katz and Shapiro (1985a, 
1986a,b) have studied penetration pricing and com- 
patibility choices by sponsoring firms. The first of these 
papers focuses on the compatibility issue in a static 
model. Consider a duopoly offering two incompatible 
products. Except for their incompatibility, the two prod- 
ucts are identical (perfect substitutes). Consumers have 
unit demands. One unit of product i, sponsored by firm i, 
yields the consumers a surplus equal to $v(q_i)$ plus a 
consumer-specific constant, where $q_i$ is the size of firm i's 
network (the number of consumers who buy from firm i). 
The function $v(\cdot)$ is increasing. A generalized price $\tilde{p}_i$ for 
consumers can thus be defined as $p_i - v(q_i^e)$, where $p_i$ is 
the price charged by firm i and $q_i^e$ is the consumers’ expectation 
of the size of firm i’s network. Because the two 
products are perfect substitutes, the consumers choose the 
one with the lowest generalized price: $\tilde{p} = \min(\tilde{p}_1, \tilde{p}_2)$. 
The number of consumers buying at $\tilde{p}$ is $q = 1 - \tilde{p}$. 
(Consumers, who have unit demands, value good i at 
$s + v(q_i^e)$, where $s$ is distributed uniformly among them. 
We also assume that each consumer is negligible in the 
sense that he does not affect the size of the network he 
belongs to.) 

Firms compete in a Cournot fashion (see chapter 5). 
They choose outputs (or capacities) $q_1$ and $q_2$ simultaneously. 
The market then clears at a generalized price of 

$$\tilde{p} = 1 - (q_1 + q_2).$$

The firms thus charge*® 

$$p_i = v(q_i^e) + 1 - q_1 - q_2.$$

Under the assumption of a constant and symmetric 
marginal cost of production c, firm i's profit is 

$$\Pi^i(q_i, q_j) = q_i\,[1 + v(q_i^e) - c - q_1 - q_2].$$


48. Adoption of the new technology may be inefficient, even if old and new 
users ex ante all prefer the new technology. The point is that old users want to 


consume before the new technology appears, so that after sinking their cost 
they prefer the old technology. 


49. Because the consumers do not observe the output choice, firms cannot 
influence the consumers’ beliefs about the size of their network. (It is also 
assumed that the consumers do not update these beliefs when they observe 
prices that differ from the one they expect in equilibrium.) Katz and Shapiro also 
consider the case in which the quantities are observed by the consumers before 
they purchase. 
Thus, the valuation for the network $v(q_i^e)$ is equivalent 
to a reduction in marginal cost or to an increase in the 
demand function. 
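A small numerical sketch may help fix ideas. It takes the profit function above, treats expected network sizes as given when firms choose outputs, and then iterates until expectations coincide with realized outputs. The consistency requirement on expectations, the linear form of the network valuation, the cost level, and all function names are assumptions made for this illustration only.

```python
def v(q):
    """Network valuation; the linear form is an illustrative assumption."""
    return 0.5 * q


def cournot_given_expectations(q1_e, q2_e, c):
    """Interior Cournot outputs when consumers expect network sizes (q1_e, q2_e).

    Firm i maximizes q_i * (1 + v(q_i_e) - c - q_1 - q_2) taking q_i_e as given,
    so its first-order condition is 2*q_i + q_j = 1 + v(q_i_e) - c.
    """
    a1 = 1.0 + v(q1_e) - c
    a2 = 1.0 + v(q2_e) - c
    q1 = (2.0 * a1 - a2) / 3.0
    q2 = (2.0 * a2 - a1) / 3.0
    return q1, q2


def consistent_expectations_outputs(c, start=(0.1, 0.3), tol=1e-12, max_iter=10_000):
    """Iterate expectations -> outputs until the two coincide."""
    q1_e, q2_e = start
    for _ in range(max_iter):
        q1, q2 = cournot_given_expectations(q1_e, q2_e, c)
        if abs(q1 - q1_e) < tol and abs(q2 - q2_e) < tol:
            return q1, q2
        q1_e, q2_e = q1, q2
    raise RuntimeError("expectations did not converge")


q1, q2 = consistent_expectations_outputs(c=0.2)
generalized_price = 1.0 - (q1 + q2)
price = v(q1) + generalized_price  # price charged by firm 1
print(q1, q2, generalized_price, price)
```

With a zero network valuation the same code returns the standard Cournot outputs (1 - c)/3, which makes visible the sense in which the network term acts like a cost reduction or a demand shift.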


y find a unique equilibrium under some assump- 


tive to 


'k may be socially too low or too high (as in the 
on product diversity, firms internalize neither 


competition in the first period; see the papers cited 
for Katz and Shapiro's exact definitions of “weak” and 
“strong.” They also uncover an interesting “paradox”: 
Suppose that the first product is cheaper to produce in 
the first period and that the second is cheaper to produce 
in the second period. (Think of the second product as 
involving a newer technology, so that there is learning.) 
One might expect the market to be biased in favor of the 
first technology, so that a firm would build an installed 
base in the first period and use this installed base to 
corner the second-period market. It turns out, however, 
that, with foresight on the part of the consumers, market 
adoption is biased toward the technology that is cheaper 
in the second period. This technology enables a firm to 
penetrate the market in the first period by pricing low; its 
low second-period cost commits it to a relatively low 
second-period price, and thus to a second-period increase 
in the network size. In contrast, the first technology cannot 
promise an extension of its installed base if its owner 
cannot commit in advance to a low second-period price. 


10.7 Concluding Remarks 


Although the main effects governing R&D (appropri- 
ability versus business stealing, spillovers versus patent 
protection, the efficiency effect versus the replacement 
effect) are relatively well understood, economists have 
devoted little attention to the optimal size and mix of 
public policies with regard to R&D (patent length and 
protection, R&D subsidies, and so on). A similar remark 
can be made concerning the adoption and diffusion of 
new technologies. 

In addition to the desirable development of operational 
welfare criteria, two other R&D-related matters are wide 
open for further exploration. First, one can (as was mentioned 
above) depart from the single-innovation para- 
digm and envision the evolution of an industry, with suc- 
cessive and competing patents on differentiated products 
and/or superior technologies to produce a given good. 
An interesting product of research along this line may be 
a study of the socially optimal degree of protection of 
product and process innovations. Second, formal agree- 
ments between firms (e.g., licensing or joint research 
ventures) play a non-negligible role in the conception and 
diffusion of new technologies. The supplementary section 
contains a discussion of the link between such agreements 
and R&D activity. More work on the topic is warranted, 
if only to better develop the antitrust approach to these 
practices. 


50. In these papers it is assumed that firms compete in prices. This gives rise 
to a multiplicity of equilibria of the sort analyzed in subsection 10.6.1. It is 
assumed that consumers (who have the same preferences between the two 

However, capacity constraints restrict the flexibility of the consumers’ conjectures 
as to the size of the networks: as Katz and Shapiro show, strong 
assumptions on the $v(\cdot)$ function then yield a unique equilibrium, which they 
cannot do under Bertrand competition.) 
patent. In this supplementary section, we consider briefly 
the possibility of technology transfer to other firms 
under licensing. We will, naturally, encounter some of the 
themes developed in the chapter on vertical control, 
together with a few new ones. We will also consider 
another form of contract that affects the distribution 
of discoveries and the incentive for R&D: research joint 
ventures. 


10.8.1 The Incentive to License 


We can distinguish two types of licensing. First, an independent 
inventor (or a firm specializing in R&D) may 
be unable to exploit a patent and therefore may license 
the technology to a “downstream” firm. Second, even if 
the inventor has production capability, he may still license 
a rival. The incentive to license is clear in the first case: The 
patent would have no value in the absence of licensing. It 
is less clear in the second case (or in the case in which an 
independent inventor licenses more than one downstream 
firm). The previous chapters emphasized how product 
competition may destroy industry profits (the so-called 
efficiency effect). Consider, for instance, a duopoly. A firm 
licensing a process innovation to its rival may, by 
lowering its rival's cost, trigger tough competition and 
lower the industry's profits. On the other hand, licensing 
can occur only if it raises the two firms’ (and, therefore, 
the industry's) profits. This suggests that there are limits 
to licensing, and indeed Firestone (1971) notes that 
most of the patents held by corporations are used exclusively 
by those corporations,*! and that most patents 
held by independent inventors are licensed to a single 
firm. 

Still, licensing is a non-negligible phenomenon (furthermore, 
a case might be made that too little licensing takes 
place from a social point of view). We distinguish three 
motivations to license (for a corporation) or to license to 
more than one firm (for an independent inventor):*? 


+ Product-market incentives Product-market competition 
may create incentives for the managers, who would 
otherwise take advantage of their monopoly position. 
The Rey-Tirole model reviewed in chapter 4 shows that 
the manufacturer of an intermediate product trades off the 
destruction of monopoly power and increased incentives 
when deciding whether to create downstream competi- 
tion. An independent inventor faces the same trade-off 
when deciding whether to license to several manufac- 
turers, and so does a patent-holding manufacturer when 
deciding whether to license to a rival (as long as this 
manufacturer faces managerial incentive problems due 
to the separation of ownership and control). Another 
instance of improved incentives associated with compe- 
tition is formalized in the incomplete-contract models of 
Farrell and Gallini (1986) and Shepard (1986), who show 
that cross-licensing may guarantee ex post the quality of 
the licensor’s product and increase the incentives for the 
product's users to invest in relationship-specific capital 
(see the chapter on the theory of the firm). We will not 
consider these motivations to license further. 


+ Soft product-market competition The argument that in- 
dustry profit is destroyed by competition relies on strong 
Bertrand competition. Recall, however, from chapters 5 
and 6 that product-market competition may be softened 
by three factors: product differentiation, capacity con- 
straints, and intertemporal price collusion. In such cases, 
the industry cost savings associated with the high-cost 
producer's using the low-cost technology may well dom- 
inate the loss in industry profits. This is particularly clear 
in the case of product differentiation. In the extreme 
case, a firm's market position is not affected by a license 
to another firm that serves a nonoverlapping geographical 
market or produces an otherwise unrelated product. 
And it can appropriate some of the cost savings realized 
by the licensee. Licensing is then a purely vertical ar- 
rangement. The case of capacity constraints is considered 
below. 


51. Of course, another reason why a firm is reluctant to license a rival is that 
the concomitant transfer of knowledge might allow the rival to invent around 
the patent, or even to develop a superior technology. 




52. We will ignore here contracts that, under the pretext of licensing, serve to 
facilitate horizontal collusion; most patent laws are obviously concerned with 
such a possibility. See Scherer 1980 (pp. 173, 452) and Vaughan 1956. 
+ Strategic licensing The first two motivations dealt with 
“ex post” licensing. Licensing can also take place “ex ante” 
her research takes place) to lower one’s rival's 
to invent around the initial innovation. Whereas 


especially on Katz and Shapiro 1985b, 1986c) 
the incentive of a patent holder to license a 
process innovation, given that no future innovation will 
occur. 


The first issue is the type of contract that can be signed 


a final output. Kamien and Tauman (1983) consider two-part 
tariffs: A + Rq (the licensee pays a fixed fee A 
for access to the technology and then a variable fee or 
royalty R per unit of final output produced using the 
intermediate technology). Gallini (1984) and Katz and 
Shapiro (in most of their 1985b paper) allow only fixed 
fees; Gallini and Winter (1985) allow only royalties. There 
may in practice exist information or legal reasons why 
contracts are restricted to simple forms such as a fixed fee 
or a royalty. For instance, Katz and Shapiro argue that, 
in some circumstances, the licensee’s output (or how 
much of this output is produced under the licensing 
agreement) may not be observable by the licensor, so that 
it is impossible to base transfers on output. In this case 
the fixed-fee contract is a good approximation of reality. 
The evidence shows a wide variety of licensing agreements 
in various industries. Calvert (1964) and Taylor 
and Silberston (1973) observe that about 50 percent of 
licensing contracts specify royalties only, 10 percent a 
fixed fee only, and the remaining 40 percent a two-part 


ood industry in which the initial technology 
to all firms) allows production at cost $\bar{c}$. As 
before, an innovation is called drastic if it brings the 
production cost to a level $c$ such that the corresponding 
monopoly price is below $\bar{c}$. We will consider the case of 
drastic innovation and that of nondrastic innovation. 
10.8.2.1 Drastic Innovation 


We start with the case of a firm “specializing in R&D.” 
This firm generates the process innovation, but it does 
not have at its disposal the production structure necessary 
to implement it. Therefore, it transfers the innovation to 
a manufacturer. We assume that the manufacturer is in 
a monopoly situation in the output market (n = 1). 
(The purpose of beginning with the monopoly case is to 
separate the effects of an optimal transfer contract from 
the effects of competition in the output market.) Recall 
that the producing firm's initial unit cost is $\bar{c}$ and that 
the (process) innovation lowers this cost to $c < \bar{c}$, and 
assume that the firm specializing in R&D may use a two-part 
pricing schedule (i.e., collect A + Rq per unit of time). 
Then the true marginal cost for the producing firm is 
$\tilde{c} = c + R$. Let $\Pi^m(\tilde{c})$ be the profit that the producing 
firm can obtain (prior to paying the fixed fee) in the 
market when its marginal cost is $\tilde{c}$. 

Exercise 10.9, which is based on the principle of re- 
sidual claimancy (see chapter 4), shows that the optimal 
licensing contract for the licensor is 


$$\{A = \Pi^m(c) - \Pi^m(\bar{c}),\; R = 0\}.$$


That is, the optimal contract does not include royalties 
and requires a fixed fee equal to the difference in profit 
between using the new and the old technology. The intuition 
is that the absence of royalties prevents distortion of 
the final output, because the marginal cost facing the 
manufacturer is equal to the marginal cost of the verti- 
cally integrated structure. The licensor then extracts the 
licensee's surplus from signing the agreement through 
the fixed fee. 
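The no-royalty result can be illustrated numerically. The sketch below assumes a linear demand D(p) = 1 - p for the monopoly manufacturer, and cost levels chosen so that the innovation is drastic; neither the demand form nor the numbers come from the text. For each royalty R the licensor is assumed to set the largest fixed fee the licensee would accept, and total licensing income is then compared across royalty rates on a grid.

```python
def monopoly_profit(mc):
    """Monopoly profit with demand D(p) = 1 - p and marginal cost mc (assumed demand)."""
    return (1.0 - mc) ** 2 / 4.0 if mc < 1.0 else 0.0


def monopoly_output(mc):
    """Monopoly output under the same assumed demand."""
    return (1.0 - mc) / 2.0 if mc < 1.0 else 0.0


def licensor_income(R, c_new, c_old):
    """Largest acceptable fixed fee plus royalty receipts at royalty rate R.

    The licensee accepts as long as it keeps at least its old-technology profit,
    so the fee is its profit at marginal cost c_new + R minus that outside profit.
    """
    fee = monopoly_profit(c_new + R) - monopoly_profit(c_old)
    return fee + R * monopoly_output(c_new + R)


c_new, c_old = 0.0, 0.6   # monopoly price at c_new is 0.5 < c_old: a drastic innovation
royalties = [k / 100 for k in range(0, 61)]
best_R = max(royalties, key=lambda R: licensor_income(R, c_new, c_old))
best_A = monopoly_profit(c_new + best_R) - monopoly_profit(c_old)
print(best_R, best_A)
```

In this example licensing income equals (1 - R^2)/4 minus the licensee's outside profit, so any positive royalty lowers it: the royalty raises the licensee's perceived marginal cost and shrinks the total profit to be shared.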


Exercise 10.9** Show that the optimal licensing contract 
includes R = 0. 


Of course, since the monopoly is in a monopsony 
situation against the firm specializing in R&D, there is 
little chance that such a contract will be signed. (There 
is a bilateral monopoly situation in the “innovations 
market.”) However, the haggling between the two firms 
should be over the fixed premium A (between 0 and 
$\Pi^m(c) - \Pi^m(\bar{c})$), and not over R. The absence of royalties 
still prevents distortion at the production level and, 
therefore, generates the maximum total profit to be 
divided. 
Having solved for the optimal licensing agreement, we 
obtain the value of the patent for the R&D firm: A (per 
unit of time). If the licensor can appropriate the whole 


alistically, the manufacturer is likely to capture 
some of the surplus, and the specialized firm then has 
ive to innovate than the vertically integrated 
structure. In the language of Williamson, part of the 
specialized firm's investment is expropriated, in that it is 
sidual claimant for the full increase in profits. 
a contract signed before R&D takes place 
restores the specialized firm's correct incentive to invest 


For a 


posed of n manufacturers (n > 1) each using the old technology 
at cost $\bar{c}$. Should the firm transfer the patent to 
any of the producing firms? What profit can it 

its innovation? Clearly, an upper bound on 
profit is the monopoly profit $\Pi^m(c)$ in the 
product market. Suppose that the specialized firm auctions 
off its innovation, and that this auction fixes the royalty 


$\Pi^m(c)$, since its pre-auction profit (if any) will 
disappear in any case. Hence, the specialized firm may 
obtain $\Pi^m(c)$ by selling its innovation to the highest 
bidder (in a first- or second-bid auction, for example). 
Therefore, in the case of a drastic innovation, it is optimal 
for the specialized firm to transfer its patent exclusively. 
To transfer it to many producing firms would only introduce 
ex post competition in the product market; therefore 
it would encourage the dissipation of rent and reduce the 
potential profit associated with the patent transfer. Similarly, 
if the patent is held by one of the manufacturers, the 
invention will not be licensed. Hence, a drastic innovation 
is used by a single firm. 


10.8.2.2 Nondrastic Innovation 


When the innovation is not drastic, and with Cournot 
competition, a nonlicensed manufacturer continues to 
produce a positive amount at marginal cost $\bar{c}$ (unless 
it incurs a substantial fixed cost). This gives rise to a 
productive inefficiency, and possibly to an incentive to 
extend the license to this inefficient manufacturer. Of 
course, to know if licensing really occurs, we must check 
that the decrease in industry revenue associated with 
more intense product competition (if any) does not offset 
the cost savings. To do so, we must make assumptions 
about the mode of product-market competition. Follow- 
ing the literature, let us assume that the manufacturers 
choose quantities (so the process innovation might be 
thought of as one that reduces the cost of installing capac- 
ities—see chapter 5). Also, for simplicity, let us consider 
a duopoly (n = 2). The patent holder is one of the two 
firms. 

Suppose first that the licensing agreement can include 
a royalty (with or without a fixed fee). Gallini and Winter 
(1985), Kamien and Tauman (1983), and Katz and Shapiro 
(1985b) show that licensing always occurs. Suppose that 
the patent holder offers access to the technology $c$ at 
royalty rate $R = \bar{c} - c$ (no fixed fee). Because the marginal 
cost for the licensee has not changed ($c + R = \bar{c}$), 
the product-market competition is unaffected, and industry 
profits are increased by $(\bar{c} - c)$ times the licensee's output 
(and are appropriated by the licensor). Thus, there exists 
a licensing contract that Pareto dominates the no-licensing 
situation.53,54 

Now let us follow Katz and Shapiro and assume that 
only a fixed fee can be specified. With a fixed (franchise) fee, 
licensing occurs if and only if it raises industry profit. It 
is easily seen that licensing does not always occur. For 
instance, we know from subsection 10.8.2.1 that in the 
drastic-innovation case licensing would reduce industry 
profit; this must also be the case for almost-drastic inno- 
vations. In contrast, Katz and Shapiro show that licensing 
does occur if the firms have very similar costs after the 
innovation. 
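Both cases can be illustrated with a Cournot duopoly under an assumed linear inverse demand P = 1 - Q. The demand form and the cost figures are hypothetical, and the function names are made up for this sketch. The first function computes what the licensor gains from the pure-royalty contract described above. The second applies the fixed-fee criterion, comparing industry profit with and without the transfer of the low-cost technology.

```python
def cournot(c1, c2):
    """Interior Cournot outputs and profits with inverse demand P = 1 - Q (assumed)."""
    q1 = (1.0 - 2.0 * c1 + c2) / 3.0
    q2 = (1.0 - 2.0 * c2 + c1) / 3.0
    # At an interior equilibrium the margin P - c_i equals q_i, so profit_i = q_i ** 2.
    return (q1, q2), (q1 * q1, q2 * q2)


def royalty_gain(c_old, c_new):
    """Licensor's gain from a pure royalty R = c_old - c_new.

    The licensee's effective marginal cost stays at c_old, so outputs are those
    of the no-licensing equilibrium and the licensor collects R per licensee unit.
    """
    (_, q_licensee), _ = cournot(c_new, c_old)
    return (c_old - c_new) * q_licensee


def fixed_fee_licensing_occurs(c_old, c_new):
    """Under a fixed fee, licensing occurs only if it raises industry profit."""
    _, before = cournot(c_new, c_old)   # patent holder at c_new, rival at c_old
    _, after = cournot(c_new, c_new)    # both firms use the new technology
    return sum(after) > sum(before)


small_gap = fixed_fee_licensing_occurs(c_old=0.40, c_new=0.35)
near_drastic = fixed_fee_licensing_occurs(c_old=0.50, c_new=0.05)
print(royalty_gain(0.40, 0.35), small_gap, near_drastic)
```

With similar costs the cost saving outweighs the extra competition and the fixed-fee license is signed; with an almost-drastic cost gap the unlicensed rival is nearly inactive, so licensing it would mostly destroy near-monopoly profit.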


53. The above reasoning does not apply to price competition. Under price 
competition, the licensor does not take the licensee's output as given, and 
therefore the royalties affect his marginal incentives. More specifically, an 
increase in the licensor's price raises the demand for the licensee's product and 
therefore the royalties. The presence of royalties thus softens the licensor’s 
price behavior, and a priori should increase industry profits (independent of the 
cost savings), which makes licensing even more desirable. 


54. Gallini and Winter, studying the pure royalty case (no fixed fee), show that 
under reasonable conditions the above royalty rate, $R = \bar{c} - c$, is indeed the 
optimal contract for the licensor. 
Exercise 10.10** Suppose that in a symmetric Cournot 
duopoly a small and symmetric reduction in cost increases 
firm (and industry) profits. (A sufficient condition for this 
is that the Cournot equilibrium be stable and the industry 
marginal revenue be downward sloping.) Assume further 
that, starting at symmetric cost $\bar{c}$, one of the firms innovates 
at $c = \bar{c} - \varepsilon$, where $\varepsilon$ is small. Show that licensing 
increases industry profits (and that licensing therefore 
occurs) under the fixed-fee system. (Hint: Use the symmetry 
of the problem.) 


What about the social desirability of licensing (given 
that the innovation is exploited noncompetitively)? Katz 
and Shapiro (1985b) note that, very generally, privately 
desirable licensing is also socially desirable. For this it 
suffices that the industry output grow when one of the 
firms’ cost decreases, a reasonable condition. Consumer 


ay not occur.** 


Katz and Shapiro (1985b, 1986c) also look at the effect


Gallini (1984) and Gallini and Winter (1985) make the
interesting point that licensing may not only reduce production
costs but may also eliminate inefficient R&D expenditures.
A licensee has less of an incentive to invent
around the licensor’s patent because its marginal cost
has decreased, which makes innovation less desirable. A
simple example illustrates their point. Suppose that firm 1, 


the incumbent, is initially a monopoly and produces at 
cost c. Suppose further that firm 2, the entrant, can, 
by spending K on R&D, acquire a different technology, 
which also gives access to marginal cost c. That is, firm 2 
can invent around firm 1’s patent. Let $\Pi^d$ denote each
firm's duopoly profit. If $\Pi^d > K$, firm 2 is willing to
undertake R&D. However, the two firms have an incen- 
tive to enter a licensing agreement to save the R&D 
cost K. 

Gallini and Winter consider a duopoly model due to 
Reinganum (1983b) and introduce the possibility of li- 
censing ex ante and ex post. The timing is as follows. First 
the firms (starting with asymmetric costs) may enter an ex 
ante licensing agreement. They then decide whether or 
not to undertake R&D (a zero-one decision). Undertaking 
R&D costs K and gives rise to a patentable process in- 
novation at a cost drawn from some distribution. Then, in 
a second stage, the firms can enter an ex post licensing
agreement (given their new costs) and finally compete on 
the product market. Gallini and Winter show that the 
overall effect of licensing is to encourage research when 
the initial costs are relatively symmetric, but to eliminate 
research when costs are far apart. 


10.8.4 Research Joint Ventures 


“Research joint ventures” (RJVs) are arrangements in 
which some firms agree to share the expenditures and 
the benefits associated with a given research project. 
Although conceptually very distinct, RJVs and licensing 
agreements have at least two features in common. First, 
they are contractual practices that considerably affect the 
level of R&D, and the diffusion of innovation, in the 
industry. Second, although initially they pertain to the 
input (innovation) market, they may also involve ancillary 
restraints on the output market. 

Very little work has been done on the subject of RJVs, 
which is surprising in view of their potential importance 


55. Katz and Shapiro also note that licensing is not always socially desirable.
To see this, suppose that firm 1 faces cost $c_1$. Firm 2's cost is above $p^m(c_1)$.
Suppose further that licensing brings firm 2's cost down to $c_2$, just under $p^m(c_1)$.
Thus, firm 2 is intrinsically more inefficient than firm 1. The derivative of
welfare with respect to $c_2$, evaluated at $c_2 = p^m(c_1)$, is

$\frac{\partial q_1}{\partial c_2}\,[p^m(c_1) - c_1]$,

using the fact that $q_2 = 0$ at $c_2 = p^m(c_1)$. Now, $p^m(c_1) > c_1$. Thus, under
the reasonable assumption that $\partial q_1/\partial c_2 > 0$ in Cournot equilibrium, welfare
increases with $c_2$, meaning that licensing is not desirable. The idea is that, in this
case, licensing introduces an inefficiency in the industry's production structure.




in the antitrust area (particularly for high-technology in-


diture (the incentive to do R&D). Depending on the
industry, it may increase or decrease R&D activity. Two
reasons have been given as to why RJVs may speed up
innovation. The first is that when patent protection is not
completely effective and innovations create spillovers, 
firms that conduct R&D individually do not internalize 
the positive externality on their rivals associated with an 
innovation. They thus tend to underinvest in R&D from 
an industry point of view. An RJV corrects at least the 
externality between its members, and thus increases R&D 
spending of the members. Second, it may be the case that
the fixed cost of doing R&D is so high that certain firms 
dertake it by themselves. An RJV may give 
s a means to exploit increasing returns to scale 


ls in a nonconcentrated industry seem socially 
, they may well slow down research in concen- 
lustries, 

ay include ancillary restraints that prevent the 


That is, the contract may facilitate horizontal collusion. 
For instance, a royalty per unit of output to be paid to the 
venture raises the firms’ marginal cost, thus lowering out- 
put and raising price. An even less subtle restraint spec- 
ifies the division of the product market for any patent 
that may emerge from the joint research. Such ancillary 
restraints are likely to be socially undesirable. (However, 
one can think of cases in which they are not. For instance, 
a process innovation that yields marginal cost c to two 
firms will create no profit if the firms compete in a Bertrand
fashion. Ancillary restraints that smooth product-market
competition are then necessary in order for an RJV
of these firms to undertake R&D.) 

To obtain a more complete view of the private and 
social desirability of RJVs, one ought to look at further 
effects (for instance, the exclusionary effect of an RJV 
between a subset of firms in an industry), incorporate the 
possibility that the market may generate too much or too 
little R&D in the absence of RJVs (as we have seen in this 
chapter), and look at the alternative public instruments 
that regulate R&D activity. The topic is certainly worth 
the effort. 


this, go back to the delayed-joint-adoption argument of subsection 
instance, exercise 10.7 gives an example of a product innovation
that increases the innovator's profit by an amount equal to the decrease in its
rival's profit. That is, consumers benefit from the innovation, not the industry.




Instead of assuming that the innovation is nonproprietary, one can change this
model slightly and assume that the innovation is covered by a patent and that
the two firms are engaged in a patent race in the absence of an RJV. Clearly, if the
firms formed an RJV, they would not do any R&D.
Answers and Hints


Exercise 10.1


For a drastic innovation,

$V^c = \frac{1}{r} D(p^m(c))[p^m(c) - c]$,

$V^m = \frac{1}{r}\{D(p^m(c))[p^m(c) - c] - D(p^m(\bar{c}))[p^m(\bar{c}) - \bar{c}]\} < V^c$,

and

$V^s = \frac{1}{r}\int_{c}^{\bar{c}} D(c)\,dc$.

Figure 10.3 shows why $V^s > V^c$.


Exercise 10.2


(i) When the unit cost decreases, the monopoly lowers
its price, so consumer surplus increases. Hence, $W^m > V^m$.
Similarly, under competition, the consumer price goes
down unless the innovation is nondrastic, in which
case the consumer surplus is unchanged and $W^c = V^c$.
(The innovation is then entirely devoted to cost savings.)

(ii) For a drastic innovation, the ex post welfare (when
the cost is $c$) is the same under monopoly and competition.
But, ex ante, the welfare is higher under competition
than under monopoly. Hence, the change in welfare is
higher under monopoly: $W^m > W^c$.
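(iii) Let $w^m(c)$ denote welfare under monopoly and cost $c$.
$W^m = w^m(c) - w^m(\bar{c})$. And similarly for $W^c$. For a
nondrastic innovation,

$\frac{dw^m}{dc} = -D(p^m)\left(1 + \frac{dp^m}{dc}\right)$

and

$\frac{dw^c}{dc} = -D(\bar{c})$,

with

$p^m(c) > \bar{c}$.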


Figure 10.3

For a linear demand,

$p^m(c) = \frac{a + bc}{2b}$;

thus,

$\frac{dw^m}{dc} = -\frac{3}{4}(a - bc)$

and

$\frac{dw^c}{dc} = -(a - b\bar{c})$.


Now, for very minor innovations, $c \approx \bar{c}$; hence, $W^m < W^c$.
When the innovation becomes major (drastic),


This implies that $W^m > W^c$, which confirms our previous
result that this inequality indeed holds for a drastic
innovation. More generally, it is easy to show that $W^m < W^c$
for small innovations and $W^m > W^c$ for bigger,
nondrastic ones.


Exercise 10.3 


The value is equal to

$\frac{(a - bc)^2}{4b} - \frac{(a - bc)^2}{b(n + 1)^2}$.

Exercise 10.4


Let $\Delta = \bar{c} - c$, $D = 1 - b\bar{c}$, and $\tau = 1 - e^{-rT}$. If a cost
reduction $\Delta$ costs $K\Delta^2/2$, the firm maximizes

$V(\tau) = \max_{\Delta}\left(\frac{\tau \Delta D}{r} - \frac{K\Delta^2}{2}\right)$,

which yields

$\Delta(\tau) = \frac{\tau D}{rK}$.

The longer the patent (i.e., the higher $\tau$), the higher the
cost reduction. Welfare is equal to

$W(\tau) = V(\tau) + \frac{1 - \tau}{r}\left[D\Delta(\tau) + \frac{b\Delta(\tau)^2}{2}\right]$.

But, from the envelope theorem, $\partial V/\partial \tau = \Delta D/r$. Thus,

$\frac{dW}{d\tau} = \frac{\tau D^2}{r^2 K} + \frac{1 - \tau}{r}\left(\frac{D^2}{rK} + \frac{b\tau D^2}{r^2K^2}\right) - \frac{1}{r}\left(\frac{\tau D^2}{rK} + \frac{b\tau^2 D^2}{2r^2K^2}\right) = 0$,

which yields


$\frac{3}{2}b\tau^2 + (rK - b)\tau - rK = 0$.


The positive solution lies between 0 and 1, so the optimal
patent length is finite. Differentiating this equation and
using the second-order condition shows that $d\tau/db < 0$;
thus, a lower elasticity of demand calls for a longer patent.
Draw a diagram, and compare the firm's profit and
the consumer surplus for various values of the elasticity of
demand.
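
A small numerical check of this answer can be sketched as follows. It assumes the first-order condition takes the form $\frac{3}{2}b\tau^2 + (rK - b)\tau - rK = 0$ and that welfare is $W(\tau) = V(\tau) + \frac{1-\tau}{r}[D\Delta + b\Delta^2/2]$ with $\Delta = \tau D/(rK)$. The function names and the parameter values ($b$, $r$, $K$, $D$) are illustrative assumptions, not taken from the text.

```python
import math


def optimal_tau(b: float, r: float, K: float) -> float:
    """Positive root of (3/2) b tau^2 + (rK - b) tau - rK = 0."""
    A = 1.5 * b
    B = r * K - b
    C = -r * K
    disc = B * B - 4.0 * A * C
    return (-B + math.sqrt(disc)) / (2.0 * A)


def welfare(tau: float, b: float, r: float, K: float, D: float = 0.5) -> float:
    """W(tau) when the firm picks the cost reduction delta(tau) = tau*D/(r*K)."""
    delta = tau * D / (r * K)
    firm_value = tau * delta * D / r - K * delta ** 2 / 2.0
    post_patent_gain = (1.0 - tau) / r * (D * delta + b * delta ** 2 / 2.0)
    return firm_value + post_patent_gain


# Hypothetical parameter values, chosen only for illustration.
tau_low_b = optimal_tau(b=1.0, r=0.1, K=10.0)
tau_high_b = optimal_tau(b=2.0, r=0.1, K=10.0)
```

With these values the root lies strictly between 0 and 1 and falls when $b$ rises, in line with the comparative statics stated above.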


Exercise 10.5⁵⁷


(i) The probability that none of the firms has discovered
by time $t$ is

$\exp\{-[h(y) + (n - 1)h(x)]t\}$.

Let firm 1 choose intensity $y$, and let firms $2, 3, \ldots, n$
choose intensity $x$. Firm 1’s expected profit is

$\int_0^{\infty} [h(y)V - y]\,e^{-[h(y) + (n - 1)h(x)]t}\,e^{-rt}\,dt = \frac{h(y)V - y}{h(y) + (n - 1)h(x) + r}$.

Letting $H(x) = (n - 1)h(x)$ and differentiating with respect
to $y$, we obtain the first-order condition

$[H(x) + r][h'(y)V - 1] - h(y) + h'(y)y = 0$. (10.4)

(The objective function is strictly concave.) The first-order
condition 10.4 yields $dR/dH > 0$. Thus, when the
number of rivals increases or when these rivals raise their
research intensity, firm 1 speeds up its research.

(ii) The symmetric Nash equilibrium is given by $x = R(x)$, or

$[(n - 1)h(x) + r][h'(x)V - 1] - h(x) + h'(x)x = 0$. (10.5)

Assume that the left-hand side of equation 10.5 decreases
with $x$. (This requires a “stability condition,” according to
which the reaction curve is not too steep; see p. 432 of
Lee and Wilde 1980 for more detail.) The left-hand side of
equation 10.5 is positive at $x = 0$ (because $h'(0) = +\infty$)
and negative at $x = +\infty$ (because $h'(\infty) = 0$ and $h'' < 0$).
Hence, there exists a unique equilibrium intensity
$x^*(n)$. Differentiating equation 10.5 yields

$[-]\,dx^* + \{h(x^*)[h'(x^*)V - 1]\}\,dn = 0$,

where $[-]$ indicates a negative expression (the negativity
follows from the s.o.c.). But from the concavity of $h$ and
the fact that $h(0) = 0$,

$h(x) > xh'(x)$.

Thus, equation 10.5 implies that

$h'(x^*)V - 1 > 0$.

We thus get

$\frac{dx^*}{dn} > 0$.
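
The comparative-statics result can be illustrated numerically. The sketch below assumes the hypothetical hazard function $h(x) = 2\sqrt{x}$ (which satisfies $h(0) = 0$, $h'(0) = +\infty$, $h'(\infty) = 0$, and $h'' < 0$) together with illustrative values $V = 10$ and $r = 0.1$. It solves equation 10.5, $[(n - 1)h(x) + r][h'(x)V - 1] - h(x) + h'(x)x = 0$, by bisection for several values of $n$. None of these functional forms or numbers comes from the text.

```python
import math


def lhs_10_5(x: float, n: int, V: float, r: float) -> float:
    """Left-hand side of equation 10.5 for the assumed hazard h(x) = 2*sqrt(x)."""
    h = 2.0 * math.sqrt(x)
    h_prime = 1.0 / math.sqrt(x)
    return ((n - 1) * h + r) * (h_prime * V - 1.0) - h + h_prime * x


def equilibrium_intensity(n: int, V: float = 10.0, r: float = 0.1) -> float:
    """Bisection on a bracket where the left-hand side switches from + to -."""
    lo, hi = 1e-12, 1e6
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if lhs_10_5(mid, n, V, r) > 0.0:
            lo = mid   # still on the positive side: the root is to the right
        else:
            hi = mid   # on the negative side: the root is to the left
    return 0.5 * (lo + hi)


# Equilibrium research intensity per firm for n = 1, ..., 5.
intensities = [equilibrium_intensity(n) for n in range(1, 6)]
```

For this assumed specification the left-hand side is decreasing in $x$, so the bisection bracket contains the unique root, and the computed intensities rise with $n$.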


The socially optimal expenditure per firm and unit of 
time is lower than x*. Note the basic externality: When 


57. The following analysis is drawn from Lee and Wilde 1980. 
a firm lowers expenditures slightly from $x^*$, it incurs a
second-order loss and it increases its rivals’ profits to the
first order.


Exercise 10.6⁵⁸


(i) Firm 1's expected profit is
T (py. p2 


= I (T — b) f(t py)I1 — Flt, p2)ldt — C(p,). 


T 
[ o(T — HF (t py E(t pa)dt 


t Tr 
[o«r- ftenn( { ae vids] 
o oO 


T '" 
| w(—f+(T— onl [ Bs) at 
oO 0 


T 7 
| f(T on [ flo) 


cy 0 


ant _ 
Op, 

Thus, 

d om' 

= IM1"(p4 p*)] = ——(p", p*) <0 
rr p ap? 


Nash equilibrium in technology choices, and let $\{\hat{\rho}, \hat{\rho}\}$
denote the symmetric joint profit-maximizing choices.
We know that

$\Pi^1(\rho^*, \rho^*) - \Pi^1(\hat{\rho}, \rho^*) \geq 0$

(by definition of a Nash equilibrium), and that

$\Pi^1(\hat{\rho}, \hat{\rho}) - \Pi^1(\rho^*, \rho^*) > 0$

(from joint profit maximization). Adding these two
inequalities yields

$\Pi^1(\hat{\rho}, \hat{\rho}) - \Pi^1(\hat{\rho}, \rho^*) > 0$,

which implies from (i) that $\hat{\rho} < \rho^*$.

Exercise 10.7


Suppose that only firm 1 has adopted. The cutoff location
$x$ is given by

$(s + \Delta s) - p_1 - tx = s - p_2 - t(1 - x)$.

The demand functions are then

$D_1(p_1, p_2) = \frac{t + \Delta s + p_2 - p_1}{2t}$

and

$D_2(p_1, p_2) = \frac{t - \Delta s + p_1 - p_2}{2t}$.

In Nash equilibrium,

$p_1 = c + t + \Delta s/3$,

$p_2 = c + t - \Delta s/3$,

$\Pi^1 = \frac{(t + \Delta s/3)^2}{2t}$,

and

$\Pi^2 = \frac{(t - \Delta s/3)^2}{2t}$.

Thus, $\Pi^1 + \Pi^2 = t + (\Delta s)^2/(9t) \simeq t$ for $\Delta s$ small. The
condition $\Delta s/3 < t$ ensures that firm 2 remains active.
The equilibrium when either no firm or both firms have
adopted is the same as in chapter 7.

The follower has an incentive to follow suit: It in- 
creases its profit by 


1l+r/t  (t—As5/3)? 
r G 2F )-e 


~ itras 
=~ ie ag 


for AS small. On the assumption made in the exercise, this 
increase is positive. 


58. This exercise follows Klette and de Meza 1986 and Dasgupta and Maskin
1986. 
uch more substantial quality improvements,
drives its rival out of the market by adopt-
novation. The payoff for adopting first is
important, and the equilibrium is one of early
preemption.


Exercise 10.8⁵⁹


(i) 11? + 113 < TIP is the efficiency effect. 1¢ < m* 
< I$ suggests that the new technology is superior to
the incumbent's initial technology.

(ii) The optimal date for following is given by (for 
firm 1) 


rC(TF) 4 |C'(TF)| =m — 11 


and (for firm 2) 
rC(TE) 4 |C(TE)| = 1. 


Thus, $T_1^F > T_2^F$, owing to the fact that the incumbent
already makes a profit before adopting.

(iv) The incumbent preempts at date $T_2$ such that
$L_2(T_2) = F_2(T_2)$.

(v) Suppose that the old technology enables the en- 
trant to produce at cost F, and that the “new” technology 
also allows production at cost f. Clearly, only the entrant 
will ever adopt this new technology (recall, it is non-
; otherwise the incumbent might shelve the 
associated patent), 


Exercise 10.9


For $R = 0$, the sum of the two profits (that of the specialized
firm and that of the manufacturer) is equal to $\Pi^m(c)$.
Thus, the vertically integrated profit is obtained. The
fixed fee is then used to share this profit.


Exercise 10.10


By assumption,

$\frac{d}{dc}[\Pi^1(c, c) + \Pi^2(c, c)] < 0$.

But, because of the symmetry,

$\frac{\partial}{\partial c_1}[\Pi^1(c_1, c_2) + \Pi^2(c_1, c_2)] = \frac{1}{2}\,\frac{d}{dc}[\Pi^1(c, c) + \Pi^2(c, c)] < 0$

at $c_1 = c_2 = c$. Thus, industry profit is increased when one firm's cost is
decreased for almost equal costs.


59. This is only a sketch. For more details, see Fudenberg and Tirole 1986,
pp. 36-41.
Noncooperative Game
Theory: A User’s Manual


Noncooperative game theory has become an important 
tool for analyzing strategic interaction between players 
and has found many applications in the field of industrial 
organization. This chapter is intended to introduce the 
aspects of the theory that have proved most useful in 
industrial organization and to familiarize the reader with 
their applications. It will help the reader to develop a kit 
of game-theory tools that he or she will be able to call 
upon when studying industrial organization. The kit will 
be composed of four basic tools, starting with the concept 
of Nash equilibrium for static games of complete informa- 
tion and continuing with its natural extensions to dy- 
namic games of complete information, to static games of 
incomplete information, and to dynamic games of incom- 
plete information. 

The analysis is intentionally informal, and it is not 
meant to be a complete overview. It borrows freely from 
Fudenberg and Tirole 1986c (and also, in the portion 
devoted to dynamic games of incomplete information, 
from Tirole 1983). Serious students of game theory are 
strongly advised to study the more formal literature. 


11.1 Games and Strategies 


There are two ways of formalizing a game. One is to 
describe the extensive form, which specifies the order of 
play, the information and choices available to a player 
whenever it is his turn to play, the payoffs for all players 
(contingent on all players’ choices), and (possibly) a probability
distribution for moves by “nature.”² The “tree” of
the game (an ordering of “nodes”) depicts this extensive
form. The tree represented in figure 11.1 describes the 
following two-player game: At “time t = 1,” only player 
one makes a decision. He faces two choices, referred to as 


1. For more formal treatments see Friedman 1986, Luce and Raiffa 1957,
Moulin 1982, Kohlberg and Mertens 1986, Kreps and Wilson 1982, and
Myerson 1984. Moulin’s book contains an impressive selection of exercises.


2. See Kreps and Wilson 1982 or Luce and Raiffa 1957 for more formal 
definitions. 


r plays during only one period, it is unnecessary to 
the actions and strategies per period. For con- 


r 2's nodes, which indicates that he has the same 
ation whether player 1 has moved (or is moving, or 


simultaneous game (game 2), he necessarily has the same 
actions available, as otherwise he would be able to distinguish
the two nodes. In the jargon of this chapter, game
2 is a static game and game 1 a dynamic game; this is
because in the latter, but not in the former, a player
(player 2) has the possibility of observing and reacting to
the other player's actions. At a formal level, these two
types of games need not be separated; however, this
distinction makes the progressive introduction of extensions
of the Nash concept more convenient.

We assume that the entire structure of the tree is “common
knowledge”: All players know it, know that their
opponents know it, and so on.⁴ Any exogenous uncertainties
(i.e., moves of nature) must be incorporated into
the tree.⁵

In our discussion of games 1 and 2, we talked about the
players moving left or right. These are called pure strategies.
A pure strategy is the choice by a player of a given
action with certainty. In contrast, a player (say, player 1)
might randomize between left and right, i.e., play left with 


Figure 11.2 
Game 2. 


probability x and right with probability 1 — x, with x in 
[0, 1]. Such a strategy is called a mixed strategy.⁶ A pure
strategy is a special case of a mixed strategy (with x = 0 
or 1 in the previous example). 

The normal-form representation of an extensive-form 
game is a summarized description of the extensive form. 
The normal form is a collection of the pure strategies 
available to each player at each of his information sets 
in the extensive form. In games 1 and 2, player 1 has 
two pure strategies: $a_1^1 = L$ and $a_1^2 = R$. In game 1, player


3. We could represent this game equally well by the extensive form in which
player 2 moves first but player 1's information set includes two nodes.


4. On the notion of common knowledge, see Aumann 1974, 1976 and
Milgrom 1981a.


sections 11.4 and 11.5 
6. More accurately, in an extensive-form game, a behavioral strategy for player
i specifies for each of his information sets a probability distribution over the
actions that are feasible in that set. The term mixed strategy is often reserved for
the normal form (see below). Kuhn (1953) has shown that the two concepts are
equivalent as long as each player recalls at any point in the tree what he knew
previously (i.e., at earlier nodes). So we will identify both concepts.
2 has four pure strategies: a} = {7.7}. at = {rr}. ad


ying right. In game 2, player 2 has only two pure 
ies, which correspond to the strategies a} and a3 of 
. The normal form also maps pure strategies into 
for each player. For instance, in game 1, player 1's
for pure strategies {a}, a3} is TT'(a},a3) = 2, be- 
layer 1 plays Left and player 2 reacts by also 


Exercise 11.1* Consider the extensive-form game depicted
in figure 11.4.


well-defined outcome for this game: {a}, a3}. Note that
Figure 11.3
Normal form.

ft is not a priori dominated for player 1. How-
comes so after player 2's dominated strategies 
eliminated. More generally, one can proceed 


(gets —3),7 


Figure 11.5 
The prisoner's dilemma. 


$< v_2 < \cdots < v_n$ for the object, and these valuations are
common knowledge. Bidders submit bids $b_i$ simultaneously.
The highest bidder wins the object and pays the
second bid; i.e., if $i$ wins, he has utility

$\Pi^i = v_i - \max_{j \neq i} b_j$.

The others do not pay anything.

(i) Show that bidding one’s valuation ($b_i = v_i$) is a
dominant strategy for bidder $i$.

(ii) Conclude that bidder $n$ wins and has a surplus of
$v_n - v_{n-1}$.

(iii) Would these results be affected if each trader knew
his own valuation but not those of the other bidders?
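
Before attempting a proof of part (i), it can help to check the claim by brute force on a small example. The sketch below assumes three bidders with hypothetical valuations 1, 2, and 3, a finite grid of candidate bids, and a tie-breaking rule under which a tied bidder loses (the exercise does not specify one). It verifies that no deviation from truthful bidding ever does strictly better, whatever the rivals bid.

```python
from itertools import product


def second_price_payoff(i, bids, valuations):
    """Utility of bidder i: v_i minus the highest rival bid if i wins, else 0.

    Ties are resolved against bidder i (an assumption made for this sketch).
    """
    highest_rival = max(b for j, b in enumerate(bids) if j != i)
    if bids[i] > highest_rival:
        return valuations[i] - highest_rival
    return 0.0


def truthful_is_dominant(valuations, grid):
    """Check on a finite grid that bidding v_i is never worse than another bid."""
    n = len(valuations)
    for i in range(n):
        for rivals in product(grid, repeat=n - 1):
            truthful = list(rivals[:i]) + [valuations[i]] + list(rivals[i:])
            for deviation in grid:
                deviating = list(rivals[:i]) + [deviation] + list(rivals[i:])
                gain = (second_price_payoff(i, deviating, valuations)
                        - second_price_payoff(i, truthful, valuations))
                if gain > 1e-12:
                    return False
    return True


valuations = [1.0, 2.0, 3.0]          # hypothetical v_1 < v_2 < v_3
grid = [0.5 * k for k in range(9)]    # candidate bids 0, 0.5, ..., 4
dominant = truthful_is_dominant(valuations, grid)
winner_surplus = second_price_payoff(2, valuations, valuations)
```

A grid search is of course not a proof; it only illustrates why overbidding risks winning at a price above one's valuation while underbidding risks losing a profitable object.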


Exercise 11.3** There are $n$ consumers, with utility functions


$\Pi^i = t_i + g_i(a, \theta_i)$,


where $t_i$ is consumer $i$'s income, $a$ is a public decision
(for instance, the quantity of a public good), $g_i(a, \theta_i)$ is
consumer $i$'s valuation for decision $a$, and $\theta_i$ is a utility
parameter. The functions $g_i$ are common knowledge. The
monetary cost of decision $a$ is $C(a)$.

(i) Show that for a planner who knows all the parameters
$\{\theta_i\}_{i=1}^n$, the socially optimal decision, $a^*(\theta_1, \ldots, \theta_n)$,
maximizes


$\sum_i g_i(a, \theta_i) - C(a)$


(the Samuelson rule).


ple of an I.O. game, C would stand for “charging a high price”
ging a low price 
(ii) Suppose that $\theta_i$ is known only to consumer $i$. The
planner tries to design a mechanism that induces the
consumers to announce their valuations truthfully and
implement the socially optimal decision. Consider the following
game: Consumers are asked to announce their
valuations simultaneously. $\hat{\theta}_i$ is consumer $i$'s announcement
(it may differ from the truth). The planner then
implements the decision $a^*(\hat{\theta}_1, \ldots, \hat{\theta}_n)$ (i.e., the decision
that is optimal if all consumers tell the truth), and gives
transfers


$t_i(\hat{\theta}_1, \ldots, \hat{\theta}_n) = K_i + \sum_{j \neq i} g_j(a^*(\hat{\theta}_1, \ldots, \hat{\theta}_n), \hat{\theta}_j) - C(a^*(\hat{\theta}_1, \ldots, \hat{\theta}_n))$,


where $K_i$ is a constant. Show that telling the truth ($\hat{\theta}_i = \theta_i$)
is a dominant strategy for consumer $i$. Conclude that
the planner can implement the first-best (full-information)


ately, in many games the elimination of domi- 


An example is given by the simultaneous game 2. None
of the four strategies is dominated, and this method
leaves us with an indeterminate outcome. Similarly, in
Bertrand or Cournot games of simultaneous choices of
prices or quantities (chapter 5), the optimal action for one
firm depends on that of the other firm, which means that
one already has a lot of undominated strategies.

Two other famous two-player simultaneous-choice examples
are depicted in figure 11.6. In “matching pennies,”
players choose heads or tails. If the choices match, player
1 receives 1 from player 2; and vice versa when they
don’t match. In “battle of the sexes,” each player chooses
between going to a movie or to a play. The two players
always prefer going somewhere together to going anywhere
alone, but player 1 prefers the movie and player 2


In such cases, the notion of Nash equilibrium yields a
weaker concept of “reasonable outcome.”

Definition  A set of strategies $\{a_i^*\}_{i=1}^n$ is a pure-strategy
Nash equilibrium if and only if, for all $a_i$ in $A_i$,

$\Pi^i(a_i^*, a_{-i}^*) \geq \Pi^i(a_i, a_{-i}^*)$,


where $a_{-i}^* \equiv (a_1^*, \ldots, a_{i-1}^*, a_{i+1}^*, \ldots, a_n^*)$. In other words, a
Nash equilibrium is a set of actions such that no player,


Figure 11.6 


taking his opponents’ actions as given, wishes to change
his own action. This definition is, of course, straightforwardly
extended to allow mixed strategies by letting
the set of probability distributions over $A_i$ be player
$i$'s strategy set and letting $\Pi^i$ denote the expectation over
the mixed strategies.

Matching pennies illustrates the possibility of the in- 
existence of a pure-strategy equilibrium. If player 1 plays 
H, player 2 plays T, which induces player 1 to want to 
play T, and so forth. But there exists a mixed-strategy 
equilibrium: Each player plays H and T with equal proba- 
bilities. For this to be an equilibrium, it must be the case 
that both pure strategies yield the same payoff to each 
player. Playing H yields to player 1


$\tfrac{1}{2}(1) + \tfrac{1}{2}(-1) = 0$,

whereas playing T yields him

$\tfrac{1}{2}(-1) + \tfrac{1}{2}(1) = 0$.

Pure-strategy equilibria need not exist in general games, 
but mixed-strategy equilibria always do (see the supple- 
mentary section). 
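
The indifference condition behind the mixed-strategy equilibrium of matching pennies can be checked with a few lines of code. The sketch below uses the payoffs described in the text (player 1 receives 1 on a match and pays 1 otherwise); the function name and the alternative probability 0.6 are illustrative assumptions.

```python
def expected_payoff_p1(action: str, q_heads: float) -> float:
    """Player 1's expected payoff from a pure action ("H" or "T")
    when player 2 plays heads with probability q_heads."""
    match, mismatch = 1.0, -1.0
    if action == "H":
        return q_heads * match + (1.0 - q_heads) * mismatch
    return q_heads * mismatch + (1.0 - q_heads) * match


# At q = 1/2 player 1 is indifferent, so any mixture is a best response.
payoff_h_at_half = expected_payoff_p1("H", 0.5)
payoff_t_at_half = expected_payoff_p1("T", 0.5)

# Away from 1/2 (here a hypothetical q = 0.6) one pure action is strictly
# better, so player 2's mixture could not be part of an equilibrium.
payoff_h_at_06 = expected_payoff_p1("H", 0.6)
payoff_t_at_06 = expected_payoff_p1("T", 0.6)
```

The same computation from player 2's side pins down player 1's equilibrium mixture, since the game is symmetric up to a sign change.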

The battle of the sexes illustrates the possibility of 
multiple equilibria. {M,M} and {P, P} are the two pure- 
strategy equilibria. There also exists a mixed-strategy 
equilibrium, in which player 1 plays M with probability 2/3
(and P with probability 1/3) and player 2 plays P with
probability 2/3 (and M with probability 1/3). In such cases, it
is not clear which prediction should be made. No equilibrium
is superior to the others for both players. On the
other hand, some elements of history (not included in the 
description of the game) may suggest a “focal” equilib- 
rium (for instance, that the two players have previously 
alternated, and last week they went to a movie, so that 
the “natural” thing to do this week is to go to a play).⁸
of selection among Nash equilibria involves 
lersonal judgment (as opposed to the use of a 


differentiability properties. One can then obtain a pure- 
strategy equilibrium, if any, by differentiating each player's 
payoff function with respect to his own action. The first- 
order condition is thus 


$\Pi^i_i(a_i^*, a_{-i}^*) = 0$,


where $\Pi^i_i \equiv \partial \Pi^i/\partial a_i$ denotes the partial derivative with
respect to own action. (The local second-order condition
is $\Pi^i_{ii} \leq 0$.) The first-order conditions give a system of
$n$ equations with $n$ unknowns, which, if solutions exist
and the second-order condition for each player holds,
yield the pure-strategy Nash equilibrium (or equilibria).


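Consider a price game in which firm $i$ faces demand $1 - bp_i + dp_j$, with
$d < b$. If firm $i$ has unit cost $c$, then its profit is

$\Pi^i = (p_i - c)(1 - bp_i + dp_j)$.

$\Pi^i$ is concave in $p_i$. The first-order conditions
for $i = 1, 2$ yield

$p_1 = p_2 = \frac{1 + bc}{2b - d}$.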
Now let us return to games 1 and 2. The simultaneous-
move game (game 2) has two pure-strategy equilibria: 
{a{,@}}|and {a?, a3}. The sequential-move game (game 1) 
admits these two Nash equilibria plus a third: {a?, a3}, 
which yields the same actions (and, therefore, the same
payoffs) as the second. For the simultaneous game, both 
Nash equilibria look reasonable.⁹ This is not the case
for the sequential-move game, for which we argued that


there exists a unique reasonable solution (after elimination 
of dominated strategies). The issue, as we will see, is that 
the concept of Nash equilibrium is appropriate only when 
all decisions are made simultaneously (once and for all); 
it is generally too weak when sequential decisions are 
involved. 


Exercise 11.4* Find the pure-strategy Nash equilibria for
the game defined in exercise 11.1. 


Exercise 11.5** Consumers are located uniformly along a 
linear city of length 1. Each consumer wants to buy one 
unit of a good from one of the existing firms. The trans- 
portation cost for the consumer is proportional to the 
distance to the firm he buys from. The law prohibits any 
form of competition through price or service (other than 
location), so consumers go to the nearest firm. A firm's 
utility is equal to the number of its customers. Firms 
located at the same location get the same number of 
customers. 

(i) There are two firms, and they choose their locations 
simultaneously. Show that there exists a unique pure- 
strategy Nash equilibrium, and that both firms locate in 
the middle of the segment. 

(ii) Show that with three firms there exists no pure- 
strategy Nash equilibrium. 


Exercise 11.6** There are n consumers. Each consumer i 
spends an amount of money p; on a public good. The 
choices are simultaneous. Consumer $i$ has utility


$U_i(p_1, \ldots, p_n) = g\left(\sum_j p_j\right) - p_i$,


where $g(0) = 0$, $g'(0) > 1$, $g' > 0$, $g'' < 0$, and
$\lim_{x \to \infty} g'(x) < 1$. Compute the Nash equilibrium. Discuss
the multiplicity. Show that there is too little public 
expenditure. 


11.3 Perfect Equilibrium


In a Nash equilibrium the players take their opponents’ 
strategies as given and therefore do not consider the pos- 


It might be argued that the two players should coordinate on the second, but 
one can change the example slightly so that this does not occur.
sibility of influencing them. In games in which a player
chooses some actions after observing some of his opponents’
actions (what we call dynamic games), this conjecture
is naive and leads to some absurd Nash equilibria,
as we saw in the previous section. This section introduces
a refinement of Nash equilibrium for dynamic games that
mitigates the deficiencies of Nash equilibrium.

Consider again the sequential game 1 (figure 11.1) and
the Nash equilibrium $\{a_1^1, a_2^1\}$. Player 1 does not play R
because player 2 threatens to play $\ell$ in this case. But
suppose that player 1 plays R. Then player 2, faced with
a fait accompli, benefits from playing $r$, since he then
obtains 1 instead of 0. Therefore, player 2’s threat is not
credible. Player 1, who should anticipate this, plays R,
which gives him a payoff of 3, greater than what he
obtains by playing L. Thus, the proposed Nash equilibrium
is based on a noncredible threat, i.e., a threat that
would not be carried out if the player were put in the
position to do so.

The basic idea of perfect equilibrium is to select Nash
equilibria that do not involve noncredible threats, by
(roughly) requiring that the players’ behavior be optimal
even in situations that are not reached on the equilibrium
path. For instance, player 2’s decision to play $\ell$ when
player 1 has played R is not optimal; the reason why
$\{a_1^1, a_2^1\}$ is a Nash equilibrium nevertheless is that this
suboptimal decision does not cost player 2 anything, because
player 1 is supposed to play L (in the jargon of
game theory, player 1’s playing R is a zero-probability
event). In contrast, a perfect equilibrium requires that
player 2 play optimally whether player 1 plays Right or
Left. This amounts to eliminating dominated strategies
for player 2, which is why perfection gives the same
answer as the iterated elimination of dominated strategies
for this game. (See below for a class of games in which the
two methods give the same answer.)

To obtain the perfect equilibrium, we work “backwards”:
By knowing player 2's optimal reaction to each of
player 1’s potential actions, we can “fold back the tree,”
as in figure 11.7. Player 1's (respectively, player 2's) valuations
are 2 and 3 (respectively, 0 and 1) when player 1 has
played left and right. (The valuations represent the payoffs
that the players obtain when they reach a certain
location in the game tree.) The game then boils down
to a single-decision-maker problem in which player 1
chooses R. The process of backward induction in the tree
is called Kuhn's algorithm (Kuhn 1953).
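
The folding-back procedure can be written out for a two-stage tree like game 1. In the sketch below the payoff pairs (player 1, player 2) are assumptions: only the folded-back valuations (2, 0) after L and (3, 1) after R, and player 2's payoff of 0 from playing left after R, are stated in the text, and the remaining entries are hypothetical values consistent with the discussion. The names `GAME_1` and `backward_induction` are likewise illustrative.

```python
# Hypothetical payoffs (player 1, player 2) for each path of play.
GAME_1 = {
    "L": {"l": (2, 0), "r": (2, -1)},
    "R": {"l": (1, 0), "r": (3, 1)},
}


def backward_induction(tree):
    """Fold back a two-stage perfect-information tree.

    Returns player 1's optimal action, player 2's optimal reply after each
    action of player 1, and the valuations attached to player 1's actions.
    """
    # Last mover: player 2 picks the reply maximizing his own payoff.
    best_reply = {
        a1: max(replies, key=lambda a2: replies[a2][1])
        for a1, replies in tree.items()
    }
    # Fold back: each action of player 1 is worth the payoffs it leads to.
    valuations = {a1: tree[a1][best_reply[a1]] for a1 in tree}
    # First mover: player 1 now faces a single-decision-maker problem.
    a1_star = max(valuations, key=lambda a1: valuations[a1][0])
    return a1_star, best_reply, valuations


first_move, replies, valuations = backward_induction(GAME_1)
```

Under these assumed payoffs the procedure selects R for player 1 and the reply r after R, which is the outcome the text identifies as the perfect equilibrium.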
Figure 11.7
Folded-back tree: player 1 chooses between L, with valuations (2, 0), and R, with valuations (3, 1).

For more general games, let us define a (proper) sub- 
game as a subset of the initial game tree that (1) begins 
with an information set that contains only one node, (2) is 
closed under succession (if a node is in the subgame, so 
are all of its successors), and (3) is such that all the in- 
formation sets of the subgame are information sets of the 
initial game. 

In particular, a game itself is one of its subgames. For 
instance, game 1 has three subgames: itself and the two 
subgames starting after player 1 has moved. In contrast, 
game 2 is its only subgame, because of requirement 1. A 
(subgame) perfect equilibrium (Selten 1965) is a set of 
strategies for each player such that in any subgame the 
strategies (truncated to this subgame) form a Nash equilib- 
rium. Thus, perfection requires that strategies be in equilibrium 
whatever the location (understand subgame) in the game tree, 
and not only along the equilibrium path. A perfect equilibrium
is necessarily a Nash equilibrium (take the big subgame
constituted by the game itself). In game 1 the
two second-period subgames are single-decision-maker
problems. Nash equilibrium in these subgames simply
means that the decision maker (player 2) chooses his best 
action. 

There are two types of commonly used games for 
which perfection is powerful: games of perfect information
and games of “almost perfect” information.


11.3.1 Games of Perfect Information 


Roughly, in these games the player whose turn it is to
play knows (has perfect information about) all the actions
that have been chosen before this play. There is no element
of simultaneity at all. Formally, all information sets
have only one node. Game 1 is an example of such a
game; the Stackelberg game of chapter 8 and the
short-run-commitment price game of chapter 6 are others. The
interesting thing about these games is that the iterated
elimination of weakly dominated strategies on the normal
form yields perfect equilibria (at least for finite
games). To see this, start from the terminal period or the
terminal nodes (there are two such nodes in game 1).
(First-stage) elimination of dominated strategies for the
player playing last yields his optimal behavior at each
terminal node. When the last-period behavior is folded
back into valuations, the next-to-last period becomes the
last one; again the (second-stage) elimination of dominated
strategies yields optimal behavior, and so forth.
Thus, the iterated elimination of dominated strategies
satisfies backward induction in the tree (Kuhn 1953). (The
two concepts are almost equivalent. For an example in
which they differ, replace the payoff 3 in game 1 by
payoff 2. {R, r} is a perfect equilibrium, but is ruled out by
the elimination of weakly dominated strategies. The reader
will check that the iterated elimination of strongly dominated
strategies on the “agent normal form” yields exactly
the set of perfect equilibria. See subsection 11.6.1 for the
definition of agent normal form; see Moulin 1982 for
results on the elimination of weakly dominated strategies.)
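
The folding-back procedure described in this subsection can be
sketched in a few lines of Python. This is only an illustration
under stated assumptions: the tree below is hypothetical (its
payoffs are not those of game 1), the dictionary layout and the
function name backward_induction are invented for the example,
and ties are broken in favor of the first action listed.

```python
def backward_induction(node):
    """Fold a finite perfect-information tree back from its terminal nodes.

    A terminal node is a tuple of payoffs (one entry per player).
    A decision node is a dict with keys "player" (index into the payoff
    tuple), "label" (name of the node) and "moves" (action -> child node).
    Returns the payoffs reached and the action chosen at every decision node.
    """
    if isinstance(node, tuple):
        return node, {}
    mover = node["player"]
    best_action, best_payoffs, plan = None, None, {}
    for action, child in node["moves"].items():
        payoffs, sub_plan = backward_induction(child)
        plan.update(sub_plan)  # keep choices made off the equilibrium path too
        if best_payoffs is None or payoffs[mover] > best_payoffs[mover]:
            best_action, best_payoffs = action, payoffs
    plan[node["label"]] = best_action
    return best_payoffs, plan


# Hypothetical two-stage tree: player 1 (index 0) moves, then player 2 (index 1).
game = {
    "player": 0,
    "label": "1",
    "moves": {
        "L": {"player": 1, "label": "2 after L",
              "moves": {"l": (2, 0), "r": (1, 1)}},
        "R": {"player": 1, "label": "2 after R",
              "moves": {"l": (0, 0), "r": (3, 1)}},
    },
}

payoffs, plan = backward_induction(game)
print(payoffs, plan)
```

The returned plan specifies an action at every decision node,
including those that are not reached, which is what perfection
requires of a strategy.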


Example 1 Let us consider an algebraic example with
the two-period structure of game 1. For this, consider
the same price game as in section 11.2, except that firm
2 observes firm 1's price before choosing its own. The
backward-induction logic requires that firm 1 foresee that
firm 2 will react optimally to any choice of $p_1$. That is,
firm 1 should solve the second-period optimization prob-
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and 
$p_2 = R_2(p_1)$.

(The prices are higher in the sequential game than in the
simultaneous game. See the explanation in terms of strategic
complements in chapter 8.)


Example 2 Consider Rubinstein's (1982) bargaining game
in which two players, who must share a pie of size 1,
make sequential, alternating offers. At date 1, player 1
makes offer $x_1$ in [0, 1]; player 2 accepts or refuses $x_1$. If
he accepts, he obtains $1 - x_1$, leaving $x_1$ for player 1. If
he refuses, he makes offer $x_2$ in [0, 1] at date 2. If this offer
is accepted, he gets $1 - x_2$ in period 2 and player 1 gets
$x_2$; if player 1 refuses this offer, he gets to make an offer
$x_3$ in the third period, and so on. The players alternate
making offers until one of them accepts his opponent's
offer. The payoffs are $\delta^t x_t$ for player 1 and $\delta^t (1 - x_t)$ for
player 2 if they agree at date t on a share $x_t$ for player 1.
The discount factor $\delta$ belongs to (0, 1). Impatience will be
the driving force that leads to agreement in this model.
This is a game of perfect information. Each player, when
making or responding to an offer, knows all actions taken
before his move. Suppose that there are T periods. To
solve for the perfect equilibrium, look at the last period
first. Clearly, the player who makes the offer at T demands
the whole pie because the other player will not be able to
make a further offer. So, $x_T = 0$ if T is even and $x_T = 1$
if T is odd. At T − 1, the player who makes the offer
gives the other player a share of the pie such that the
latter is indifferent between getting this share at T − 1
and getting the whole pie at T. And so forth.


Exercise 11.7** Solve the previous bargaining game for
$T = 2, 3, \ldots$. Show that $x_1$ tends to $1/(1 + \delta)$ when T
goes to infinity.
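
As a numerical companion to the backward-induction argument of
example 2, the following Python sketch folds the T-period game
back from the last date. It assumes only what the text states
(the last proposer takes the whole pie, and each earlier proposer
leaves the responder indifferent between accepting and waiting);
the function name first_period_share is introduced here for
illustration.

```python
def first_period_share(T, delta):
    """Share of the pie kept by the date-1 proposer in the T-period game.

    Working backward: with one period left the proposer keeps everything.
    With one more period, the proposer must leave the responder delta times
    what the responder would keep as next period's proposer.
    """
    share = 1.0  # proposer's share at the last date T
    for _ in range(T - 1):
        share = 1.0 - delta * share
    return share


delta = 0.9
for T in (1, 2, 3, 10, 100):
    print(T, first_period_share(T, delta))
print("limit candidate:", 1.0 / (1.0 + delta))
```

The recursion oscillates around $1/(1 + \delta)$ and settles on it as
T grows, which is the limit the exercise asks for.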


Rather than solve the finite-horizon case (see exercise 
11.7), we focus on the infinite-horizon case, to show how 
direct backward induction can be replaced by the use of 
valuation functions. Let us assume that the horizon is 
infinite. Conditional on the game's not being terminated 
(no offer has been accepted), the game “looks the same” 
at all odd periods; and similarly for even periods. Let 
us thus look for “stationary strategies”: When player 1
makes an offer, he always offers $x_1$. When player 2 makes
an offer, he always offers $x_2$. These offers and any that
are more favorable for the responding player are always
accepted; any that are less favorable for the responding
player are rejected.

Let $V_i$ denote player i's valuation when it is his turn to
make an offer; that is, $V_i$ is the payoff that i expects when
making an (optimal) offer. And let $W_i$ denote player i's
valuation when it is the other player's turn to make an
offer. Note that because strategies are stationary, these
valuations do not depend on time. Note also that from
the definition of $x_1$ and $x_2$,


$V_1 = x_1$, $W_2 = 1 - x_1$; $V_2 = 1 - x_2$, $W_1 = x_2$

and that

$V_1 + W_2 = V_2 + W_1 = 1$.


Now, when player 1 makes the offer, he offers player 2 a
share that makes the latter just indifferent between accepting
now and waiting one more period (offering more
would be pointless). Thus,


$1 - x_1 = \delta (1 - x_2)$


or

$W_2 = \delta V_2$.

And similarly when player 2 makes the offer:

$x_2 = \delta x_1$

or

$W_1 = \delta V_1$.


Because the game is symmetric, $V_1 = V_2 = V$ and
$W_1 = W_2 = W$. Thus, $W = \delta V$ and $V + W = 1$ lead to
$V = 1/(1 + \delta)$ and $W = \delta/(1 + \delta)$. So the player who
makes the offer gets $1/(1 + \delta)$, and the other gets the rest
of the pie: $\delta/(1 + \delta)$. The equilibrium strategies are these:
The offering player offers a share $\delta/(1 + \delta)$ to the
responding player; the latter accepts any share at least equal
to $\delta/(1 + \delta)$ and rejects any lower share.

Note that this equilibrium is the limit of the finite-horizon
equilibrium when T goes to infinity (see exercise
11.7). Rubinstein showed that this is also the unique
equilibrium of the infinite-horizon game.¹⁰


Exercise 11.8*** Show that the infinite-horizon equilibrium
is unique. Hint: Introduce the following valuations.
$\overline{V}_i$ (respectively, $\underline{V}_i$) is the highest (respectively, the lowest)
possible payoff for player i in the set of perfect equilibria
when it is his turn to make the offer. Define $\overline{W}_i$ and
$\underline{W}_i$ similarly. What are the relationships between these
numbers? Conclude that $\overline{V}_i = \underline{V}_i = 1/(1 + \delta)$ and
$\overline{W}_i = \underline{W}_i = \delta/(1 + \delta)$.


Exercise 11.9* Consider the following simple-minded inspection
game: A firm may pick an anticompetitive action
(i.e., may prey) or not. Preying yields it an (extra) monetary
payoff of g > 0. Antitrust authorities can inspect
or not. The cost of inspection is c > 0. If the firm has
preyed and inspection occurs, the firm pays a penalty
p > g; the antitrust authorities' payoff is then s − c > 0.
If the firm has not preyed and inspection occurs, or if no
inspection occurs, no penalty is paid and the authorities'
payoff is −c or 0, respectively.

(i) Suppose that the antitrust authority observes whether 
predation occurred before choosing whether to inspect. 
Draw the game tree. What is the perfect equilibrium? 

(ii) Assume that whether predation has occurred is not 
observed before the inspection decision. Draw the game 
tree. Argue that there exists no pure-strategy equilibrium.
Compute the mixed-strategy one. How does a change in
the penalty affect this equilibrium? 
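
For part (ii), one way to organize the computation is through the
two indifference conditions of a mixed-strategy equilibrium. The
Python sketch below is a hedged illustration, not a model answer:
it assumes the payoffs exactly as listed in the exercise (the firm
nets g − p when it preys and is inspected, g when it preys and is
not, and 0 otherwise), and the function name
inspection_mixed_equilibrium is invented for the example.

```python
from fractions import Fraction


def inspection_mixed_equilibrium(g, p, s, c):
    """Return (probability of preying, probability of inspecting).

    Firm: preying yields g - z*p when inspection occurs with probability z,
    not preying yields 0, so indifference requires z = g / p.
    Authority: inspecting yields y*(s - c) + (1 - y)*(-c) = y*s - c when the
    firm preys with probability y, not inspecting yields 0, so y = c / s.
    """
    g, p, s, c = (Fraction(v) for v in (g, p, s, c))
    if not (p > g > 0 and s > c > 0):
        raise ValueError("the exercise assumes p > g > 0 and s > c > 0")
    inspect_probability = g / p
    prey_probability = c / s
    return prey_probability, inspect_probability


prey, inspect = inspection_mixed_equilibrium(g=1, p=4, s=3, c=1)
prey_high, inspect_high = inspection_mixed_equilibrium(g=1, p=8, s=3, c=1)
print(prey, inspect, prey_high, inspect_high)
```

Under these assumptions, raising the penalty lowers the inspection
probability and leaves the probability of preying unchanged, because
each player's mixing probability is pinned down by the other
player's indifference condition.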


11.3.2. Games of “Almost Perfect” Information 


Suppose that the game can be decomposed into a number
of periods $t = 1, 2, \ldots, T$ (where T is finite or infinite), and
that at each date t the players simultaneously choose
actions knowing all the actions chosen by everybody at
dates 1 through t − 1. Because such a game introduces
simultaneity only within a period, we call such extensive
forms games of “almost perfect” information. The simplest
example of such a game is the “repeated game” in which
a simple one-period simultaneous-move game (such as
those depicted in figures 11.2, 11.5, and 11.6) is repeated
T times and, at date t, players know all the moves before
t. In a repeated game there is no physical link between the
periods.¹¹ In contrast, the sequential quantity-price game
of chapter 5 and the entry, accommodation, and exit
games of chapters 7 through 9 involve physical links
through all kinds of investments. For instance, when firms
learn by doing, price competition at t does not look the
same as that at t − 1 because the cost structure has
changed. A brief introduction to repeated games should
suffice here. (For more on repeated games, see chapter 6.)


10. Infinite-horizon games may have other equilibria than the limits
of the equilibria of the finite-horizon game when the horizon tends to infinity.
See chapter 6 and subsection 11.5.2 for examples. See Fudenberg and Levine
1983 for an investigation of the link between the two.


11. The one-period simultaneous-move game is called the constituent game.
Each period starts a proper subgame for each history of past moves.


Example: Repeated games Let us return to the “prisoner's
dilemma” game described in section 11.2, and let us
assume that the players play this same game repeatedly
(and learn past moves along the way). Assume that each
player's payoff is equal to the present discounted value of
his per-period payoffs over the time horizon (with some
discount factor $\delta$ in (0, 1)). Suppose first that there are a
finite number of periods T. To solve for a perfect equilibrium,
work backwards from the end of the horizon. At
date T, the strategies must specify a Nash equilibrium for


iod-T outcome is independent of what hap- 
I, Hence, the strategies at T— 1 must also 
period Nash equilibrium. Hence, both players 


riod, whatever the history, is still a perfect 
; Given that the future outcomes are not 
what is played now (they will fink anyway), 


by finking. But there exist other equilibria. Consider, for
instance, the following symmetric strategies: At any date
t a player cooperates if and only if both players have
always cooperated between periods 1 and t − 1. At date
1, both cooperate. This forms a perfect equilibrium if
$\delta \geq 1/5$: In a subgame starting at date t in which one player
has finked in the past, both fink forever. We already know
that such strategies are perfect (and, therefore, Nash). In a
subgame in which no one has yet finked, the strategies
also form a Nash equilibrium; they yield


$2(1 + \delta + \delta^2 + \cdots) = 2/(1 - \delta)$


from t on. If a player deviated and finked at date t, he
would get


$3 - 2(\delta + \delta^2 + \cdots) = 3 - 2\delta/(1 - \delta)$


because from time t + 1 on both strategies will specify
“fink forever.” Thus, for a player to cooperate we need

$2/(1 - \delta) \geq (3 - 5\delta)/(1 - \delta)$


(the long-run gain from sustaining cooperation exceeds
the short-run gain from deviating), which amounts to
$\delta \geq 1/5$.
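
The comparison between cooperating forever and deviating once can
be checked with exact arithmetic. The Python sketch below assumes
per-period payoffs of 2 under mutual cooperation, 3 for a player who
finks against a cooperator, and −2 under mutual finking; these
values are read off the expressions in this example and footnote 13
and should be checked against figure 11.4. The function names are
introduced for illustration only.

```python
from fractions import Fraction


def grim_trigger_threshold(cooperate, deviate, punish):
    """Smallest discount factor at which 'cooperate until someone finks' holds.

    Cooperation is worth cooperate / (1 - d). A one-shot deviation is worth
    deviate + d * punish / (1 - d). Rearranging the inequality gives
    d >= (deviate - cooperate) / (deviate - punish).
    """
    return Fraction(deviate - cooperate, deviate - punish)


def cooperation_is_sustainable(delta, cooperate=2, deviate=3, punish=-2):
    """Compare the two present values directly for a given discount factor."""
    delta = Fraction(delta)
    cooperation_value = Fraction(cooperate) / (1 - delta)
    deviation_value = deviate + delta * Fraction(punish) / (1 - delta)
    return cooperation_value >= deviation_value


threshold = grim_trigger_threshold(cooperate=2, deviate=3, punish=-2)
print(threshold)
print(cooperation_is_sustainable(Fraction(1, 10)))
print(cooperation_is_sustainable(Fraction(1, 2)))
```

With other payoffs the same formula applies; only the threshold
changes.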

There are many other equilibria for the infinite horizon. 
The so-called folk theorem gives an exact characterization 
of the set of perfect equilibria of general repeated games 
with infinite horizon when the discount factor is very
near 1.¹³


11.4 Bayesian Equilibrium 


11.4.1 Games with Imperfect or Incomplete 
Information 


Game theorists make a distinction between imperfect in- 
formation and incomplete information. Roughly speaking, 


12. The combinations of one-period equilibria are only some of many equilibria
if the one-period game has multiple equilibria. See Benoit and Krishna 1985 for
a general result; see Frayssé and Moreaux 1985 for a simple application to the
repeated Cournot game.


payoff that other players could force i to get in the one-period game:


$\min_{a_{-i}} \max_{a_i} \Pi^i(a_i, a_{-i})$


can be sustained on average as a perfect-equilibrium payoff of the infinitely
repeated game for $\delta = 1$ (Aumann and Shapley 1976; Rubinstein 1979) or,
under a weak condition, for $\delta$ close to 1 (Fudenberg and Maskin 1986). For
instance, in the prisoner's dilemma game the individually rational payoffs turn
out to coincide with the equilibrium payoffs (−2, −2) of the one-period game.
The set of equilibrium payoffs in the infinitely repeated game is thus the set of
convex combinations of the four payoff couples of figure 11.4. For discount
factors of less than 1, see Abreu 1984. The supplementary section of chapter 6
contains a more complete discussion of the folk theorem.
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a player has imperfect information when he does not 
know what the other players have done beforehand.¹⁴
On the other hand, a player has incomplete information
does not know his rivals’ precise characteristics 
ces, strategy spaces). For example, consider re- 


's), but it is one with imperfect information if, at a 
int in time, they do not know how much their 
rivals have spent so far on R&D. Actually, this distinction
is somewhat semantic: a game with incomplete information
may be transformed into one with imperfect information
by first introducing a new player, “nature,” who
chooses the characteristic or the type of each player, and
then assuming that the players, other than the particular
one in question, are not informed of the choice of the
player's type (see Harsanyi 1967–68).¹⁵ The equilibrium
concept described in section 11.5 encompasses games of
incomplete and imperfect information.

The characteristic (or type) $t_i$ of a player i embodies
everything that is relevant to this player's decision
making. In practice, it is often posited that $t_i$ is some
parameter of player i's objective function (cost or demand
parameter, for instance) known to player i. One further
assumes that it is common knowledge that types are a
priori drawn from some known distribution


11.4.2 Static Games of Incomplete Information


In this subsection we will be concerned only with games 
in which players move simultaneously, so no player has 
the opportunity to react to another's move. Hence, we 
can abstract for the moment from the issue of perfection 
discussed in section 11.3. And, importantly, we need not 
worry about the inferences made by players about their 
opponents’ types, because all the actions are taken before 
any actions are observed. Hence, inferences are con- 
sequenceless. Dynamic games of incomplete (or imper- 
fect) information will be considered in section 11.5. (Note 
that in a dynamic game the question of inferences can 
become important.) 

We assume that player i picks some action $a_i$ in $A_i$ (as
before, $A_i$ can be extended to include mixed strategies)
and has an ex post payoff of


Player i's decision naturally depends on his information
$t_i$. Let $a_i^*(t_i)$ denote this action. Harsanyi's Bayesian
equilibrium is the natural generalization of Nash equilibrium
to games of incomplete information. It assumes that each
player i correctly anticipates what actions all players $j \neq i$
choose. Because these actions depend on types, he correctly
computes the functions $\{a_j^*(t_j)\}_{j \neq i}$.


Definition A Bayesian equilibrium is a set of type-contingent
strategies $\{a_i^*(t_i)\}_{i=1}^n$ such that each player maximizes
his expected utility contingent on his type and taking
the other players' type-contingent strategies as given:
$a_i = a_i^*(t_i)$ maximizes


$\sum_{t_{-i}} p_i(t_{-i} \mid t_i)\, \Pi^i(a_1^*(t_1), \ldots, a_i, \ldots, a_n^*(t_n), t_1, \ldots, t_i, \ldots, t_n)$.


In other words, player i anticipates that player j will play
$a_j^*(t_j)$ if the latter has type $t_j$; however, not knowing $t_j$, the
former must compute his payoff in expectations. One
could as well think of a player i as having several incarnations,
such as a player i with type $t_i$ and another, different
player i with type $t_i'$. Which incarnation plays against the
other players is drawn according to the prior distribution.
Thus, a Bayesian equilibrium can be seen as a Nash equilibrium
with $\sum_i |T_i|$ players, where $|T_i|$ is the number of
potential types for player i.


original players at the start of the game. Each player observes only his own
characteristics and thus has imperfect information about nature's choice of his
opponents' characteristics. (To endow nature with an objective function, one
can assume that it is indifferent among all choices of characteristics and that it
randomizes according to the initial probability distribution.)


16. See Harsanyi 1967–68, Mertens and Zamir 1983, and Brandenburger and
Dekel 1985.


Example 1 Let us start with a straightforward example.
In the two-player, simultaneous-move game depicted in
figure 11.8, player 1 has only one type (player 2 has
complete information about player 1). Player 2 has two
possible types: $t_2$ and $t_2'$. While player 2 knows his type,
player 1 puts equal probabilities on the two types. Each
player has two possible actions (up or down for player 1,
left or right for player 2). Each cell represents the payoffs
of player 1 and 2. For instance, the upper leftmost cell
says that if player 1 plays U and player 2 has type $t_2$ and
plays L, they get 3 and 1, respectively. Note that player
1's payoff depends only on the chosen actions, and not on
who player 2 is. Solving for a Bayesian equilibrium here is
straightforward. Each type of player 2 has a dominant
strategy. Regardless of what player 1 does, $t_2$ chooses L
and $t_2'$ chooses R (that is, $a_2^*(t_2) = L$ and $a_2^*(t_2') = R$).
Now everything is as if player 1 faced an opponent who
played L and R with equal probability, because $t_2$ and $t_2'$
are equally likely. Thus, by playing U (respectively, D),
player 1 gets $\frac{1}{2}(3 + 2)$ (respectively, $\frac{1}{2}(0 + 4)$). Hence,


[Figure 11.8: payoff panels for type $t_2$ and type $t_2'$]


Example 2 Consider a duopoly with Cournot (quantity)
competition. Let firm i's profit be quadratic:
$\Pi^i = q_i(t_i - q_i - q_j)$, where $t_i$ is the difference between the
intercept of the linear demand curve and firm i's constant
unit cost (i = 1, 2) and $q_i$ is the quantity chosen by firm i
($a_i = q_i$). It is common knowledge that, for firm 1, $t_1 = 1$
(firm 2 has complete information about firm 1). Firm 2,
however, has private information about its unit cost. Firm
1 knows only that $t_2 = 3/4$ or $5/4$ with equal probabilities.
Thus, firm 2 has two potential types, which we will call
the “low-cost type” ($t_2 = 5/4$) and the “high-cost type”
($t_2 = 3/4$). The two firms choose their outputs simultaneously.
Let us look for a pure-strategy equilibrium. Firm 1
plays $q_1$, firm 2 plays $q_2^L$ (if $t_2 = 5/4$) or $q_2^H$ (if $t_2 = 3/4$). Let
us start with firm 2:


$q_2(t_2) \in \arg\max_{q_2} \left[ q_2 (t_2 - q_2 - q_1) \right]$

$\Rightarrow q_2(t_2) = (t_2 - q_1)/2$,


where “arg max” denotes the set of values that maximize
the objective function. Let us now consider firm 1, which
does not know which type it faces:


$q_1 \in \arg\max_{q_1} \left[ \frac{1}{2} q_1 (1 - q_1 - q_2^H) + \frac{1}{2} q_1 (1 - q_1 - q_2^L) \right]$

$\Rightarrow q_1 = (1 - Eq_2)/2$,


where E(·) denotes an expectation over firm 2's types.
However,


$Eq_2 = \frac{1}{2} q_2^H + \frac{1}{2} q_2^L = (Et_2 - q_1)/2 = (1 - q_1)/2$.


One thus obtains $\{q_1 = 1/3,\ q_2^L = 11/24,\ q_2^H = 5/24\}$ as a (provably
unique) Bayesian equilibrium. This simple example
illustrates how one can compute the Bayesian equilibrium
as a Nash equilibrium of a three-player game.¹⁷
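
The three best-response conditions of this example form a small
linear system, which the following Python sketch solves exactly.
It is an illustration under assumptions: the type values 3/4 and
5/4 are taken as the two equally likely values of $t_2$, the closed
form for $q_1$ is obtained by substituting firm 2's reaction into
firm 1's, and the function name bayesian_cournot is invented for
the example.

```python
from fractions import Fraction


def bayesian_cournot(t1, types_and_probs):
    """Solve q1 = (t1 - E q2) / 2 together with q2(t2) = (t2 - q1) / 2.

    Substituting E q2 = (E t2 - q1) / 2 into firm 1's reaction gives
    q1 = (2 * t1 - E t2) / 3; each type of firm 2 then best-responds to q1.
    """
    expected_t2 = sum(prob * t2 for t2, prob in types_and_probs)
    q1 = (2 * t1 - expected_t2) / 3
    q2 = {t2: (t2 - q1) / 2 for t2, _ in types_and_probs}
    return q1, q2


half = Fraction(1, 2)
low_cost, high_cost = Fraction(5, 4), Fraction(3, 4)
q1, q2 = bayesian_cournot(Fraction(1), [(low_cost, half), (high_cost, half)])
print(q1, q2[low_cost], q2[high_cost])
```

Each of the three "players" (firm 1 and the two incarnations of
firm 2) is on its reaction curve at the computed quantities, which
is the sense in which the Bayesian equilibrium is a Nash equilibrium
of a three-player game.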


Example 3 Assume that two players bid simultaneously
for an indivisible object ($a_i = b_i$). The highest bidder gets
the object and pays his bid. (Thus, if $b_i > b_j$, i gets the
object and pays $b_i$ to the seller. If $b_i = b_j$, the object is
sold at the common bid randomly; the specification of
the auction in this case does not matter, because identical
bids will have probability 0.) A player's type $t_i$ is his
valuation for the object, so player i gets $t_i - b_i$ if he wins

17. See Saloner 1987 for more on this game. See section 11.6 for the sequential
(i.e., Stackelberg) version of the game.
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and 0 if he loses. Types t; are drawn independently from 
distribution on [0, 1]. Let us look for a Bayesian 
in which a player's bid is a strictly increasing 


use the game is symmetric, let us derive sym- 
ilibrium strategies: b = b*(#), Player i’s payoff 


But, from player j's strategy,

$\text{Prob}\{b_j \leq b\} = \text{Prob}\{b^*(t_j) \leq b\}$

$= \text{Prob}\{t_j \leq b^{*-1}(b) = \Phi(b)\}$
$= \Phi(b)$,


where $\Phi(b)$ denotes the inverse function of $b^*$ (in words,
$\Phi(b)$ is player j's valuation when he bids b), and use is
made of the uniform distribution of types (if $\theta$ is distributed
uniformly on [0, 1] and $k \in [0, 1]$, then $\text{Prob}\{\theta \leq k\} = k$).
Thus, player i maximizes


$\Pi^i = (t_i - b)\Phi(b)$,


which yields the first-order condition


$-\Phi(b) + (t_i - b)\Phi'(b) = 0$.


Now, in order for $b^*(\cdot)$ to also be player i's optimal
strategy, it must be the case that the latter is of type
$t_i = \Phi(b)$ when bidding b. Hence,


$\Phi(b) = [\Phi(b) - b]\Phi'(b)$.


This differential equation has the obvious solution $\Phi(b) = 2b$.
Thus, a Bayesian equilibrium of the game has each
player bidding half of his valuation: $b^*(t) = t/2$.¹⁸ Similar
are used to solve “wars of attrition’; see 


cumulative distribution: F(t) = f* on 0, 1). Show that, in 
equilibrium, the bid function is linear in the bidder's valua- 
tion. What happens when n tends to infinity? 
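
For the two-bidder auction of example 3, the claim that bidding
half of one's valuation is a best response to a rival who does the
same can be checked by brute force. The Python sketch below is a
numerical illustration only: it assumes valuations uniform on
[0, 1], searches over a grid of bids, and uses function names
(expected_payoff, best_bid) that are invented for the example.

```python
def expected_payoff(bid, valuation):
    """Expected payoff when the rival bids half of a uniform [0, 1] valuation.

    The rival's bid is below `bid` exactly when his valuation is below
    2 * bid, which has probability min(2 * bid, 1).
    """
    win_probability = min(2.0 * bid, 1.0)
    return (valuation - bid) * win_probability


def best_bid(valuation, grid_size=1000):
    """Grid search for the payoff-maximizing bid in [0, 1]."""
    bids = [k / grid_size for k in range(grid_size + 1)]
    return max(bids, key=lambda b: expected_payoff(b, valuation))


for t in (0.2, 0.6, 1.0):
    print(t, best_bid(t))
```

The grid optimum coincides with t/2 for each valuation tried, in
line with the solution of the differential equation.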


Exercise 11.11** Suppose that each bidder's estimate of
the value of an object affects the other bidders' own
valuations for the object. The extreme case of this situation
is the common-value auction in which the ex post
value of the object (e.g., an oil lease) is the same for all
bidders and each bidder has a private and imperfect estimate
of the common value. Winning then conveys bad
news to the winner, who learns that the other bidders
attached a low value to the good. This winner's curse
must be anticipated by a rational bidder. Consider a first-bid
auction for a good with common value $v = x_1 + x_2$.
The parameters $x_1$ and $x_2$ are independently drawn from
the uniform distribution on [0, 1]. There are two bidders,
i = 1, 2. Bidder i knows $x_i$ but not $x_j$.

(i) Argue informally that bidder i's payoff, if he wins, is
necessarily smaller than $x_i + \frac{1}{2} - b_i$ (where $b_i$ is the winning
bid).

(ii) Look for a symmetric equilibrium. Let $x = \Phi(b)$
denote the inverse bid function. Argue that, for a given
$x_i$, $b_i$ ($= \Phi^{-1}(x_i)$) maximizes


$\{x_i + E[x_j \mid x_j \leq \Phi(b_i)] - b_i\}\,\Phi(b_i)$.


(iii) Compute the equilibrium bid function. (Hint: It is
linear.)


Example 4 Simultaneous-move games of complete information
often admit mixed-strategy equilibria. Some
researchers are unhappy with this notion because, they
argue, “real-world decision makers do not flip coins.”
However, as Harsanyi (1973) has shown, mixed-strategy
equilibria of complete-information games can often be
vindicated as the limits of pure-strategy equilibria of
slightly perturbed games of incomplete information. Indeed,
we have already noticed that in a Bayesian game,
once the players' type-contingent strategies have been
computed, each player behaves as if he were facing mixed
strategies by his opponents. (Nature creates uncertainty
through the choice of types rather than the choice of the 
side of the coin.) To illustrate the mechanics of this con- 


auctions, see Myerson 1979 and Milgrom and Weber 1982. On characterization
and uniqueness in the first-bid auction, see Maskin and Riley 1983.
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hs two possible actions: investment and no 
investment. In the complete-information version of the 


This clearly is an equilibrium: Each firm makes 0 if it does
not invest and makes $\frac{1}{2}(1) + \frac{1}{2}(-1) = 0$ if it does invest.
Now consider the same game with the following incomplete
information: Each firm has the same payoff structure,
except that when it wins it gets (1 + t), where t is uniformly
distributed on $[-\varepsilon, +\varepsilon]$. Each firm knows its type
t but not that of the other firm. Now, it is easily seen that
the symmetric pure strategies, “a(t < 0) = do not invest,
a(t ≥ 0) = invest,” form a Bayesian equilibrium. From the
point of view of each firm, the other firm invests with
probability $\frac{1}{2}$. Thus, the firm should invest if and only if
$\frac{1}{2}(1 + t) + \frac{1}{2}(-1) \geq 0$, i.e., $t \geq 0$. Because for a given
type a player has a unique best siete there is no problem 


zero, the pure-strategy Bayesian equilibrium converges to
the mixed-strategy Nash equilibrium of the complete-information
game.
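
The cutoff logic of this example can be verified directly. The
Python sketch below rests on assumptions that the surviving text
only partly spells out: a firm that invests "wins" 1 + t when its
rival does not invest and loses 1 when the rival invests too, a
firm that does not invest gets 0, and t is uniform on
$[-\varepsilon, +\varepsilon]$. The function names are introduced
for illustration.

```python
def invest_probability(cutoff, eps):
    """Probability that a firm using 'invest iff t >= cutoff' invests,
    when t is uniform on [-eps, eps]."""
    cutoff = max(-eps, min(eps, cutoff))
    return (eps - cutoff) / (2.0 * eps)


def payoff_from_investing(t, rival_invest_probability):
    """Expected payoff of investing for a firm of type t (assumed payoffs:
    1 + t if the rival stays out, -1 if the rival also invests)."""
    stays_out = 1.0 - rival_invest_probability
    return stays_out * (1.0 + t) - rival_invest_probability


eps = 0.1
rival = invest_probability(cutoff=0.0, eps=eps)
for t in (-0.05, 0.0, 0.05):
    print(t, payoff_from_investing(t, rival))
```

Against a rival who uses the cutoff 0, investing is profitable
exactly for positive types, so the cutoff 0 is a best response to
itself; the rival's investment probability stays at one-half for
every $\varepsilon > 0$, which is the mixed-strategy probability
of the complete-information game.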

Because games of complete information are an idealization
(in practice, everyone has at least a slight amount of
incomplete information about the other players' objectives),
Harsanyi's argument shows that it is hard to make
a strong case against mixed-strategy equilibria on the
ground that they require a randomizing device.¹⁹


11.5 Perfect Bayesian Equilibrium 


We now study dynamic games of incomplete (or imper- 
fect) information. The tricky feature in these games is that 
a player who reacts to another player's move can extract 
information from that move. It is natural to suppose that 
the inference process takes the form of Bayesian updating 
from the latter player's supposed equilibrium strategy and 
his observed action. The equilibrium notion is a com- 
bination of the (subgame) perfect-equilibrium concept for 
dynamic games and the Bayesian-equilibrium concept for 
games of incomplete information. This section introduces 
the simplest such notion: perfect Bayesian equilibrium
(PBE)²⁰ and gives four simple applications of the concept.
To motivate the notion of equilibrium, we begin with 
the simple game of imperfect information depicted in 
figure 11.10. This game has three players and takes place 
over three “periods.” In period 1, player 1 can choose
from among three actions: “Left” ($L_1$), “Middle” ($M_1$),
and “Right” ($R_1$). If player 1 chooses one of the latter
two, player 2 gets to choose between “Left” ($L_2$) and
“Right” ($R_2$), although he is not informed of player 1's
exact choice (he knows only that player 1 did not choose
$L_1$). The imperfect information of player 2 is represented
by an information set {M,,R, }, characterized by an oval 
around the two corresponding nodes (n; and n;). Given 
his state of information, player 2 is faced with the same 
choices at nodes n and my. Finally, for move {M,,R>} or 
{R,,L,} player 3 must choose between “Left” (Ls) and 
“Right” (R3) in the third period without knowing which 
of the nodes (nt or ms) the game has attained. The values 
of the objective functions are written at the bottom of 
the tree. For example, for moves (M,,L3,R3) player 1 
receives 3, player 2 receives 2, and player 3 receives 0. 
19. See Milgrom and Weber 1986 for sufficient conditions on the objective
functions and information structure so that the limit of Bayesian equilibrium
strategies when the uncertainty becomes "negligible" forms a Nash equilibrium
of the complete-information limit game.


20. The refinement of this notion is discussed in section 11.6.


Figure 11.10
Game 5


The following parable should prove helpful in understanding
the equilibrium concept. Suppose that an economist
has to solve game 5 and, not knowing what to do,
he turns to two people who have very specific talents.
The first consultant is a game theorist. He is well
versed in the method of solving perfect equilibria of dynamic
games, as described in section 11.3. On the other
hand, he is sufficiently familiar with decision theory to
understand the concept of expected utility (payoff). However,
he is not aware of Bayes' law for calculating pos-


bility (say, 43) to the game being at node n, (so the 
conditional probability of 15 is 1 — 13), Similarly, player 


is identichl to a perfect-information game, which I can 


solve.” In fact, given $\mu_3$, player 3 chooses the action
that maximizes his expected payoff. Player 2 knows the
optimal strategy of player 3, and he also maximizes his
expected utility given the subjective probability $\mu_2$. Finally,
given the other players' optimal strategies, player 1
chooses according to his own self-interest. Therefore, the
game theorist reports to the economist the correspondence
between the subjective probabilities and the optimal
strategies obtained by backward induction:


$a^*(\mu) = \{a_1^*(\mu), a_2^*(\mu), a_3^*(\mu)\}$,


where $a_i^*$ can be a mixed strategy.

The second consultant is a Bayesian statistician. Calculating
posterior probability distributions is second nature
to him; however, he is familiar with neither game theory
nor decision theory. Possessing a scientific mind, he makes
the following remark: “If I am given the set of strategies
$a = \{a_1, a_2, a_3\}$, I can calculate the probabilities the players
should attribute to the various nodes.” For example, if
player 1 plays each of his actions with probability 1/3,
player 2 must assign a probability of 4. = 4 to node ny 
if the game attains his informational set. Moreover, if 
a, = R,, player 3 assigns probability yz; = 1 to node ng. 
What happens if player 1 plays $a_1 = L_1$ (with probability
1)? Then player 2 can attribute any probability $\mu_2$ to node
n,, since every probability is compatible with Bayes’ law 
(because the event “the game attains player 2's informational
set” has zero probability).²¹ The statistician reports
for each set of strategies the set of beliefs that are com- 
patible with these strategies in a Bayesian sense: j1"™ (a). 

The conclusion to this parable is easy to guess. The 
economist calls in his two consultants. The statistician 
proposes that the strategies he uses to calculate the prob- 
ability be the strategies actually chosen by the players, 
which the game theorist should give him. In turn, the 
game theorist suggests that the players’ subjective prob- 
abilities be based on a study of the other players’ be- 
haviors, and calls on the statistician to provide him with 
these probabilities. Then both agree that the notion of 
equilibrium should make explicit these two types of com- 
patibility. Consequently, they define a perfect Bayesian 
equilibrium as a set of strategies a that satisfy 
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21. However, we can, following Kreps and Wilson 1982, require that be 
consistent with 413. For example, if ji; = and player 2 plays L, with prob 
ability $, then uy = 4 (see below), 
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(114) 


we nu (a"([0). (11.2) 


timal strategies (and the associated beliefs) 
|-point condition: 


are optimal given beliefs. 
(B) Beliefs are obtained from strategies and observed actions
using Bayes' rule.


The concept of perfect Bayesian equilibrium was introduced
into the formal game-theory literature by Selten
(1975) as “perfect equilibrium” and by Kreps and Wilson
(1982) as “sequential equilibrium.”²² Although Selten
pioneered this concept, Kreps and Wilson must be credited
for putting more emphasis on beliefs,²³ making the
concept more applicable, and paving the way for refinements
based on restrictions on beliefs for zero-probability
events (see section 11.6 for an example).


Remark We have been very informal about what Bayes- 
ian updating means. At the very least, the information 


calculate the perfect Bayesian equilibrium 
) of a game? The characterization of a PBE 


depicted in figure 11.10 is actually trivial. Player 3 has a
dominant strategy that consists of playing $L_3$; therefore,
regardless of $\mu_3$, $a_3 = L_3$. Hence, we can convert the
game to the two-period, two-player game depicted in
figure 11.11. (Since player 3 does not play in this game,
we omit the values of his objective function.)

We observe that player 2 now has a dominant strategy:
$L_2$. Therefore, player 1's optimal action is $M_1$. Consequently,
the unique PBE is given by $a = (M_1, L_2, L_3)$, and
it has associated to it a set of beliefs ($\mu_2 = 1$, $\mu_3 \in [0, 1]$).

This game is indeed trivial. Because of the existence of
dominant strategies, the game theorist finds the equilibrium
path without the help of the statistician.

We now solve four simple but nontrivial games that 
are relevant to industrial organization. In these games 
(in contrast with the previous game) there is interaction 
between backward induction and forward Bayesian infer- 
ence. The first two are games of incomplete information 
in which an informed party may reveal information when 
choosing between two actions. In the third game, a player's 
private information can be revealed through a complex 
(two-dimensional) action. There, simple economic reasoning
indicates what the equilibrium path must be. The
fourth game gives an example of a game of imperfect 
information. 


Figure 11.11
Game 6. 


22. Kreps and Wilson have shown that perfect and sequential equilibria
coincide for almost all games.


23. Selten defined his concept of “trembling hand” perfect equilibrium on the 
normal form. Beliefs, although easy to compute, are implicit rather than explicit, 
prefers to accheninodats rather hizo’ to one 
r, it would most prefer to be a monopoly, in 


», if firm 1 is crazy, The idea is that, unless it is 
1 will not pursue any predatory strategy in the 
eriod because there is no point building or keep- 
putation at the end. (This assumption can be 
more formally from the description of the second-period
competition.) The sane firm 1 gets $D_1$ if firm 2
stays and $M_1 > D_1$ if firm 2 exits. We let $\delta$ denote the
discount factor between the two periods.

resumed that the crazy type always preys. The 


Let us begin with a taxonomy of potential perfect Bayesian
equilibria.
A separating equilibrium is an equilibrium in which firm


t's type, then 
and 
$\mu(t = \text{crazy} \mid \text{prey}) = 1.$

A pooling equilibrium is an equilibrium in which firm 1's 
two types choose the same action in the first period. 
Here, this means that the sane type preys. In a pooling 


equilibrium, firm 2 does not update its beliefs when 
observing the equilibrium action: 


$\mu(t = \text{sane} \mid \text{prey}) = p_1.$


There can also exist hybrid or semi-separating equilibria. 
For instance, in the reputation game, the sane type may 
randomize between preying and accommodating (i.e.,
between pooling and separating). One then has


$\mu(t = \text{sane} \mid \text{prey}) \in (0, p_1)$
and
$\mu(t = \text{sane} \mid \text{accommodate}) = 1.$
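
As a rough numerical illustration of the three cases, the posterior after predation can be computed from Bayes' rule. The sketch below assumes, as in the text, that the crazy type always preys; the function name and the parameter `q` (the probability with which the sane type preys) are introduced here only for illustration.

```python
def posterior_sane_given_prey(p1: float, q: float) -> float:
    """Bayes' rule for Pr(sane | prey).

    p1: prior probability that firm 1 is sane (0 < p1 < 1).
    q:  probability that the sane type preys (hypothetical parameter);
        the crazy type is assumed to prey with probability 1.
    """
    if not (0.0 < p1 < 1.0 and 0.0 <= q <= 1.0):
        raise ValueError("need 0 < p1 < 1 and 0 <= q <= 1")
    return p1 * q / (p1 * q + (1.0 - p1))


# q = 0 is the separating case, q = 1 the pooling case,
# and 0 < q < 1 the hybrid (semi-separating) case.
for q in (0.0, 0.5, 1.0):
    print(q, posterior_sane_given_prey(0.6, q))
```

With `q = 1` the posterior equals the prior, with `q = 0` predation reveals the crazy type, and intermediate values of `q` give a posterior strictly between 0 and the prior.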


Let us first look for conditions of existence of a separating
equilibrium. In such an equilibrium, a sane firm 1
accommodates and thus reveals its type and obtains
$D_1(1 + \delta)$. (Firm 2 stays, because it expects $D_2 > 0$ in the
second period.) If firm 1 decided to prey, it would convince
firm 2 that it is crazy and would obtain $P_1 + \delta M_1$.
Thus, a necessary condition for the existence of a separating
equilibrium is


$\delta (M_1 - D_1) \le D_1 - P_1.$ (11.3)


Conversely, suppose that condition 11.3 is satisfied. Consider
the following strategies and beliefs: The sane incumbent
accommodates, and the entrant (correctly) anticipates
that the incumbent is sane when observing accommoda- 
tion; the crazy incumbent preys, and the entrant (correctly) 
anticipates that the incumbent is crazy when observing 
predation. Clearly, these strategies and beliefs form a 
separating PBE. 

Let us now look at the possibility of a pooling equilibrium.
Both types prey; so, as we saw, $\mu = p_1$ when
predation is observed. Now, the sane type, who loses
$D_1 - P_1$ in the first period, must induce exit (at least with
sufficient probability). Therefore, it must be the case that


$p_1 D_2 + (1 - p_1) P_2 \le 0.$ (11.4)


Conversely, assume that condition 11.4 holds, and consider
the following strategies and beliefs: Both types prey,
and the entrant has posterior beliefs $\mu = p_1$ when predation
is observed and $\mu = 1$ when accommodation is observed.
The sane type’s equilibrium profit is $P_1 + \delta M_1$; it
would become $D_1(1 + \delta)$ under accommodation. Thus, if
condition 11.3 is violated, the proposed strategies and
beliefs form a pooling PBE. (If condition 11.4 is satisfied
with equality, there exists not one but a continuum of
such equilibria.) If conditions 11.3 and 11.4 are violated,
the unique equilibrium is a hybrid PBE (with the entrant
randomizing when observing predation).
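
The case distinction above can be summarized in a few lines of code. This is only a sketch under stated assumptions: the function and argument names are hypothetical, `D2` and `P2` are read as the entrant's second-period profit against a sane and a crazy incumbent respectively, and knife-edge ties are resolved in favor of the weak inequalities 11.3 and 11.4.

```python
def classify_reputation_pbe(D1, M1, P1, D2, P2, p1, delta):
    """Classify the PBE of the two-period predation game.

    D1, M1, P1: sane incumbent's per-period profit under duopoly,
                monopoly, and predation (M1 > D1 > P1).
    D2, P2:     entrant's second-period profit against a sane and
                a crazy incumbent (assumed interpretation).
    p1:         prior probability that the incumbent is sane.
    delta:      discount factor between the two periods.
    """
    cond_11_3 = delta * (M1 - D1) <= D1 - P1      # preying does not pay
    cond_11_4 = p1 * D2 + (1 - p1) * P2 <= 0      # entrant exits when uninformed
    if cond_11_3:
        return "separating"
    if cond_11_4:
        return "pooling"
    return "hybrid"


print(classify_reputation_pbe(D1=2, M1=5, P1=0, D2=1, P2=-3, p1=0.5, delta=0.9))
```

Raising `delta` makes the future monopoly gain more valuable and moves the outcome away from separation; raising `p1` makes the entrant more willing to stay after predation and moves the outcome from pooling to hybrid.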

The following exercise extends this analysis to n
periods.


Exercise 11.12°}* 

(i) Selten (1978) offers the following paradox. A chain
store (the incumbent) is located in $n$ geographically separated
markets. It faces $n$ potential entrants, one in each
market. The potential entrants must make their entry
decisions sequentially. At date $i$ ($i = 1, \ldots, n$), entrant $i$,
who has observed what happened in the $i - 1$ preceding
markets, decides whether to enter (I, for in) or not
enter (O, for out). If it enters, the incumbent chooses
whether to prey (P) or acquiesce (A); the payoffs in market
$i$ are as shown in figure 11.12 (the first payoff refers to the
incumbent, the second to entrant $i$). The entry and predation
decisions in market $i$ are observed before entrant
$i + 1$ decides whether to enter. The incumbent's payoff is
the sum of its $n$ payoffs. Show that with one potential
entrant ($n = 1$), the unique perfect equilibrium has the
entrant enter and the incumbent acquiesce. Then demonstrate
Selten’s chain-store paradox: No matter how large
$n$ is, all the entrants enter and the incumbent never preys.

(ii) Now, suppose, with Kreps, Milgrom, Roberts, and
Wilson, that the entrants do not know the incumbent's


Figure 11.12
objective function perfectly. With probability $1 - x$, the
payoffs are as above; with probability x, the incumbent 
gets $ (say) instead of —1 when preying (the payoffs are 
otherwise unchanged). Solve the one-entrant case. Then, 
by backward induction, derive the equilibrium for entrant 
i for posterior beliefs $x_i$ at the beginning of date i. How
big should $x_1 = x$ be to keep entrant 1 out?


11.5.2 Sequential Bargaining with Incomplete 
Information 


Bargaining usually involves asymmetric information. For 
instance, the seller (respectively, the buyer) may have 
incomplete information about the buyer's willingness to 
pay (respectively, the seller's reservation price). To the
extent that bargaining proceeds through a series of offers, 
refusals, counteroffers, and so on, it is natural to model it 
as a dynamic game of incomplete information. A simple 
instance of such a formalization is given here. (A straight- 
forward reinterpretation of such a game was used in for- 
malizing intertemporal price discrimination by a monop- 
olist in chapter 1.) 

Consider the following simple bargaining problem: A
buyer and a seller negotiate over one unit of a product (or
a contract). The seller makes an initial offer of $p_1$, which
the buyer either accepts or refuses. If the offer is refused,
the seller makes a second offer, $p_2$. If the second offer is
also refused, each party goes his own way and the seller
keeps his product. The value of the product is $s$ for the
seller and $b$ for the buyer. (Value must be interpreted in a
broad sense that includes the possibility of outside exchanges
with other parties.) Assume that the discount
factor is $\delta_s$ for the seller and $\delta_b$ for the buyer and that both
parties are risk-neutral. Therefore, the utilities of the seller
and the buyer are $[p_1, b - p_1]$ if $p_1$ is accepted,
$[\delta_s p_2, \delta_b (b - p_2)]$ if $p_2$ is accepted, and $[\delta_s s, 0]$ if $p_2$ is refused.
Incomplete information is restricted to the following aspect:
The seller does not know whether the value of the product
is $\underline{b}$ or $\bar{b}$ ($\underline{b} < \bar{b}$) for the buyer. The seller puts equal
probabilities on these two values, whereas the buyer knows $b$.
Both parties know everything else. We assume that there
is always some potential gain from exchange: $s < \underline{b} < \bar{b}$.
Moreover, we assume that $\underline{b} > (\bar{b} + s)/2$. This condition
implies that if the seller were authorized to make only one
offer, he would choose to sell surely (by charging $\underline{b}$)
rather than run the risk of losing the sale (by trying to sell
at price $\bar{b}$). We now define the strategies and the beliefs.
First, the seller makes an offer $p_1$. The buyer accepts


from this a posterior probability that the buyer's
willingness to pay is equal to $\bar{b}$, which we denote as
$\mu(\bar{b} \mid p_1)$. Obviously, $\mu(\underline{b} \mid p_1) = 1 - \mu(\bar{b} \mid p_1)$. The seller
then makes a subsequent offer, $p_2(p_1)$. Finally, the buyer
accepts $p_2$ ($d_2 = 1$) or refuses $p_2$ ($d_2 = 0$), according to
the decision rule $d_2(p_1, p_2, b)$. (In equilibrium, $p_1$ will not
be an argument of $d_2$.)

(1) The first step consists of writing the “self-selection
constraints” that must be satisfied by an equilibrium path.
By self-selection constraints we mean (in a general enough
way) the constraints expressing the fact that in equilibrium
a player of a given type does not strictly prefer to
adopt a strategy other than his own (such as that of the
same player when he is of a different type). Here, the
buyer can be of two types, $\underline{b}$ and $\bar{b}$. We refer to “the buyer
of type $\underline{b}$” (respectively, “type $\bar{b}$”) to designate the buyer
who attaches a value $\underline{b}$ (respectively, $\bar{b}$) to the product.
During the second period, self-selection constraints are
trivial; the buyer of type $b$ buys if and only if $p_2 \le b$.
Analogously, the buyer of type $b$ will accept an offer $p_1$
during the first period if and only if


$b - p_1 \ge \delta_b \{\max[b - p_2(p_1), 0]\}.$ (11.5)


Equation 11.5 represents the following: If the buyer
accepts offer $p_1$, his utility is $b - p_1$; and if he refuses,
the seller charges $p_2(p_1)$.

Writing equation 11.5 for $\underline{b}$ and $\bar{b}$ shows that if the
buyer of type $\underline{b}$ accepts $p_1$, then, a fortiori, the buyer of
type $\bar{b}$ accepts $p_1$ (simply because he attaches more value
to the product than the buyer of type $\underline{b}$ and is, therefore,
more eager to buy).

(2) The second step is to look at the consequences
of the self-selection constraints on the seller's posterior
probability distribution over $b$. Clearly, we may assume
that $p_1$ was refused (otherwise, bargaining is completed,
and the distribution no longer matters). Since an offer
accepted by the type-$\underline{b}$ buyer is automatically accepted by
the type-$\bar{b}$ buyer, the probability of facing the latter type
when the offer is refused is, at most, 1/2.

(3) The third step is to return to the strategy space by
examining the effects of this distribution on the strategy
of the seller in the second period. When the seller can
make only one offer, and his distribution over $b$ is uniform,
he behaves cautiously by assumption (that is, he
charges $\underline{b}$). A fortiori, when his subjective probability of
facing buyer $\bar{b}$ is less than 1/2 he must also behave cautiously;
therefore, regardless of $p_1$, we have $p_2(p_1) = \underline{b}$.
Now the final two steps of the characterization of the
equilibrium are obvious.

(4) Forecasting that the seller will charge $\underline{b}$ if he refuses
the first offer, the buyer of type $\underline{b}$ accepts it if and only
if $p_1 \le \underline{b}$. Buyer $\bar{b}$ accepts $p_1$ if and only if
$\bar{b} - p_1 \ge \delta_b (\bar{b} - \underline{b})$, or simply


$p_1 \le \tilde{b} \equiv \delta_b \underline{b} + (1 - \delta_b) \bar{b}.$


(5) Finally, the seller chooses $p_1$ in order to maximize
his expected profit. He chooses between $\underline{b}$ and $\tilde{b}$
depending on whether $\underline{b}$ is greater or less than


$(\tilde{b} + \delta_s \underline{b})/2.$ (11.6)


If he proposes $\underline{b}$, this offer is accepted by both types of
buyer. On the other hand, if he proposes $\tilde{b}$, he benefits
from buyer $\bar{b}$’s impatience, knowing full well that he will
be able to enter into an exchange in the second period if
the buyer turns out to be of type $\underline{b}$. Because during our
characterization we defined the players’ strategies and
beliefs for each history of the game, we conclude that the
game has a unique perfect Bayesian equilibrium.
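
Steps (3) through (5) can be traced numerically. The following is a minimal sketch, not part of the original analysis: the function name, argument names, and the tie-breaking rule (the seller charges the low price when indifferent) are assumptions made here for illustration.

```python
def solve_bargaining(s, b_low, b_high, delta_s, delta_b):
    """Equilibrium-path prices of the two-offer bargaining game.

    s:       seller's valuation.
    b_low:   low buyer valuation; b_high: high buyer valuation
             (each with prior probability 1/2).
    delta_s, delta_b: discount factors of seller and buyer.
    Returns (p1, p2, seller's expected profit).
    """
    if not (s < b_low < b_high):
        raise ValueError("need s < b_low < b_high")
    if not b_low > (b_high + s) / 2:
        raise ValueError("need b_low > (b_high + s) / 2")
    # Step (3): the second-period offer is always the low valuation.
    p2 = b_low
    # Step (4): highest first-period price the high type accepts.
    b_tilde = delta_b * b_low + (1 - delta_b) * b_high
    # Step (5): compare a sure sale at b_low with screening at b_tilde.
    profit_sure = b_low
    profit_screen = 0.5 * b_tilde + 0.5 * delta_s * b_low  # expression 11.6
    if profit_sure >= profit_screen:  # tie broken toward the sure sale
        return b_low, p2, profit_sure
    return b_tilde, p2, profit_screen


print(solve_bargaining(s=0, b_low=6, b_high=10, delta_s=0.9, delta_b=0.5))
print(solve_bargaining(s=0, b_low=6, b_high=10, delta_s=0.9, delta_b=0.9))
```

In the first call the buyer is impatient, so the seller screens with a high first offer; in the second the buyer is patient, the screening price falls, and the seller prefers the sure sale.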

A more general analysis of this model, including the 
case in which the buyer also has incomplete information 
about the value of the product to the seller, can be found 
in Fudenberg and Tirole 1983. 


11.5.3 Warranty as a Signal of Quality 


A buyer is willing to buy one unit of a product of uncertain
quality (for example, a used car). More precisely, the
product has a probability $\Pi$ of working properly and a
probability $1 - \Pi$ of being defective. The seller knows
$\Pi$; so $\Pi$ is his “type.” The buyer knows that $\Pi$ belongs
to an interval $[\underline{\Pi}, \bar{\Pi}]$, and he has a prior probability
distribution defined over this interval. The buyer receives
utility $\bar{u}$ if he does not purchase from this seller (for
example, $\bar{u}$ is the utility he receives if he does not consume
the product; alternatively, $\bar{u}$ is the buyer's reservation
utility, which is related to his willingness to pay for
searching among other sellers). Let $I_1$ be the gross monetary
value of a working product, and $I_2$ the gross monetary
value of a nonworking product ($I_1 > I_2$). The seller
offers a contract $(p, g)$: a price $p$ and an indemnity $g$ paid
to the buyer if the product does not work. The
consumer's utility is $u(I_1 - p)$ if the product works;
it is $u(I_2 - p + g)$ if it does not, where the utility function
of the buyer who consumes the product is assumed


as an expected profit of $p - (1 - \Pi) g$ if the
the contract (p, g). The buyer and the seller 


the buyer is completely insured. In other words, the warranty
always specifies $g_0 = I_1 - I_2$.

(2) In the case of incomplete information, if the seller
proposes the indemnity payment $g_0$, the buyer's utility
when he consumes the product does not depend on his
subjective probability distribution over $\Pi$. Therefore, he
accepts any price that does not exceed $p_0$, where $p_0$ is
defined by


$u(I_1 - p_0) = u(I_2 - p_0 + g_0) = \bar{u}.$


(To simplify the analysis, we assume that $p_0 - (1 - \Pi) g_0 > 0$.)
When the seller proposes $c_0 = (p_0, g_0)$, the buyer
obtains zero surplus for any $\Pi$.

(3) The contract $c_0$ is Pareto optimal for any $\Pi$.²⁴ On
the other hand, the seller would offer some other contract
$c = (p, g)$ only if he expected to make more profit than
with $c_0$. Since any other contract is suboptimal, the buyer
will derive negative surplus from such an exchange,
and therefore he should not make the purchase. In other
words, a rational buyer should be suspicious if the seller
does not offer a complete warranty.

In a two-period game, we formalize our intuition that
the seller proposes the contract $c_0$ in equilibrium. First,
the seller proposes a contract, $c = (p, g)$, which depends
only on his own information, i.e., $c(\Pi)$. Then, the buyer
chooses to buy ($d(c) = 1$) or not to buy ($d(c) = 0$). His
behavior depends on his beliefs about $\Pi$ and, more precisely,
on the expected $\Pi$, given the buyer's prior probability
distribution over $\Pi$ and the information in the
contract proposed by the seller, $c = (p, g)$. Let $\mu(c)$ be this
posterior expectation. In perfect Bayesian equilibrium, $c(\Pi)$
maximizes the seller's expected profit with his objective
probability specified by $\Pi$ and with $d(\cdot)$ taken as given.
$\mu(c)$ is consistent with the function $c(\cdot)$ in the Bayesian
sense, and $d(c)$ maximizes the buyer's expected utility
given $\mu(c)$.

Assume that the seller, who has probability $\Pi$, proposes
a contract $c = (p, g)$ that differs from $c_0 = (p_0, g_0)$.
Since $d(c_0) = 1$, the seller's expected profit for $c$ must
satisfy


$p - (1 - \Pi) g \ge p_0 - (1 - \Pi) g_0.$ (11.7)


This inequality must be satisfied for all the values of $\Pi$ for
which the seller chooses $c$. Since $\mu(c)$ is (from Bayes’ law)
a weighted average of these values, we have


$p - [1 - \mu(c)] g \ge p_0 - [1 - \mu(c)] g_0.$ (11.8)


On the other hand, the buyer's surplus is non-negative
(otherwise, he would not make the purchase, and the
contract could not be optimal for the buyer). Therefore,


$\mu(c) u(I_1 - p) + [1 - \mu(c)] u(I_2 - p + g) \ge \bar{u} = u(I_1 - p_0).$ (11.9)


Using Jensen's inequality (if $f(x)$ is strictly concave, then
$f(Ex) > Ef(x)$, where the expectation operator refers to $x$),
we get


$\mu(c)(I_1 - p) + [1 - \mu(c)](I_2 - p + g) > I_1 - p_0.$ (11.10)


It is easy to see that 11.8 and 11.10 are contradictory
(using $g_0 = I_1 - I_2$).
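
One way to spell out the contradiction is the short derivation below. It is a sketch that uses only the definitions above, namely the full-insurance indemnity $g_0 = I_1 - I_2$ and the posterior expectation $\mu(c)$; rearranging 11.10 yields the strict reverse of 11.8.

```latex
\begin{align*}
\mu(c)(I_1 - p) + [1-\mu(c)](I_2 - p + g) &> I_1 - p_0
  && \text{(11.10)} \\
\iff\quad p_0 - p + [1-\mu(c)]\,g &> [1-\mu(c)](I_1 - I_2)
  = [1-\mu(c)]\,g_0 \\
\iff\quad p - [1-\mu(c)]\,g &< p_0 - [1-\mu(c)]\,g_0 ,
\end{align*}
% The last line contradicts (11.8), which requires
% p - [1 - \mu(c)] g \ge p_0 - [1 - \mu(c)] g_0.
```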

Once we have observed that for any $\Pi$ the equilibrium
offer should be $c_0$, which is accepted by the buyer, it is
elementary to construct a PBE that generates this equilibrium
path. Any contract $c$ different from $c_0$ must have
zero probability in equilibrium. Therefore, we can choose
the beliefs freely (while $\mu(c_0)$ is necessarily the expectation
of $\Pi$ for the prior distribution, given pooling and
Bayes’ rule). It suffices to assume that the buyer believes
that such a contract is proposed by the “worst” seller (the
seller with the lowest probability $\Pi$). Therefore, if the


24. The buyer obtains full insurance under $c_0$ and, therefore, does not care
about the true value of $\Pi$.
seller proposes contract $c = (p, g)$, the buyer accepts it if


, it is clear from this inequality that the seller's
profit is less than if he had proposed $c_0$.²⁵


11.5.4 Signal Jamming


Each of the preceding three examples involved a player 
with private information who attempted to manipulate 
another party's beliefs about this information. This fourth 
example shows how beliefs can be manipulated under
imperfect (but not incomplete) information. In this exam- 
ple, a player garbles the payoff-relevant information 
received by another player by choosing an unobservable 
action. 

The simplest such model involves two firms, two actions,
two types, two profit levels, and two periods. Firms
1 and 2 compete in period 1. Firm 1 has two possible
actions: S (soft; accommodate) and T (tough; prey). Firm
2's action is not described here (one can think of it as
always accommodating). Firm 1 has perfect information
about its own payoff. From a single-period point of view,
it prefers playing S. Playing T costs $c > 0$. This is common
knowledge. Firm 2 has two potential profit levels: H
and L, with $H > 0 > L$. The a priori distribution puts
weight $\alpha$ on H and weight $1 - \alpha$ on L if firm 1 plays S.
(Thus, there are two “types,” H and L; the terminology is
abused here, because we will assume that firm 2 does not
know its type.) However, if firm 1 plays T, firm 2 makes
profit L regardless of its type. Firm 2 does not observe
whether firm 1 plays S or T in the first period and does
not know its type. Instead, it must infer this type from the
observation of its profit and what it expects firm 1 to
have played. Except for firm 1's action, the two firms have


ince this is the last ad firm 1 plays its 


period gain of becoming a monopoly for firm 1 is g > c. 
(For simplicity we will assume that g and c do not depend 
on firm 2's type, although this is not essential.) Firm 2 
earns zero profit in the second period if it leaves.²⁶

To solve for equilibrium, let us consider the two potential
actions for firm 1 in period 1. Suppose it plays S in
equilibrium. Firm 2's first-period profit is identical to, and
therefore perfectly informative about, its second-period
profit. Thus, it stays if and only if it makes profit H in
the first period. Now, for S to be the equilibrium action,
playing T must not increase firm 1's profit. By playing
T in the first period, firm 1 loses $c$, but gains $g$ whenever
it changes firm 2's decision to stay. This occurs when firm
2 is of type H. Playing T gives firm 2 a profit L, and
that firm (thinking firm 1 has played S) leaves. Thus, a
necessary condition for S to be an equilibrium action is


$c \ge \alpha g.$ (11.11)


Conversely, if condition 11.11 is satisfied, firm 1's playing
S and firm 2’s use of the exit rule “exit if and only if
first-period profit is equal to L” form a perfect Bayesian
equilibrium.

Second, suppose that T is the equilibrium action. Then
profit is L whatever firm 2's type: The first-period profit
is uninformative. So the posterior beliefs when observing
L are the same as prior beliefs. Firm 2 stays when observing
L if and only if


$\alpha H + (1 - \alpha) L \ge 0.$ (11.12)


If condition 11.12 is satisfied, T cannot be an equilibrium
action. Firm 1 could save $c$ without any change in second-period
profit. If 11.12 is not satisfied, then playing T is an
equilibrium action as long as the cost of doing so ($c$) is
lower than the gain, $\alpha g$ (if firm 1 deviates and plays S, firm
2 exits if and only if it is of the low-profit type). So T is
an equilibrium action if and only if both 11.11 and 11.12
are violated.

When condition 11.11 is violated and 11.12 is satis- 
fied, we already know that no pure-strategy equilibrium 
exists. Hence, if an equilibrium exists (we actually know 
it does—see section 11.6), it must involve firm 1’s mixing 


publication of Spence’s 1977 article on consumers’ perceptions of product
quality. Our discussion has followed Grossman 1980. See also Milgrom 1981b.

26. This signal-jamming game is a simplification of that in Fudenberg and
Tirole 19863. Other signal-jamming games can be found in Holmstrom 1983,
in Riordan 1985, and in Gibbons 1985.

between S and T. Suppose that firm 1 plays T with probability
$y$ (and S with probability $1 - y$), and that firm 2,
when observing profit L, exits with probability $z$ (and
stays with probability $1 - z$). For firm 1 to randomize, it
must be indifferent between the two actions; hence,

$c = \alpha z g.$ (11.13)

(Playing T modifies firm 2's decision only when it is of
type H, and this decision is changed only with probability
$z$.) Because condition 11.11 is violated, condition 11.13
defines a unique $z$ in (0, 1). Next, for firm 2 to be indifferent
about exiting or staying when observing L, it must
be the case that


$\eta H + (1 - \eta) L = 0,$


where $\eta$ is the posterior probability that firm 2 is of
type H. From Bayes’ rule,


$\eta = \dfrac{\alpha y}{\alpha y + (1 - \alpha)}.$


[When the type is H (respectively, L), the probability
of ending up with profit L is equal to the probability $y$ of
predation (respectively, 1).] Thus, we need


$\alpha y H + (1 - \alpha) L = 0.$ (11.14)


Because condition 11.12 is satisfied, condition 11.14 defines
a unique $y$ in (0, 1]. We thus conclude that there
exists a unique equilibrium in this game.
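
The three regions of the signal-jamming game can be collected in a small routine. This is an illustrative sketch only: the function name, the return convention, and the handling of knife-edge cases (weak inequalities in 11.11 and 11.12) are assumptions made here, not part of the original analysis.

```python
def signal_jamming_equilibrium(alpha, H, L, c, g):
    """Equilibrium of the two-period signal-jamming game.

    alpha: prior probability that firm 2's profit level is H.
    H, L:  firm 2's possible profit levels (H > 0 > L).
    c:     firm 1's cost of playing T; g: its gain from monopoly (g > c > 0).
    Returns (kind, y, z), where y = Pr(firm 1 plays T) and
    z = Pr(firm 2 exits after observing profit L).
    """
    if not (0 < alpha < 1 and H > 0 > L and g > c > 0):
        raise ValueError("need 0 < alpha < 1, H > 0 > L, g > c > 0")
    cond_11_11 = c >= alpha * g                     # preying is too costly
    cond_11_12 = alpha * H + (1 - alpha) * L >= 0   # uninformed firm 2 stays
    if cond_11_11:
        return "S", 0.0, 1.0
    if not cond_11_12:
        return "T", 1.0, 1.0
    z = c / (alpha * g)                   # firm 1 indifferent, from 11.13
    y = -(1 - alpha) * L / (alpha * H)    # firm 2 indifferent, from 11.14
    return "mixed", y, z


print(signal_jamming_equilibrium(alpha=0.5, H=3, L=-1, c=1, g=4))
```

In the mixed region, `z` rises with the cost of preying and `y` rises as firm 2's loss `L` becomes more severe relative to `H`, which is what the two indifference conditions require.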


These four games were selected for their simplicity. In
particular, each of them has a unique equilibrium. Often,
however, dynamic games of incomplete or imperfect information
are plagued by a multiplicity of equilibria. This
is due in particular to the fact that Bayes’ law has no bite
when the observed move has a probability of zero.²⁷ The
leeway in specifying conjectures in such events may give
rise to a great number of equilibria. The supplementary
section provides examples of multiple equilibria and shows
how one may select from among these using refinements
of perfect Bayesian equilibrium.


11.6 Supplementary Section
11.6.1 Existence of Equilibrium 


The basic existence result concerns the existence of a
Nash equilibrium. The existence of Bayesian, perfect, and
perfect Bayesian equilibria is proved through simple
reinterpretations of this result. Consider a normal-form
game with a finite number of players ($i = 1, \ldots, n$). Let $A_i$
denote the set of player $i$’s feasible actions,
$a = (a_1, \ldots, a_i, \ldots, a_n)$ the vector of actions (where $a_i$ belongs to $A_i$),
and $\Pi^i(a)$ player $i$’s payoff. The following theorem is a
special case of one contained in Debreu 1952. (Debreu
also allows a player's set of feasible actions to depend on
the other players’ actions.)


THEOREM If, for all $i$, $A_i$ is a compact and convex subset
of a Euclidean space, and $\Pi^i$ is continuous in $a$ and
quasi-concave in $a_i$, there exists a Nash equilibrium, i.e., a vector
$a^*$ such that, for all $i$ and $a_i$ in $A_i$,


$\Pi^i(a^*) \ge \Pi^i(a_i, a^*_{-i}).$


This theorem (which can be proved by a fixed-point
method) has a straightforward application to games with
a finite number of actions. It shows that for such games an
equilibrium always exists in mixed strategies (it need not
exist in pure strategies; see, e.g., the game of matching
pennies): Let $\mathcal{A}_i$ denote the set of probability distributions
over the finite set $A_i$ of pure strategies. So we enlarge the
action set to allow mixed strategies: $a_i \in \mathcal{A}_i$. $\mathcal{A}_i$ is
homeomorphic to a simplex, and hence is compact and convex.
$\Pi^i$ becomes an expectation over pure-strategy outcomes;
it is therefore linear (hence, quasi-concave) in $a_i$ and
polynomial (hence, continuous) in $a$. So, there always exists a
mixed-strategy equilibrium (Nash 1950).
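
To make the matching-pennies remark concrete, the sketch below computes the interior mixed-strategy equilibrium of a 2x2 game from the two indifference conditions. The function name and the payoff convention (the row player wins 1 when the pennies match, the column player wins 1 otherwise) are assumptions introduced for illustration; the routine handles only games that have an interior equilibrium.

```python
from fractions import Fraction


def mixed_equilibrium_2x2(A, B):
    """Interior mixed equilibrium of a 2x2 bimatrix game.

    A[i][j], B[i][j]: payoffs of the row and column player when the
    row player picks i and the column player picks j (integers).
    Returns (p, q): probabilities that row and column play action 0.
    """
    # q makes the row player indifferent between its two actions.
    den_q = (A[0][0] - A[1][0]) - (A[0][1] - A[1][1])
    # p makes the column player indifferent between its two actions.
    den_p = (B[0][0] - B[0][1]) - (B[1][0] - B[1][1])
    if den_q == 0 or den_p == 0:
        raise ValueError("indifference conditions are degenerate")
    q = Fraction(A[1][1] - A[0][1], den_q)
    p = Fraction(B[1][1] - B[1][0], den_p)
    if not (0 < p < 1 and 0 < q < 1):
        raise ValueError("no interior mixed equilibrium")
    return p, q


# Matching pennies (assumed convention: row wins on a match).
row = [[1, -1], [-1, 1]]
col = [[-1, 1], [1, -1]]
print(mixed_equilibrium_2x2(row, col))
```

Because each player's expected payoff is linear in his own mixing probability, indifference is exactly what allows an interior mix to be a best response, in line with the quasi-concavity argument in the text.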

The above theorem (or variants thereof) also provides 
conditions for the existence of an equilibrium in games 
with a continuum of actions. However, some games in 
industrial organization (e.g., auctions, Cournot competi- 
tion, location games) have discontinuous and/or non- 
quasi-concave payoff functions. See Dasgupta and Maskin 


27. In the reputation game, the crazy type always preys, so “preying” is a
positive-probability event (Bayes’ rule applies), and “accommodating” always
signals the sane type. Thus, the leeway does not exist. Similarly, in the
bargaining game, “refusing” is a positive-probability event in the interesting
region for price offers (prices above $\underline{b}$, which are rejected by type $\underline{b}$).
The warranty game is special in that the Pareto-optimal action (which is the
equilibrium action under complete information) is the same for all types. In
the signal-jamming game, the incumbent's move is not directly observed.

1986 for sufficient conditions for a pure-strategy equilibrium
(with quasi-concave payoff functions) and a mixed-strategy
equilibrium (in the absence of quasi-concavity) to
exist.

The theorem can also be used to prove the existence of
equilibrium for versions of the Nash concept for incomplete
information or dynamics. Let us assume that each
player has only a finite number of feasible pure strategies.
Similarly, under incomplete information, each player $i$ has
only a finite number of potential types $|T_i|$ (i.e., nature has
only a finite number of pure strategies).


11.6.1.1 Existence of a Bayesian Equilibrium


It suffices to transform the game between the $n$ players
into a game between $\sum_{i=1}^{n} |T_i|$ players. That is, each
player has $|T_i|$ incarnations that play in their own interest.
(Player $i$, when of type $t_i$, does not care about the payoff
he would have were he of type $t_i'$.) This is still a game
with a finite number of players and a finite number of
pure strategies. Hence, it admits a mixed-strategy equilibrium.
The equilibrium strategies in the transformed game
are clearly equilibrium strategies of the original game.


11.6.1.2 Existence of a Perfect Equilibrium


For games of perfect information, Kuhn's algorithm of
backward induction yields a constructive proof of the
existence of a perfect equilibrium. More generally, the
existence of a perfect equilibrium results from the general
proof for perfect Bayesian equilibrium (although for
games of almost perfect information there exist simpler
proofs).


11.6.1.3 Existence of a Perfect Bayesian Equilibrium


A “trembling hand” perfect equilibrium (Selten 1975) is a
slightly more refined concept than Kreps and Wilson's
(1982) sequential equilibrium, which itself is a bit more
refined than perfect Bayesian equilibrium; thus, its existence
yields the existence of a PBE and of a sequential equilibrium
as by-products.

Consider the normal form of a game. For a given set of
actions $A_i$ for player $i$, we can define the set of totally
mixed strategies for that player:


$\left\{ \sigma_i : \sum_{a_i \in A_i} \sigma_i(a_i) = 1,\ \sigma_i(a_i) > 0 \text{ for all } a_i \right\}.$

That is, $\sigma_i(a_i)$, the probability that player $i$ plays $a_i$, is
required to be strictly positive.

Fix $\varepsilon$, and let $\{\varepsilon(a_i)\}_{a_i \in A_i}$ denote a set of numbers such
that $0 < \varepsilon(a_i) < \varepsilon$ for all $a_i$. Now, consider the following
maximization problem for player $i$:


$\max_{\sigma_i} \Pi^i(\sigma_i, \sigma_{-i})$ subject to $\sigma_i(a_i) \ge \varepsilon(a_i)$ for all $a_i$, (11.15)


where $\sigma_{-i} = (\sigma_1, \ldots, \sigma_{i-1}, \sigma_{i+1}, \ldots, \sigma_n)$ denotes the mixed
strategies played by the other players. In other words,
player $i$ is constrained to play each of his possible strategies
with at least a small probability.

An “$\varepsilon$-perfect equilibrium” is a set of (totally mixed)
strategies $\{\sigma_i\}_{i=1}^{n}$ such that, for some $\{\varepsilon(a_i)\}_{a_i \in A_i}$ with
$0 < \varepsilon(a_i) < \varepsilon$, $\sigma_i$ solves 11.15 for each player $i$. In other
words, an $\varepsilon$-perfect equilibrium is a Nash equilibrium of a
constrained game. For given $\{\varepsilon(a_i)\}$, such a Nash equilibrium
exists from Debreu’s theorem. (The only difference
with the proof of existence of a mixed-strategy equilibrium
is that mixed strategies must belong to subsets of
the simplices; however, this is irrelevant, because the
subsets are compact and convex.)

A “trembling hand” perfect equilibrium is any limit of
$\varepsilon$-perfect equilibria as $\varepsilon$ tends to zero. Because the strategy
spaces are compact, such a limit exists (there always exists
a converging subsequence). Because the profit functions
$\Pi^i$ are continuous, any limit is a Nash equilibrium (from
condition 11.15).


Remark 1 When a player plays at various information
sets, this equilibrium concept ends up being not quite
refined enough. Selten introduces a second refinement,
which operates like the first but on the so-called agents’
normal form. (The agents’ normal form consists in considering
each information set as a distinct player, with the
objective function of the player of whom it is an incarnation.
This defines a normal form with a larger number of
players, to which the preceding techniques can be applied.
The difference with the previous approach is basically
that a player's trembles at different information sets must
be independent, so there are fewer “trembling hand” perfect
equilibria. The concept of a “trembling hand” perfect
equilibrium actually refers to this second refinement.) For
a discussion, see Fudenberg and Tirole 1986c.


Remark 2 The main point is that a “trembling hand”
perfect equilibrium is not only a Nash equilibrium; it is
also a PBE. This is where Selten’s trick of introducing
trembles pays. The perturbed game, with the required
minimum trembles $\varepsilon(a_i)$, has no zero-probability action.
Hence, in an extensive form (with the normal form defined
by the above game) there is no zero-probability
event. Bayes’ rule bites everywhere, and Nash equilibria
automatically satisfy the perfection requirement. (To
visualize this, go back to game 1. If player 1 is forced to
play R with probability at least $\varepsilon(R) > 0$, player 2 necessarily
puts as much weight as he can on r following R.
Hence, l will not be a limit of optimal reactions to R, even
when $\varepsilon(R)$ tends to 0. On the normal form of game 1,
Selten’s construction yields only the unique perfect equilibrium.)
Thus, noncredible threats are not part of a Nash
equilibrium of a perturbed game, nor are they part of a
“trembling hand” perfect equilibrium in the limit.


Remark 3 Whereas Selten works on the normal form,
Kreps and Wilson (1982) use the extensive form and put 


criterion has no bite. However, in more gen- 
it (among other things) imposes consistency 
liefs of different players or of the same player 


An issue that was carefully eluded in the text is the common
and high multiplicity of equilibria in dynamic games
with incomplete or imperfect information. To understand
why this issue often arises, consider a game in which
player 1 has private information and plays first and player
2, who cares about player 1's information, reacts to player
1's action. (Such a game will be called a signaling game
below.) Suppose that we want to rule out some action $a_1$
as an equilibrium action for player 1. To this purpose, let
us assume that $a_1$ is indeed not optimal for player 1 (that
is, $a_1$ has “probability 0 in equilibrium”; i.e., it is an
“off-the-equilibrium-path” or an “out-of-equilibrium” event).
In this event, Bayes’ rule has no bite and any beliefs about
player 1's type after a deviation to $a_1$ are admissible. In
many games there exist types for player 1 that, if they
were common knowledge, would induce player 2 to take
some action that would hurt player 1 considerably. For
instance, if player 2 thinks that player 1 has a high marginal
cost or that demand is high, he will enter the market
or accumulate a large capacity. Now, if we specify that
after observing $a_1$ player 2 believes that player 1 is of
such a type, so that player 2 indeed takes an action
detrimental to player 1, then player 1 does not want to
choose $a_1$ after all. The leeway in specifying
off-the-equilibrium-path beliefs usually creates some leeway in
the choices of equilibrium actions; by ruling out some
potential equilibrium actions, one transforms other actions
into equilibrium actions. Hence, it is not surprising that
one often ends up with a continuum of perfect Bayesian
equilibria.

But multiplicity, as usual, raises doubts about the very
nature of equilibrium. How do players coordinate on a
particular equilibrium? Do they choose a "focal" one? Do
they learn? If they do, what is the learning process? Many
of the recent developments in game theory concern the
refinement of the notion of equilibrium by placing
restrictions on off-the-equilibrium-path events, where Bayes'
rule does not place restrictions. We are now endowed with
nearly a dozen refinements of perfect Bayesian equilibrium.
Although the field is changing rapidly and these notes will
soon be outdated, let us consider two such refinements which
have been used often and are easy to apply.28

Subsection 11.6.2.1 (which follows Fudenberg and
Tirole 1986c) describes the simplest game in which the
issues of updating and perfection arise: the signaling game.
Because of the complexity of dynamic games with incom- 
plete information, industrial-organization economists have 


28. For more complete discussions of refinements, see Fudenberg and Tirole
1986c, Cho and Kreps 1987, and the references in these two papers.

iny (perhaps too many) applications of this
. We consider how to apply the two refine-


The following game is called a signaling game because a
variant of it was used by Spence (1974) to study job-market
signaling. There are two players. Player 1 is the leader (also
called the sender, because he sends a signal); player 2 is the
follower (or receiver). Player 1 has private information about
his own type $t_1$ in $T_1$, and chooses action $a_1$ in $A_1$
(the set of probability distributions over $A_1$ is
$\mathcal{A}_1$). Player 2, whose type is common knowledge for
simplicity, observes $a_1$ and chooses $a_2$ in $A_2$. Payoffs
are equal to $\Pi^i(a_1, a_2, t_1)$, where i = 1, 2.
Before the game begins, player 2 has prior beliefs $p_1(t_1)$


Player 2, who observes player 1's move before choosing
his own action, should update his beliefs about $t_1$ and base


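A perfect Bayesian equilibrium (PBE) of the signaling game
is a set of strategies $a_1^*(t_1)$ and $a_2^*(a_1)$ and
posterior beliefs $p_1(t_1 \mid a_1)$ such that

$(P_1)$  $a_1^*(t_1) \in \arg\max_{a_1} \Pi^1(a_1, a_2^*(a_1), t_1)$,

$(P_2)$  $a_2^*(a_1) \in \arg\max_{a_2} \sum_{t_1} p_1(t_1 \mid a_1)\, \Pi^2(a_1, a_2, t_1)$,

(B)  $p_1(t_1 \mid a_1)$ is derived from the prior $p_1(\cdot)$, $a_1$, and $a_1^*(\cdot)$
using Bayes' rule (when applicable).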

$(P_1)$ and $(P_2)$ are the perfectness conditions. $(P_2)$ states
that player 2 reacts optimally to player 1's action given
his posterior beliefs about $t_1$. $(P_1)$ demonstrates the
optimal Stackelberg behavior by player 1; note that he takes
into account the effect of $a_1$ on player 2's action. (B)
corresponds to the application of Bayes' rule. The qualifier
"when applicable" stems from the fact that if $a_1$ is not
part of player 1's optimal strategy for some type, observing
$a_1$ is a zero-probability event and Bayes' rule does not pin
down posterior beliefs. Any posterior beliefs
$p_1(\cdot \mid a_1)$ are then admissible.

Both of the refinements below put restrictions on
"reasonable" beliefs $p_1(\cdot \mid a_1)$ following an
out-of-equilibrium action $a_1$.


Elimination of Weakly Dominated Strategies
Let $a_1$ and $a_1'$ denote two actions in $A_1$.


Definition 1  $a_1$ is weakly dominated by $a_1'$ for type $t_1$ in
$T_1$ if, for all $a_2$ and $a_2'$ in $A_2$,


$\Pi^1(a_1, a_2, t_1) \le \Pi^1(a_1', a_2', t_1)$   (11.16)


(with at least one strict inequality for some $(a_2, a_2')$). $a_1$ is
weakly dominated for type $t_1$ if it is weakly dominated by
some action $a_1'$ for type $t_1$.


Now suppose that $a_1$ is not an equilibrium action (i.e.,
is never played in equilibrium by any type). Although
Bayes' rule allows any posterior beliefs $p_1(\cdot \mid a_1)$,
player 2 should not put any weight on types for whom $a_1$ is
(weakly) dominated. Thus, we define, for the given action
$a_1$,


$J = \{t_1 \in T_1 \mid a_1 \text{ is weakly dominated for type } t_1\}$.


A "reasonable" restriction on beliefs following $a_1$ is that
$p_1(\cdot \mid a_1)$ have support $T_1 - J$, i.e.,

$\sum_{t_1 \in J} p_1(t_1 \mid a_1) = 0$.

(We assume that $T_1$ is countable and use the summation
sign, but this is irrelevant to the argument.) Because we
restrict beliefs, we reduce the number of potential
off-the-equilibrium-path payoffs, and therefore we make it
harder for an equilibrium to be sustained.
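
For a finite game, the set J can be computed by brute force. The sketch below is only an illustration: the tiny game (education levels 0 and 1, wages 1 and 2, types 1 and 2, payoff w - e/t) is a hypothetical discretization chosen here, not one taken from the text, and the function names are likewise invented.

```python
from itertools import product

def weakly_dominated(payoff, a1, a1_alt, t1, A2):
    """Definition 1: a1 is weakly dominated by a1_alt for type t1 if the
    payoff from a1 never exceeds the payoff from a1_alt, whatever the two
    reactions (a2, a2'), with strict inequality for at least one pair."""
    pairs = list(product(A2, A2))
    weak = all(payoff(a1, a2, t1) <= payoff(a1_alt, a2p, t1) for a2, a2p in pairs)
    strict = any(payoff(a1, a2, t1) < payoff(a1_alt, a2p, t1) for a2, a2p in pairs)
    return weak and strict

def dominated_types(payoff, a1, A1, A2, T1):
    """The set J of types for which a1 is weakly dominated by some other action."""
    return {t1 for t1 in T1
            if any(weakly_dominated(payoff, a1, alt, t1, A2)
                   for alt in A1 if alt != a1)}

# Hypothetical example: sender picks education e, receiver pays wage w.
def sender_payoff(e, w, t):
    return w - e / t

A1 = [0.0, 1.0]   # education levels (assumed)
A2 = [1.0, 2.0]   # wages (assumed)
T1 = [1.0, 2.0]   # low and high productivity (assumed)

J = dominated_types(sender_payoff, 1.0, A1, A2, T1)
print(J)
```

In this toy game, education 1 is weakly dominated for the low type only, so beliefs after observing it would be restricted to the high type.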

We can further refine the equilibrium notion by enlarging
the set of weakly dominated strategies. Definition 1
requires that $a_1$ be dominated by $a_1'$ for any reactions
$a_2$ and $a_2'$. But not all reactions are plausible. After
all, player 2, whatever his posterior beliefs, will adopt an
optimal reaction. So, let us introduce the set of best
reactions (responses) to some action $a_1$ for arbitrary
posterior beliefs $p_1$:


$BR(p_1, a_1) = \arg\max_{a_2 \in A_2} \left( \sum_{t_1 \in T_1} p_1(t_1)\, \Pi^2(a_1, a_2, t_1) \right)$.


Taking the union over all beliefs on $T_1$ gives the set of
best responses to $a_1$:


$BR(T_1, a_1) = \bigcup_{\{p_1 : \sum_{t_1} p_1(t_1) = 1\}} BR(p_1, a_1)$.

A stronger restriction on out-of-equilibrium beliefs begins
by replacing "for all $a_2$ and $a_2'$ in $A_2$" in definition 1
with "for all $a_2$ in $BR(T_1, a_1)$ and $a_2'$ in $BR(T_1, a_1')$."
This increases the number of weakly dominated strategies and
tightens the allowable support of posterior beliefs.
(Requiring that $a_2$ and $a_2'$ be best responses is in the
spirit of the iterated elimination of dominated strategies
discussed in the text.)


Elimination of Equilibrium Weakly Dominated Strategies 


Often there are too few types for which $a_1$ is weakly
dominated for the support of posterior beliefs to be pinned
down sufficiently. The "intuitive criterion" of Kreps (1984)
and Cho and Kreps (1987) offers to increase the number
of dominated strategies by looking at strategies dominated
by the proposed equilibrium outcome.29 Consider a
equilibrium with payoff $\Pi^{1*}(t_1)$ for type $t_1$.
that player 1 deviates from his equilibrium

Definition 2  $a_1$ is equilibrium weakly dominated for type


(11.16')


If everybody really believes that the proposed equilibrium
is the one being played (a basic tenet of the equilibrium
notion), player 2 knows that types in J have no incentive
to play $a_1$: whatever the associated posterior beliefs,
they do not do better than if they follow their equilibrium
strategy. A weakly dominated strategy is automatically
an equilibrium weakly dominated strategy.

Thus, it seems natural to restrict beliefs to $T_1 - J$ (see,
however, footnote 32). Some problems may arise, however.
For instance, $T_1 - J$ may be empty. Hence, the intuitive
criterion needs to restrict beliefs to $T_1 - J$ only when
some other condition is satisfied. For instance, Cho and
Kreps (1987) suggest restricting beliefs to $T_1 - J$ when
the following condition is met: For all $a_2$ in
$BR(T_1 - J, a_1)$, there exists $t_1$ (in $T_1 - J$) such that


$\Pi^1(a_1, a_2, t_1) > \Pi^{1*}(t_1)$.   (11.17)


Condition 11.17 states that, no matter what posterior
beliefs are formed that do not put weight on J, there exists
some type $t_1$ who would like to deviate.

The intuitive criterion thus requires a perfect Bayesian
equilibrium to exhibit no off-the-equilibrium-path action
$a_1$ and no subset J of types such that $a_1$ is weakly
dominated for all types in J and condition 11.17 is satisfied.
(For finite games, a perfect Bayesian equilibrium that
does not satisfy the intuitive criterion is not a part of a
stable component in the sense of Kohlberg and Mertens
[1986].30 Thus, from the Kohlberg-Mertens existence result,
an equilibrium satisfying the intuitive criterion exists.)


11.6.2.2 Examples 
Example 1 


Consider the following game (Spence 1974): Player 1 (a
worker) chooses an education level e and then demands a
wage w from player 2 (a firm). Thus,


$A_1 = \{(e, w)\} \subset \mathbb{R}^2$.


29. See Banks and Sobel 1987 and Cho and Sobel 1987 for a similar approach
and McLennan 1985 for a different approach. Cho and Kreps 1987 contains a
very helpful discussion of the link between the various refinements and of their
connection with the "stability criterion" of Kohlberg and Mertens (1986). Our
version of the intuitive criterion follows Cho and Kreps.

See Cho 1986 for an analysis of ideas related to the elimination of
equilibrium-dominated strategies in more general games. See Farrell 1985,
Grossman and Perry 1986, and Okuno and Postlewaite 1986 for substantially
different refinements.

30. See Kreps 1984 and Cho and Kreps 1987. Cho and Sobel (1987) identify
simple conditions (including the single-crossing property and the property that
senders of all types have the same ordering concerning the receiver's response
to a given message) such that the signaling game has a unique stable equilibrium
(this equilibrium is then the one selected by the elimination of equilibrium-
dominated strategies).

L or H), where L > 0. The worker knows his
id the firm has prior beliefs $p_1(L) = \alpha$ and $p_1(H)$


be the mean or average productivity computed from the
prior beliefs. The posterior beliefs are denoted


$p_1(L \mid a_1) = \mu(a_1)$
and
$p_1(H \mid a_1) = 1 - \mu(a_1)$.


The worker invests $e \ge 0$ in education. For simplicity, we
assume that the worker's level of education has no
influence on his productivity. The cost of education to the
worker, however, depends on the worker's type. A more
productive worker learns at a lower cost (this condition
is often called the single crossing, sorting, or
Spence-Mirrlees condition). For instance, let us assume that
the cost of education is $e/t_1$. The payoff for player 1 with
type $t_1$ receiving wage w is


$\Pi^1 = w - e/t_1$.


The indifference curves for the two types are represented
in figure 11.13. Type L's indifference curve is steeper than
type H's because a given increase in education is more
costly to type L and, therefore, requires a higher wage
raise to keep this type at the same utility level.

The firm accepts the offer only if $w \le E(t_1 \mid a_1)$. Any
$w \le L$ is always accepted for any level of education (in
particular, 0), and any w > H is always refused.

First we will consider the perfect Bayesian equilibria of
this game. We will look at the two potential types of
pure-strategy equilibria (separating and pooling), and we
will examine whether or not these equilibria satisfy the
two criteria discussed above.


Figure 11.13


Separating equilibrium  The two types of workers choose
two different levels of education. Type $t_1$ receives wage
$t_1$.31 The low-productivity type, L, necessarily chooses
e(L) = 0. (If he invested e > 0, his utility, equal to
L - e/L, would be lower than that obtained for no education,
which is at least L.) The high-productivity type chooses
education e > 0. Let us define the levels of education
0 < s < r by


$L = H - s/L$
and
$L = H - r/H$.


In words, the low-productivity type is indifferent between
not investing in education and being recognized (getting
wage L) and investing s in education and being taken for a
high-productivity type (able to demand wage H). And the
high-productivity type would not want to invest more than r
to be recognized (while he could get a wage at least equal
to L by not investing). Clearly, a separating PBE has
education level e in the interval [s, r], because it must
satisfy the incentive-compatibility constraints
$L \ge H - e/L$ and $H - e/H \ge L$. Conversely, any e in
[s, r] is part of a PBE. It suffices to specify that for
out-of-equilibrium education levels e' not in {0, e}, the
firm puts all the weight on the low-productivity type:
$\mu(e') = 1$. Thus, the worker with education e' cannot
demand more than L, and, as is easily verified, type L
chooses education 0 and type H chooses education e. So
we have a continuum of separating equilibria.

31. One could also think of equilibria in which $E(t_1 \mid a_1) > w(a_1)$; however, such
equilibria actually do not satisfy the refinement criteria.

However, only one separating equilibrium survives once
we eliminate weakly dominated strategies when forming
out-of-equilibrium beliefs. For this purpose, notice that
any e strictly greater than s is dominated by education
level 0 for type L. (By investing 0, the low-productivity
worker gets at least L; by investing e, he gets at most
$H - e/L < L$.) In particular, any e in (s, r] should lead to
$\mu(e) = 0$ (so that w = H can be demanded). Thus, to be
recognized, type H need not invest more than s, and we
are left with a unique separating equilibrium, in which the
high-productivity type invests at the "least-cost separating
equilibrium level" s. (This equilibrium also satisfies
the intuitive criterion.)
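
The two thresholds defined by L = H - s/L and L = H - r/H have closed forms, s = L(H - L) and r = H(H - L). The following minimal sketch computes them and checks the dominance argument for the low type; the values L = 1 and H = 2 are hypothetical, since the text gives no numbers for this example.

```python
def spence_thresholds(L, H):
    """Return (s, r) solving L = H - s/L and L = H - r/H."""
    s = L * (H - L)
    r = H * (H - L)
    return s, r

def dominated_for_low_type(e, L, H):
    """Education e is dominated by education 0 for type L when even the
    best wage H leaves him strictly below L, the least he gets with e = 0."""
    return H - e / L < L

L_low, H_high = 1.0, 2.0           # hypothetical productivities
s, r = spence_thresholds(L_low, H_high)
print(s, r)
print(dominated_for_low_type(1.5, L_low, H_high))
```

Any education level in [s, r] supports a separating PBE, but only levels strictly above s are dominated for type L, which is why the refinement pushes type H down to s.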


Pooling equilibria  There also are a lot of pooling PBEs.
Suppose that both types choose education level e. The
corresponding wage that can be demanded is M. To
enforce such an equilibrium, one is best off choosing
out-of-equilibrium beliefs $\mu(e') = 1$ for $e' \neq e$ (so that the


$M - e/H > L$. Hence, any education level e satisfying
$M - e/L \ge L$ defines a pooling equilibrium. Thus, we


However, if the firm is convinced by the high-productivity type's speech
low-productivity type would never deviate to point B, it should also


.. Hence, it is not clear that it can be common knowledge that a deviation
signals the high-productivity type. (This argument was offered by Joseph
.) See Cho and Kreps 1987 for a discussion of this point.


For an alternative method of selecting this equilibrium, see Riley 1979.

A = {e, w}. Suppose that player 1 deviates and chooses


$B = \{e + \delta e, w + \delta w\}$.


Point B involves more education, and a wage raise that
more than offsets the increase in education for the
high-productivity type but not for the low-productivity one.
Choosing B is thus equilibrium-dominated for type L (but
not for type H). So, after point B, the firm should form
beliefs $\mu = 0$ and expect profit $H - (w + \delta w)$. But
$w \le M < H$, so for $\delta w$ small the firm should accept
offer B. And, therefore, type H should pick B rather than A.32
Hence, the pooling equilibrium at A does not satisfy the
intuitive criterion. (Actually, the intuitive criterion does
not always eliminate all pooling equilibria. A counterexample
is another signaling game, the limit-pricing game; see
Fudenberg and Tirole 1986c. Other examples can be built in
more complicated games, such as bargaining games.)

Hence, in this game the intuitive criterion selects a
unique pure-strategy equilibrium. The intuitive criterion
also disposes of the mixed-strategy (or hybrid or
semi-separating) equilibria.34,35

We shall now develop an industrial-organization exam- 
ple with a very similar structure. The treatment closely 
follows that of the job-market signaling example. 


Example 2 


Consider the Cournot game with asymmetric information
used to illustrate the notion of Bayesian equilibrium in the
text, but played in a sequential rather than simultaneous
manner (Gal-Or 1987). There are two firms (i = 1, 2).
Firm or player 1 chooses output $a_1 = q_1$. Firm or player 2
chooses output $a_2 = q_2$ after observing $q_1$. Payoffs are


$\Pi^i = [t_1 - (q_1 + q_2)]\, q_i$,


34. The equilibrium chosen by the intuitive criterion or the stability criterion is
discontinuous in the firm's beliefs about the worker. When $\alpha = 0$ (that is, when
it is common knowledge that the worker is of type H), the worker does not
invest in education and obtains wage H. When $\alpha$ is strictly positive, the type-H
worker still gets wage H but must invest e = s in education (where s is constant
in $\alpha$ and therefore does not converge to 0 with $\alpha$). This discontinuity is a general
property of the Spencian signaling model. For more on this issue, see Fudenberg
et al. 1986.


35. To see this, begin by duplicating the reasoning used above for pooling
equilibria.

where $t_1$ can be thought of as the intercept of the linear
demand curve (minus a common unit cost, say). If $t_1$ is
common knowledge (the full-information case), we know
that firm 2 has reaction function

$q_2 = R_2(q_1) = (t_1 - q_1)/2$.

This leads to $q_1 = t_1/2$ and $q_2 = t_1/4$. Profits are
$\Pi^1 = (t_1)^2/8$ and $\Pi^2 = (t_1)^2/16$.
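
The full-information outcome can be checked numerically. The sketch below maximizes the leader's profit over a grid, taking the follower's reaction function as given; the grid search and the function names are illustrative choices, and the value $t_1 = 4$ is just a convenient test value.

```python
def follower_reaction(q1, t1):
    """Firm 2's best response R_2(q1) = (t1 - q1) / 2 under full information."""
    return (t1 - q1) / 2

def leader_profit(q1, t1):
    """Firm 1's profit when firm 2 reacts optimally to q1."""
    return q1 * (t1 - q1 - follower_reaction(q1, t1))

def stackelberg(t1, grid=10001):
    """Grid search over q1 in [0, t1]; returns (q1, q2, profit1, profit2)."""
    candidates = [t1 * k / (grid - 1) for k in range(grid)]
    q1 = max(candidates, key=lambda q: leader_profit(q, t1))
    q2 = follower_reaction(q1, t1)
    return q1, q2, leader_profit(q1, t1), q2 * (t1 - q1 - q2)

q1, q2, pi1, pi2 = stackelberg(4.0)
print(q1, q2, pi1, pi2)
```

For $t_1 = 4$ the search should land on $q_1 = t_1/2$, $q_2 = t_1/4$, with profits $t_1^2/8$ and $t_1^2/16$.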

Now assume that firm 1 (the incumbent) has superior
information about demand. Before choosing $q_1$, it learns
$t_1$. This parameter or type can take the value L or H, with
0 < L < H. Firm 2 has prior probabilities $p_1(L) = \alpha$ and
$p_1(H) = 1 - \alpha$. Before choosing $q_2$, it observes only $q_1$
and updates its beliefs:


$p_1(L \mid q_1) = \mu(q_1)$,
$p_1(H \mid q_1) = 1 - \mu(q_1)$.


Clearly, firm 2 reacts optimally given its posterior
beliefs, so it maximizes


$q_2 \{\mu(q_1) L + [1 - \mu(q_1)] H - q_1 - q_2\}$.

The optimal reaction,

$q_2 = R_2(q_1) = \{\mu(q_1) L + [1 - \mu(q_1)] H - q_1\}/2$,

grows with firm 2's beliefs that demand is high. Hence,
firm 1 has an incentive to convince firm 2 that demand is
low.
Let us first derive the "monotonicity property" from
the incentive-compatibility conditions. Roughly, firm 1
chooses a higher quantity when demand is high.36 Let $q_1$
and $q_1'$ denote optimal actions for types L and H,
respectively. (We allow the possibility that several such
actions exist.) Optimality requires that

$q_1(L - q_1 - R_2(q_1)) \ge q_1'(L - q_1' - R_2(q_1'))$
and

$q_1'(H - q_1' - R_2(q_1')) \ge q_1(H - q_1 - R_2(q_1))$.
Adding these two inequalities yields

$(q_1' - q_1)(H - L) \ge 0$,


which yields the desired monotonicity.

We now look for (pure strategy) separating and pooling
equilibria. We will impose a number of conditions. These
conditions are met for, e.g., H = 4, L = 3, and $\alpha = 0.8$
(these numerical values simplify computations).


Separating equilibria  In a separating equilibrium, the firm's
type is revealed by its output. Type H, therefore, plays its
full-information quantity H/2.37 The incentive-compatibility
constraints require that type $t_1$ not want to choose the
quantity chosen by type $t_1' \neq t_1$. Clearly, type L does
not want to choose H/2. First, H/2 is not profit maximizing
for type L under full information about L; second, playing
H/2 conveys the information that demand is high, and
leads to a greater output for firm 2 than under full
information. The relevant incentive constraint is thus that
the high-demand type not want to play the low-demand
type's quantity $q_1$; that is,


$\frac{H^2}{8} \ge q_1\left(H - q_1 - \frac{L - q_1}{2}\right)$.   (11.18)


To make things more interesting, let us assume that
condition 11.18 is violated at type L's full-information
output ($q_1 = L/2$):


$\frac{H^2}{8} < \frac{L}{2}\left(H - \frac{L}{2} - \frac{L}{4}\right)$.   (11.19)
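
Constraint 11.18 is a quadratic inequality in $q_1$, so the outputs that type H would not want to mimic can be found from its two roots. The sketch below is a numerical check with H = 4 and L = 3, the values suggested in the text; the function names are hypothetical.

```python
import math

def mimic_profit(q1, H, L):
    """Type H's profit from playing q1 when firm 2 then believes demand is L,
    so that firm 2 produces (L - q1) / 2."""
    return q1 * (H - q1 - (L - q1) / 2)

def separating_bounds(H, L):
    """Roots of H^2/8 = q1 * (H - q1 - (L - q1)/2).
    Constraint 11.18 holds for q1 below the small root or above the large one."""
    b = H - L / 2
    disc = math.sqrt(b * b - H * H / 4)
    return b - disc, b + disc

H, L = 4.0, 3.0
low_root, high_root = separating_bounds(H, L)
print(low_root, high_root)
print(mimic_profit(L / 2, H, L), H ** 2 / 8)
```

With these values the smaller root is 1, which agrees with the value s = 1 quoted for H = 4 and L = 3 in the answer to exercise 11.13, and type L's full-information output L/2 = 1.5 violates 11.18, as condition 11.19 requires.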


36. This condition, which can be derived in a much larger class of games
(including the previous job-market signaling game), stems from the Spence-
Mirrlees condition, which can be written here as


2) 
et, \eqy y 


37. If type H were to play $q_1 \neq H/2$ in equilibrium, its profit would be

$q_1\left(H - q_1 - \frac{H - q_1}{2}\right) < \frac{H}{2}\left(H - \frac{H}{2} - \frac{H - H/2}{2}\right) \le \frac{H}{2}\left(H - \frac{H}{2} - R_2(H/2)\right)$,


where the first inequality comes from the full-information maximization and the
second from the fact that the expectation of $t_1$ conditional on firm 1's playing
H/2 cannot exceed H.

It is easily checked that 11.19 is satisfied if L is sufficiently
close to H. (The intuition is that at $L \approx H$ the change in
output by H to claim he is type L has only a direct
second-order effect on his profit but yields a first-order
firm 2's output and, therefore, an indirect 
-ase in firm 1's profit.) Inequality 11.19, the 


lieves that demand is high. From 11.20, type L 
laying q,. From the definition of H/2, type H 
laying H/2. Thus, there is a continuum of sepa- 


prefers 
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M—4a\_ _M_4a@ 
a(t ar )=a(s ae} ig 


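The best way to sustain $q_1$ as a pooling output of a PBE
is to assume that firm 2 believes that demand is high
when it observes $q_1' \neq q_1$. So $q_1$ will indeed be a pooling
equilibrium output if and only if


$q_1\left(L - q_1 - \frac{M - q_1}{2}\right) \ge \max_{q_1'} \left[ q_1'\left(L - q_1' - \frac{H - q_1'}{2}\right) \right]$   (11.21)

and

$q_1\left(H - q_1 - \frac{M - q_1}{2}\right) \ge \max_{q_1'} \left[ q_1'\left(H - q_1' - \frac{H - q_1'}{2}\right) \right] = H^2/8$.   (11.22)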


As is easily checked, 11.22 defines an interval of allowable
$q_1$ that includes H/2. Inequality 11.21 also defines
an interval located to the right of r. Indeed, given our
numerical values (H = 4, L = 3, $\alpha = 0.8$), this interval
also contains H/2. Thus, the set of pooling outputs is an
interval containing H/2.
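
The pooling interval can be computed for the numerical values H = 4, L = 3, $\alpha = 0.8$. The sketch below assumes that M is the prior mean $\alpha L + (1 - \alpha) H$ and that each type's best deviation payoff, when firm 2 then believes demand is high, is $(t_1 - H/2)^2/2$ (which requires $t_1 > H/2$); both are assumptions made here for illustration, and the function names are hypothetical.

```python
import math

def pooling_interval(t1, M, deviation_payoff):
    """Outputs q with q*(t1 - q - (M - q)/2) >= deviation_payoff,
    i.e. q^2 - (2*t1 - M)*q + 2*deviation_payoff <= 0."""
    b = 2 * t1 - M
    disc = b * b - 8 * deviation_payoff
    if disc < 0:
        return None
    root = math.sqrt(disc)
    return (b - root) / 2, (b + root) / 2

def pooling_outputs(H, L, alpha):
    """Intersection of the intervals for type L and type H."""
    M = alpha * L + (1 - alpha) * H      # assumed: prior mean of demand
    dev_L = (L - H / 2) ** 2 / 2         # best deviation for type L (needs L > H/2)
    dev_H = H ** 2 / 8                   # best deviation for type H
    int_L = pooling_interval(L, M, dev_L)
    int_H = pooling_interval(H, M, dev_H)
    if int_L is None or int_H is None:
        return None
    lo, hi = max(int_L[0], int_H[0]), min(int_L[1], int_H[1])
    return (lo, hi) if lo <= hi else None

lo, hi = pooling_outputs(4.0, 3.0, 0.8)
print(lo, hi)
```

Under these assumptions the interval is roughly [1.07, 2.38], which contains H/2 = 2 as the text states.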

To eliminate this continuum of pooling equilibria, we
can use the intuitive criterion. Let $q_1$ be a pooling
equilibrium output. Define $q_1' < q_1$ by the smallest root of


$q_1'\left(H - q_1' - \frac{L - q_1'}{2}\right) = q_1\left(H - q_1 - \frac{M - q_1}{2}\right)$.   (11.23)


Now playing $q_1' - \varepsilon$ (for $\varepsilon$ positive and small) is
equilibrium dominated for type H, but not for type L. So firm
2's posterior beliefs should put all the weight on type L
following output $q_1' - \varepsilon$. But, from 11.23, type L prefers
playing $q_1' - \varepsilon$ to playing $q_1$. Thus, $q_1$ is not a pooling
output anymore.
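
The deviation argument can be traced numerically for a particular pooling output. The sketch below uses H = 4, L = 3, and a pooling output of 2, and assumes M = 3.2 (the prior mean when $\alpha = 0.8$); the helper names and the choice $\varepsilon = 0.001$ are illustrative.

```python
import math

def profit(q, t1, belief):
    """Firm 1's profit at output q and true demand t1 when firm 2 expects
    demand `belief` and so produces (belief - q) / 2."""
    return q * (t1 - q - (belief - q) / 2)

def smallest_root_1123(q_pool, H, L, M):
    """Smallest q' with profit(q', H, L) = profit(q_pool, H, M),
    i.e. the smallest root of q'^2 - (2H - L) q' + 2*target = 0."""
    target = profit(q_pool, H, M)
    b = 2 * H - L
    return (b - math.sqrt(b * b - 8 * target)) / 2

H, L, M = 4.0, 3.0, 3.2      # M assumed equal to 0.8*L + 0.2*H
q_pool, eps = 2.0, 1e-3
q_dev = smallest_root_1123(q_pool, H, L, M) - eps

# Type H cannot gain even under the most favorable belief (demand is L) ...
print(profit(q_dev, H, L), profit(q_pool, H, M))
# ... while type L gains once firm 2 believes demand is L.
print(profit(q_dev, L, L), profit(q_pool, L, M))
```

Since the belief that demand is low is the most favorable one for firm 1, checking type H against that belief is enough to establish equilibrium dominance in this sketch.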


Exercise 11.13**  Consider the Stackelberg game, except
that $t_1$ is continuously distributed on the interval [L, H]
instead of having only two atoms at L and H. Look
for a separating equilibrium. Type $t_1$ chooses output
$q_1 = Q_1(t_1)$, where $Q_1$ is strictly increasing and
differentiable; the inverse function of $Q_1$ is T. Thus,
$T(q_1)$ is the type that chooses output $q_1$.
(i) Show that T satisfies the differential equation


$q_1 T'(q_1) = T(q_1) - 2q_1$.


(ii) What is the boundary condition? Check that the


p+m(t2) 


Exercise 11.14****  We saw in chapter 4 that, in the ab-


retailer's profit through a lump-sum transfer (the franchise
fee). This exercise shows that when the manufacturer has
private information about the demand for his product, he
may want to charge more than the marginal cost (and
reduce the franchise fee) for signaling purposes. A
monopoly manufacturer with marginal cost c charges a
two-part tariff to a monopoly retailer: $T(q) = A + p_w q$, where
q is the quantity bought and resold by the retailer, $p_w$ is
the intermediate price, and A is the franchise fee. The final
demand for the product is $q = t_1 - p$, where p is the
consumer price chosen by the retailer. For simplicity, the
cost of retailing is zero. Player 1 (the manufacturer) moves
first and offers a contract $a_1 = \{A, p_w\}$. Player 2 (the
retailer) accepts or refuses the contract and, if he accepts
it, chooses a consumer price. So $a_2$ = {yes or no, p}. He
accepts the contract if and only if his expected profit is
non-negative.

(i) Rederive the result that under full information about
$t_1$ the equilibrium contract is $p_w = c$ and $A = (t_1 - c)^2/4$.

(ii) Suppose that only the manufacturer knows $t_1$. This
parameter (type) can take value L or H (0 < L < H). The
retailer learns $t_1$ after signing the contract but before
choosing p. Reanalyze examples 1 and 2 to show that
type L charges an intermediate price equal to c and type
H charges an intermediate price strictly exceeding c. (Show
intermediate price is a nondecreasing function
Look for separating and pooling equilibria. Use


Answers and Hints 


Exercise 11.1 


(i) There are three information sets. 
(ii) This is straightforward. 


Exercise 11.2 


(i) Let $b = \max_{j \neq i} b_j$. Varying $b_i$ in $(b, +\infty)$ has no
effect on bidder i's welfare; he gets the object and pays b
in any case. Similarly, varying $b_i$ in [0, b) has no effect on
his welfare. Suppose that $v_i > b$. Then bidding $b_i < b$
rather than $v_i$ reduces bidder i's welfare by
$(v_i - b) - 0 > 0$. If $v_i < b$, bidding $b_i > b$ reduces his
welfare by $0 - (v_i - b) > 0$. If $v_i = b$, bidder i is
indifferent between getting the object (at price b) and not
getting it. He may as well bid $b_i = v_i$.

(ii) Straightforward. 

(iii) The reasoning in answer i was independent of the
level of b, so bidding $b_i = v_i$ is optimal for any b.
Therefore, bidding the true value is always optimal (and a
dominant strategy) in a second-bid auction.
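
The dominance argument can be checked by brute force on a grid of valuations, own bids, and highest rival bids. This is a small illustrative sketch; the tie-breaking rule (a tie counts as a loss) is an assumption made here, and it does not matter for the result because a bidder whose value equals the price is indifferent.

```python
def payoff(bid, value, rival_max):
    """Second-bid auction: the winner pays the highest rival bid.
    A tie is treated as a loss (an assumption of this sketch)."""
    return value - rival_max if bid > rival_max else 0.0

def truthful_is_weakly_best(values, bids, rival_bids):
    """True if bidding one's value does at least as well as any other bid,
    for every valuation and every highest rival bid on the grid."""
    return all(payoff(v, v, b) >= payoff(x, v, b)
               for v in values for x in bids for b in rival_bids)

grid = [k / 10 for k in range(11)]
print(truthful_is_weakly_best(grid, grid, grid))
```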


Exercise 11.3 
(i) Social welfare is equal to
$\sum_i \Pi^i = \sum_i t_i + \sum_i g_i(a, \theta_i)$
$= -C(a) + \sum_i g_i(a, \theta_i) + \text{constant}$,


from the government's budget constraint.
(ii) With these transfers, and for any announcements 


(6,,.. 


by the other consumers, consumer i's payoff function is 
£(6,,6_.) + sila", 0-)), 0) 
=K, + ¥, g(a", _), 8) + g(a*(@,, 0_,), 0) 
ivi 
— C(a*(d,,0_,)). 


But from the definition of a* we have for all a, 


38. This exercise stems from discussions with Eric Maskin, 
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This mech 


, in general, there exists no dominant strategy 
that implements the first-best allocation and 


(ii) Suppose first that the three firms are not located at
the same point. A firm located alone at the left or the


its rivals. Thus, firms must be located identically and 
get one-third of the market each. However, by moving 
slightly either to the right or to the left of this common 
location, a firm can get approximately half of the market, 
or more. So no pure-strategy equilibrium exists. On the 
existence of a mixed-strategy equilibrium, see Dasgupta 
and Maskin 1986. 


Exercise 11.6 


Let $P_{-i} = \sum_{j \neq i} p_j$. Consumer i maximizes


$g(P_{-i} + p_i) - p_i$,
which yields
$g'(P_{-i} + p_i) = 1$.


This first-order condition is sufficient, and defines a unique
optimum as the objective function is strictly concave
and $g'(0) > 1$, $g'(+\infty) < 1$. The total contribution
($P = P_{-i} + p_i$) is thus given by


$g'(P) = 1$.


There is clearly too little public expenditure (due to the
free-rider problem). The optimal public expenditure, P*,
maximizes $\{n g(P) - P\}$, so that $g'(P^*) = 1/n$, implying
P* > P. Although the total expenditure P is determinate,
the individual ones are not. Any $\{p_1, \ldots, p_n\}$ such that
$p_1 + \cdots + p_n = P$ is an equilibrium.
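
The gap between the equilibrium and optimal levels can be illustrated with a specific benefit function. The sketch below assumes $g(P) = 2\sqrt{P}$, so that $g'(P) = P^{-1/2}$, which is strictly decreasing with $g'(0) > 1$ and $g'(+\infty) < 1$; this functional form, the number of consumers, and the bisection helper are all hypothetical choices.

```python
def solve_decreasing(f, target, lo=1e-9, hi=1e6, iters=200):
    """Bisection for f(P) = target when f is strictly decreasing on [lo, hi]."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if f(mid) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def g_prime(P):
    """Marginal benefit for the assumed g(P) = 2 * sqrt(P)."""
    return P ** -0.5

n = 4                                      # number of consumers (assumed)
P_eq = solve_decreasing(g_prime, 1.0)      # private provision: g'(P) = 1
P_opt = solve_decreasing(g_prime, 1.0 / n) # optimum: g'(P*) = 1/n
print(P_eq, P_opt)
```

With this g, private provision gives P = 1 while the optimum is $P^* = n^2 = 16$, so the shortfall grows quickly with the number of consumers.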

A multiplicity of equilibria similar to the one in this
exercise arises when the size of the public good is fixed
(for example, a project is implemented only if $P \ge \bar{P}$).
However, multiplicity does not arise in all models of
private provision of a public good.39


Exercise 11.7 


Let T = 2. Player 2 makes the last offer, so he offers
$x_2 = 0$, which is the lowest share accepted by player 1.
Player 2 thus gets 1. He will accept giving $x_1$ to player 1
in period 1 only if $1 - x_1 \ge \delta(1)$. Thus, player 1 offers
$x_1 = 1 - \delta$.
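
The same backward-induction step can be iterated for any finite horizon. In the sketch below, the proposer's share with k periods left is obtained from the recursion v(1) = 1, v(k) = 1 - δ·v(k-1): the responder must be offered the discounted value of what he would get as next period's proposer. The function name and the discount factor 0.9 are illustrative assumptions.

```python
def proposer_share(periods_left, delta):
    """Share kept by the player making the current offer when
    `periods_left` periods remain and both players discount by delta."""
    share = 1.0                      # last period: the proposer takes everything
    for _ in range(periods_left - 1):
        share = 1.0 - delta * share  # leave the responder delta * (his share as next proposer)
    return share

delta = 0.9
print(proposer_share(2, delta))      # T = 2
print(proposer_share(3, delta))      # T = 3
print(proposer_share(201, delta))    # long horizon
```

For T = 2 this reproduces 1 - δ, and for long horizons the share approaches 1/(1 + δ).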

Let T = 3. We know that if player 2 makes the offer in
period 2, he gets $1 - \delta$, because there are exactly two
periods left. So player 1 in period 1 must give him at least

fixed point. What happens if the consumers have different gis instead of 
different Cs? 
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.24 and 11.27 give 
6+ 6°V, = V, > 11 + 6), 


player 1 makes the offer at some date. Because 
gets at most V; in the next period (perfectness 


(11.26) 
(11.27) 


(11.28) 


(11.29) 
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Similarly, 
W, = W, = W, = 61 + 8). 


The same equations hold for player 2. Since valuations are 
unique, it is easy to see that the equilibrium strategies are 
also unique, and that they are those described in the text. 


Exercise 11.9 


(i) The game tree has three nodes, which are also
information sets. Use Kuhn's algorithm of backward induction:
The authority inspects if and only if it observes predation.
Hence, the firm does not prey in equilibrium.

(ii) The game tree is the same as in question i, except
that the two nodes following the firm's decision form a
single information set. If the firm preys in equilibrium, the
authority inspects and thus the firm ought not to prey. If
it does not prey in equilibrium, no inspection occurs, so
preying would be profitable. Thus, there exists no
pure-strategy equilibrium. Suppose that the firm preys with
probability x and the authority inspects with probability
y. Because the two parties must be indifferent between
their two pure strategies, x and y are given by


$x(s - c) - (1 - x)c = 0 \iff xs = c$

and

$-y(p - g) + (1 - y)g = 0 \iff yp = g$.

Exercise 11.10 

A bidder with valuation t maximizes

$(t - b) F^{n-1}(\Phi(b))$.

The first-order condition is thus, after simplification,


$(t - b)\Phi'(b) = \frac{F(\Phi(b))}{(n - 1) f(\Phi(b))}$.

This condition must be satisfied at $t = \Phi(b)$, which yields
a first-degree differential equation in $\Phi(b)$.
For $F(x) = x^{\alpha}$, one obtains $\Phi(b) = kb$, where


$k = 1 + \frac{1}{\alpha(n - 1)}$.
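
The closed form for k can be checked by a grid search over one bidder's bid when the rivals follow $\Phi(b) = kb$, i.e., bid their valuation divided by k. The sketch assumes valuations on [0, 1] with $F(x) = x^{\alpha}$; the parameter values and function names are illustrative.

```python
def expected_payoff(b, t, n, alpha, k):
    """(t - b) * F(Phi(b))**(n - 1) with F(x) = x**alpha on [0, 1] and Phi(b) = k*b."""
    win_prob = min(k * b, 1.0) ** (alpha * (n - 1))
    return (t - b) * win_prob

def best_bid(t, n, alpha, grid=20001):
    """Grid search for the payoff-maximizing bid in [0, t]; returns (bid, k)."""
    k = 1 + 1 / (alpha * (n - 1))
    bids = [t * j / (grid - 1) for j in range(grid)]
    return max(bids, key=lambda b: expected_payoff(b, t, n, alpha, k)), k

bid, k = best_bid(t=0.6, n=3, alpha=1.0)
print(bid, 0.6 / k)
```

With three bidders and uniform valuations (alpha = 1), k = 1.5 and the best reply to the conjectured strategy is again t/k, which is the fixed-point property the differential equation expresses.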


The bid tends to the true valuation (i.e., k tends to 1)
when the number of bidders tends to infinity.

This is a much more general result; see Milgrom 1981b.
(iii) Suppose that $\Phi(b) = kb$. Writing the first-order
condition and imposing $\Phi(b_i) = x_i$, one obtains
$k = 2 - 1/k$, or k = 1. Each bidder bids his private information.


Exercise 11.12


(i) Solve by backward induction. If the single entrant
enters, the firm is better off acquiescing. Hence, the en-


3> 0. Can the soft incumbent prey with 
1? Then after predation at date n — 2 one has 
. Recall that the entrant stays out at n — 1 if 


$x_{n-1} > 1/2$. So if $x_{n-2} > 1/2$, the incumbent preys with
probability 1. If $x_{n-2} < 1/2$, the soft incumbent must
randomize between preying and acquiescing, so that
$x_{n-1} = 1/2$. Let $y_{n-2}$ denote the probability of predation
by the soft incumbent. From Bayes' rule,


$\frac{1}{2} = x_{n-1} = \frac{x_{n-2}}{(1 - x_{n-2})\, y_{n-2} + x_{n-2}}$.


The entrant at date n - 2 is willing to enter if
$(-1)[x_{n-2} + (1 - x_{n-2})\, y_{n-2}] + (1)[(1 - x_{n-2})(1 - y_{n-2})] \ge 0$.


Using the previous Bayesian updating rule, this inequality
can be written as


$x_{n-2} \le 1/4$.


(For $x_{n-2} = 1/4$, the entrant randomizes.) More generally,
entrant 1 stays out if $x > 1/2^{n-1}$ and enters if $x < 1/2^{n-1}$.


Exercise 11.13 
(i) Firm 2 reacts to $q_1$ by


$q_2 = \frac{T(q_1) - q_1}{2}$,


since $q_1$ reveals that firm 1's type is $T(q_1)$. Hence, type $t_1$
maximizes


$q_1\left(t_1 - q_1 - \frac{T(q_1) - q_1}{2}\right)$.


The first-order condition with respect to $q_1$ must be
satisfied at type $t_1 = T(q_1)$ for firm 2 to have rational
expectations. This yields the differential equation. It is
easily checked (using the first-order condition as an
identity) that the second-order condition is satisfied.

(ii) Type H might as well choose its full-information
output, H/2; so T(H/2) = H.

(iii) For instance, for H = 4 and L = 3, s = 1 and
$T(1) = 2 + 2 \ln 2 > L$. With lots of types, there are more
incentive constraints than with only two types: Type H


40. To prove this formally, take two values $x_1$ and $x_2$ and write the "incentive-
compatibility conditions": The optimal bid for $x_1$ yields at least as much payoff
as the optimal bid for $x_2$ when the information is $x_1$; and conversely. This is the
method of proof that we used in chapter 1 to show that a monopoly price is a
nondecreasing function of marginal cost.


Chapter 11 


must not play type $(H - \varepsilon)$'s output, the latter type must
not play type $(H - 2\varepsilon)$'s output, and so on. Type L
needs an even lower output to separate from type H, for
example.
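
The differential equation $q_1 T'(q_1) = T(q_1) - 2q_1$ with boundary condition T(H/2) = H has the closed-form solution $T(q_1) = q_1[2 + 2\ln(H/(2q_1))]$. This formula is derived here rather than quoted from the answer; the sketch below checks it against the equation by finite differences and against the value $T(1) = 2 + 2\ln 2$ for H = 4.

```python
import math

def T(q, H):
    """Candidate solution of q*T'(q) = T(q) - 2q with T(H/2) = H (derived for this sketch)."""
    return q * (2 + 2 * math.log(H / (2 * q)))

def ode_residual(q, H, h=1e-6):
    """q*T'(q) - (T(q) - 2q), with T' approximated by a central difference."""
    dT = (T(q + h, H) - T(q - h, H)) / (2 * h)
    return q * dT - (T(q, H) - 2 * q)

H = 4.0
print(T(H / 2, H))                     # boundary condition
print(T(1.0, H), 2 + 2 * math.log(2))  # value at output 1
print(ode_residual(1.3, H))            # should be close to 0
```

Since T(1) exceeds L = 3 and T is increasing below H/2, the output at which type L separates must lie below 1, in line with the remark that type L needs an even lower output than in the two-type case.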
Exercise 11.14 

(i) The retailer maximizes

$(p - p_w)(t_1 - p) - A$.


The manufacturer then maximizes

$A(p_w) + (p_w - c)(t_1 - p_w)/2$.


(ii) To obtain the monotonicity, write the two
incentive-compatibility constraints. In equilibrium, the
low-demand type is revealed, so efficiency requires that
$p_w = c$. But the high-demand type "proves" that demand is
high by choosing $p_w > c$. This is because a positive margin is
more important when demand is high than when demand
is low. So the low-demand type is less tempted to forgo
the reduction in the franchise fee required to claim that
demand is high in exchange for a positive margin.

To eliminate the pooling equilibria using the intuitive
criterion, start from a pooling contract $\{p_w, A\}$ and
consider a new contract with a slightly higher intermediate
price and a slightly lower franchise fee, which type H (but
not type L) prefers to the pooling contract.


Review Exercises


Some of the following exercises merely review or apply
ideas introduced in the text; others use techniques
developed there to analyze new and interesting questions.

The bracketed numbers refer to the most relevant
chapter; however, the reader should feel free to use ideas
developed in other chapters or not developed at all in the
present book. (The chapter on the theory of the firm is
referred to as [0], and an exercise that appeals to many of
the chapters does not refer to any particular one.)

Some of these exercises are taken from MIT problem
sets and examinations prepared by J. Harris, P. Joskow,
G. Saloner, R. Schmalensee, and the author.


Exercise 1 [0]**


(i) Define “relationship-specific investment” or “trans- 
action-specific investment” (terms used by Williamson, 
Klein, Crawford and Alchian, and others). 

(ii) In a simple two-period model, show that if an input
supplier who must make relationship-specific investments
ex ante cannot contract on the price at which transactions
will take place ex post, the result will be underinvestment
in relationship-specific capital.

(iii) How might the above result change if both the
buyer and the seller were required to make relationship-
specific investments ex ante?

(iv) Several alternative explanations for why a firm
might find it advantageous to integrate forward or back-
ward, or to rely on complex contractual arrangements to
mediate exchange between levels of the production pro-
cess, have been proposed. Discuss one or more alter-
natives to the “relationship-specific investment” rationale
for vertical integration. What type of empirical evidence
would you look for, and how would you use it to
distinguish between competing explanations of vertical
integration?


demand q = 1 — p, where p is the final price. The retail 
cost is 0, and the manufacturer's cost is C(q) = q^2/2.

(i) What is the aggregate profit under vertical inte- 
gration? 

(ii) What are the profits Π^m and Π^r (of the manufacturer and the
retailer) under the optimal linear tariff, T(q) = p_w q, for the manufacturer?

(iii) What are these profits under a two-part tariff? 


Suppose now that before the manufacturer chooses the
tariff, the retailer can choose to invest to increase demand.
At cost ε (where ε is small and positive), the demand is
increased from q = 1 − p to q = 2 − p. (At cost 0, the
demand remains q = 1 − p.) The investment choice is
observed by the manufacturer, who then chooses a tariff 
(so the manufacturer cannot commit before the retailer's 
investment). 

(iv) What is the level of investment under a linear tariff? 

(v) What is the level of investment under a two-part 
tariff? Does the manufacturer make more profit than 
under a linear tariff? Explain. 
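
Questions i to iii can be checked numerically. The following is only a sketch, not part of the original exercise: it assumes final demand q = 1 − p, zero retail cost, manufacturing cost C(q) = q^2/2, a retailer who takes the wholesale price p_w as given and sets the monopoly retail price, and a manufacturer who makes a take-it-or-leave-it offer to a retailer whose outside option is zero. The function names are illustrative.

```python
from fractions import Fraction as F


def integrated_profit():
    # The integrated firm maximizes (1 - q) q - q^2 / 2.
    # First-order condition: 1 - 3q = 0, so q = 1/3.
    q = F(1, 3)
    return (1 - q) * q - q * q / 2


def linear_tariff_profits():
    # Facing p_w, the retailer sells q = (1 - p_w) / 2, i.e. p_w = 1 - 2q.
    # The manufacturer maximizes (1 - 2q) q - q^2 / 2; FOC: 1 - 5q = 0.
    q = F(1, 5)
    p_w = 1 - 2 * q
    retail_price = 1 - q
    manufacturer = p_w * q - q * q / 2
    retailer = (retail_price - p_w) * q
    return manufacturer, retailer


def two_part_tariff_profits():
    # A marginal price equal to marginal cost (p_w = C'(q) = q) makes the
    # retailer choose the integrated output: 1 - 2q = q gives q = 1/3.
    # The fixed fee then extracts the retailer's variable profit.
    q = F(1, 3)
    p_w = q
    retailer_variable_profit = (1 - q - p_w) * q
    fee = retailer_variable_profit
    manufacturer = fee + p_w * q - q * q / 2
    retailer = retailer_variable_profit - fee
    return manufacturer, retailer
```

Under these assumptions the linear tariff leaves aggregate profit below the integrated level (double marginalization), while the two-part tariff restores it and gives it all to the manufacturer.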


Exercise 3 [1]** 


Consider a monopolist facing a linear demand curve q 
= a − bp and producing at marginal cost c.

(i) Show that the elasticity of demand (defined as a 
positive number) is an increasing function of b. 

(ii) Compute the welfare loss due to monopoly pricing. 
How does the relative welfare loss (dead-weight loss over 
welfare under marginal cost pricing) vary with b? 
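
A sketch of the computation behind question ii, assuming a > bc so that output is positive. The superscripts m (monopoly) and c (marginal-cost pricing) and the symbols DWL and W^c are notation introduced here for illustration.

```latex
\begin{align*}
p^m &= \frac{a + bc}{2b}, \qquad q^m = \frac{a - bc}{2}, \qquad q^c = a - bc, \\
\mathrm{DWL} &= \tfrac12\,(p^m - c)\,(q^c - q^m)
  = \tfrac12 \cdot \frac{a - bc}{2b} \cdot \frac{a - bc}{2}
  = \frac{(a - bc)^2}{8b}, \\
W^c &= \tfrac12 \left(\frac{a}{b} - c\right)(a - bc) = \frac{(a - bc)^2}{2b}, \\
\frac{\mathrm{DWL}}{W^c} &= \frac14 .
\end{align*}
```

On this reading the absolute loss depends on b, but the relative loss is one-fourth whatever the value of b.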


Exercise 4 [1]** 


A monopolist has constant marginal cost c and faces a
twice-differentiable demand of q = D(p, d), where d is a
demand parameter (∂D/∂d > 0). Let p^m(d) denote the (a?)
monopoly price for parameter d.

(i) Show that a sufficient condition for p^m to be a
nondecreasing function of d is that ∂²D/∂p∂d > 0. First
prove this by assuming that the monopolist’s profit is
concave in price. Then prove it more generally. (The
second-order condition may not be satisfied, so the
monopoly price may not be unique for a given demand
parameter. Show that any monopoly price for d’ does not
exceed any monopoly price for d when d’ < d. Use the
reasoning that was used in chapter 1 to prove that the
monopoly price increases with marginal cost.)

(ii) Discuss the sufficient condition. Give a simple de-
mand curve for which it is not satisfied and for which the
monopoly price does not depend on d.


Exercise 5 [1]*


Consider a monopolist facing demand q = 1 − p in each
of two periods. Its marginal cost is equal to c in the first
period and c − λq_1 in the second period (where λ
is positive and “small”). Compute the optimal pricing
strategy for the monopolist. Compute the Lerner index
in each period and comment. Does output increase or
decrease over time?


Exercise 6 [1, 2, 3]**


In his paper “Demand Uncertainty and Sales” (mimeo,
Graduate School of Business, University of Chicago),
Peter Pashigian lists a number of different types of sales
and some explanations for these sales. Consider pre-
season sales (e.g., August sales of winter clothes), white
sales, ei ant random sales, and clearance sales. 

Think of potential explanations (including ones not
offered in the text) for these sales (uncertainty, informa-
tion acquisition about consumers’ tastes, peak-load pric-
ing, various forms of price discrimination, etc.), and give
a critical assessment of these explanations.


Exercise 7 [2, 3, 7]*


(i) A private railway company considers building a rail-
road between cities A and B. No such service commonly
exists; there may be alternative ways of moving passen-
gers and freight (e.g., cars, trucks, and barges), but these
are assumed to be competitive if they exist. Using chapter
2, compare the private company’s incentive to build the
railroad with the social planner’s.

(ii) Would you amend your conclusion if there existed
an alternative and differentiated transportation mode run
by the railway company (see chapter 2
or 3) or another company (see chapter 7)?


Exercise 8 [2, 5]**


In The Economics of Regulation (Wiley, 1971), Alfred Kahn 
argues that a “prominent imperfection that may make 
unrestricted competition particularly injurious to con- 
sumers is their own limited ability to judge the quality 
of products and hence to keep it at acceptable levels even 
when they have a wide range of competitive suppliers 
to choose from.” “Consumer protection,” he notes, “can 
be equally necessary when price competition is very in- 
tense. The decline in price to average variable costs can 
lead to a skimping on safety, reliability and frequency of 
service that consumers may have difficulty in detecting 
promptly...” (volume 2, p. 176) This exercise builds an 
example in which price competition lowers incentives to 
maintain a reputation (and may lower welfare). 

There are two periods (t = 1, 2) and a single good.
Consumers are all identical and have unit demands. The
valuation for the good is v + s_t, where s_t is the quality of
the good: high (s_t = 1) or low (s_t = 0). The extra cost for
a firm to supply the high quality is c’. The production cost
of the good is c. The quality cannot be ascertained at the
date of purchase, and no warranty is feasible. However,
all parties observe the date-t quality at the end of date t.

A firm can be of two types: With probability 1 − x,
the extra cost of supplying quality is c’ > 0. With prob-
ability x, the low quality is not cheaper to produce (or
else, the firm is “honest”). A firm knows its type, but the
consumer (or the firm's rival, if any) does not. Assume
that δx > c’ (where δ is the discount factor).

(i) Consider a monopoly situation. Show that the fol-
lowing is an equilibrium path: The firm, whatever its type,
charges p_1 = v + 1 and produces the high quality in the
first period. It charges p_2 = v + x in period 2, and it
produces the high quality in that period only if this is
cheaper. Show that the expected welfare is equal to


(v − c)(1 + δ) + x(1 + δ) + (1 − x)(1 − c’).


(ii) Consider a duopoly situation. Show that, for any
price charged by the firms in the first period, it cannot be
the case that both firms supply the high quality with
probability 1 in that period. (Hint: What would Bertrand
competition imply in period 2?)

Look for a symmetric equilibrium. Show that, if c’/2
> δ(1 − x), each firm produces the low quality in the
first period if it is cheaper to do so. Compute the welfare,
and show that if the above two conditions are satisfied,
and if xδ < 1 − c’, welfare is lower in duopoly than in
monopoly.
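
The monopoly welfare expression in question i can be checked period by period. This is a sketch that assumes the path described there: both types supply the high quality in period 1, and in period 2 only the type for which quality is costless does so. The labels W_1 and W_2 for per-period expected welfare are introduced here for illustration.

```latex
\begin{align*}
W_1 &= v + 1 - c - (1 - x)\,c', \\
W_2 &= x\,(v + 1 - c) + (1 - x)\,(v - c) = v - c + x, \\
W_1 + \delta W_2 &= (v - c)(1 + \delta) + 1 + \delta x - (1 - x)\,c' \\
                 &= (v - c)(1 + \delta) + x(1 + \delta) + (1 - x)(1 - c').
\end{align*}
```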

Exercise 9 [2, 5, 7]*

There are two goods. The demand for good 1 is

q_1 = a − b p_1 + d p_2


and that for good 2 is


q_2 = a − b p_2 + d p_1,


where a and b are strictly positive and |d| < b. The pro-
duction cost of each good is 0.

(i) Are the goods differentiated?

(ii) Suppose that the two goods are produced by the
same firm (a monopoly). Compute the optimal prices.
Compare the Lerner index and the inverse of the elasticity
of demand for each good. Comment.

(iii) Are the goods strategic complements, or sub-
stitutes?

(iv) The firms choose prices simultaneously, taking
their rival's price as given. Compute the Nash-equilibrium
prices, and compare them with the results in question ii.
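
The comparison asked for in question iv can be sketched as follows. The code assumes the symmetric demands q_i = a − b p_i + d p_j with zero production cost and looks only at symmetric interior solutions; the function names are illustrative, not from the text.

```python
from fractions import Fraction as F


def monopoly_price(a, b, d):
    # Joint profit is p1*q1 + p2*q2. Its derivative with respect to p1 is
    # a - 2*b*p1 + 2*d*p2, so the symmetric optimum solves a - 2(b - d)p = 0.
    return F(a) / (2 * (F(b) - F(d)))


def nash_price(a, b, d):
    # Firm i maximizes p_i * (a - b*p_i + d*p_j) taking p_j as given.
    # FOC: a - 2*b*p_i + d*p_j = 0, so the symmetric equilibrium solves
    # a - (2b - d)p = 0.
    return F(a) / (2 * F(b) - F(d))
```

For example, with a = 4 and b = 2, taking d = 1 (substitutes) puts the monopoly price above the Nash price, and taking d = −1 (complements) reverses the ranking.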


Exercise 10 [3]*


Assume that a monopoly supplier of a good makes sales
to customers located in different regions of the country.
The demand functions for the good in each region are


a=IPn 

and 

G2 = 27 Pe- 

Assume in what follows that production and transport
costs are zero.

(i) Assuming that the monopolist must charge a uni-
form (linear) price to the two regions, calculate the profit-
maximizing uniform price.

(ii) Assume that the monopolist can engage in ...

(iii) Does third-degree price discrimination increase or
... producers’ surplus, in this case? Is this a general result
... compares uniform monopoly prices with third-
degree price discrimination?


(iv) Assume that the monopolist is selling an inter-
mediate product, and that the demand functions above
are the derived demands for the intermediate product of
two competitive downstream industries. If third-degree
price discrimination is not possible, show that the monop-
olist can achieve the same result by integrating forward
into one of the downstream markets. Which one will he
integrate into, and why?


Exercise 11 [3]*


There are two goods: the “basic good,” produced by a
monopolist at no cost, and the “complementary good”
(services), produced and sold by a competitive industry at
cost c per unit. Each consumer can consume (a) nothing,
or (b) one unit of the basic good, or (c) one unit of the
basic good and one unit of the complementary good. The
consumer then has net surplus (a) 0, or (b) v − p, or
(c) w − p − c, where w > v > 0 and p is the price of the
basic good.

There are two types of consumers: type-1 (“low-
demand”) consumers, who have valuations v_1 and w_1
for the basic good and the bundle, and type-2 (“high-
demand”) consumers, who have valuations v_2 > v_1 and
w_2. Let (1 − α) and α denote the respective proportions
of consumers. Assume that


w_2 − v_2 > c > w_1 − v_1

and

α(w_2 − c) < v_1.


In answering the following, indicate when you recognize 
an instance of first-, second-, or third-degree discrimina- 
tion. (Up to question v, the monopolist cannot tie the 
complementary good.) 

(i) What are the socially efficient consumptions of the 
basic and complementary goods by the two types? 

(ii) Suppose that the monopolist knows which con-
sumers are of type 1 and which are of type 2. What prices
will he charge?

(iii) Suppose that the monopolist cannot tell the cus-
tomers apart. Show that the monopoly price still leads to
the socially efficient allocation.

(iv) Suppose that the monopolist gets a “signal.” Stu-
dents are all of type 1. Conditional on a consumer's not
being a student, the probability that he is of type 2
is β > α. Assuming that β(w_2 − c) > v_1, compute the
optimal prices to students and nonstudents. Show that
discrimination leads to an inefficient outcome.

(v) Return to question iii (no signal), but assume that
the monopolist can tie the complementary good. Show
that he sells the basic good alone at price v_1 and the
bundle at price v_1 + (w_2 − v_2), and that his profit in-
creases as a result of the tie-in.

For more on these issues, see J. Ordover, A. Sykes,
and R. Willig, “Nonprice Anticompetitive Behavior by
Dominant Firms toward the Producers of Complementary
Products,” in Antitrust and Regulation, edited by F. Fisher
(MIT Press, 1985).


Exercise 12 [3]*


Consider a monopoly supplier of aluminum. Aluminum
is used as an input in the production of numerous final
goods. For purposes of this exercise, assume that there are
only two final goods that use aluminum. Each of the
goods has a different demand for aluminum.


(i) If explicit price discrimination is not possible, show
that the monopoly input supplier will have an incentive
to integrate into the production of at least one of the final
goods.

(ii) Show that if the monopolist integrates into one
final-goods market, it will integrate into the market with
the most elastic demand.

(iii) Why will final-goods competitors in the integrated
market be “squeezed” out of business?


Exercise 13 [3]*


Disneyland used to offer a variety of admission fees. For
a given fee, the purchaser was entitled to a certain number
of rides once in the park. (For example, there might have
been an “economy” package that cost $10.00 and in-
cluded tickets for five rides and an “adventure” package
that cost $15.00 and included tickets for ten rides.)

(i) Using diagrams and/or equations, explain why
Disneyland’s management might have found this scheme
profitable. Why didn’t they simply charge a fee per ride?

(ii) There are numerous products for which there is
a fixed charge to use the service and then a charge
per usage. Examples are Polaroid cameras and film and
electrofax copying machines and paper. How does the
optimal pricing for these products differ from that of
Disneyland, and in what respects is it similar? Explain the
reasons for the similarities and the differences.


Exercise 14 [3]* 


(i) A “monopolist” faces a single consumer, with de-
mand function q = a − p. His marginal cost is 0. The
monopolist faces a competitive fringe at price p_0 < a
(that is, a perfect substitute is available at price p_0). What
is the optimal pricing scheme under a linear tariff? Under
two-part pricing?

(ii) Suppose now that there are two types of con-
sumers, with demands q = a_1 − p (in proportion x) and
q = a_2 − p (in proportion 1 − x), where p_0 < a_1 < a_2.
Consider linear pricing and third-degree price discrimi-
nation. Would uniform (linear) pricing increase welfare?
(Think of the case in which a_2/2 > p_0 > a_1/2.) How does
the result compare with the Robinson-Schmalensee one
on uniform versus discriminatory pricing under linear
demands?

(iii) Consider the two types of consumers of question ii, 
but look at second-degree price discrimination. Compare 
the monopoly price under linear pricing and the marginal 
price under a two-part tariff. 


Exercise 15 [3, 4]* 


Review the various motives for tie-ins. Explain how you 
might distinguish between them empirically. 


Exercise 16 [3, 4]** 


The Chicken Delight company (CD) owned the rights to 
a particular recipe for making fried chicken and the as- 
sociated trademark. CD manufactured nothing itself, but 
licensed franchisees to make and sell fried chicken in local 
retail outlets using the CD recipe and trademark. It
did not charge a franchise fee; however, it required each 
franchisee to purchase a specified number of cookers and 
fryers and to purchase a variety of supplies (including 
cups and napkins) from CD. CD purchased these pieces 
of equipment and supplies from third parties. There was 
nothing unique to CD about the equipment and supplies. 

(i) Define the term tie-in sale.

(ii) Why might a company like CD enter into the 
arrangement described above with its franchisees? 

(iii) How would you expect the prices CD charged its
franchisees for the equipment and supplies it sold to them
to compare to the prices the franchisees would have paid
had they not been required to purchase from CD?

(iv) Why might CD have used a tie-in arrangement
rather than a franchise fee?

(v) Why might CD have used a tie-in arrangement
rather than taking a share of each franchise's profits?

(vi) Why might CD have required the franchisee to
purchase several types of inputs from it rather than just a
single input such as the frying equipment or the mix for
the chicken coating?

If this case excites you, see Siegal v. Chicken Delight
448 F.2d 43 (9th Circuit 1971) for more details. Chicken
Delight was enjoined from engaging in these practices
and subsequently went out of business.


Exercise 17 [3, 7]*


Consider the model of differentiation on a line with unit-
demand consumers located uniformly along a segment of
length 1 and facing transportation cost t per unit of dis-
tance, and with two firms located at the two extremities
of the segment and having marginal cost c. The firms
compete in a Bertrand fashion.

(i) Show that under uniform pricing the consumers’
generalized cost varies between c + t and c + 3t/2 (if
consumer surplus is sufficiently high).

(ii) Suppose that the duopolists can price-discriminate
depending on the location of the consumer (i.e., can each
choose one delivery price per location). What type of
price discrimination is this? Show that in Bertrand equi-
librium the consumers’ generalized cost varies between
c + t/2 and c + t. How does this compare with the results
obtained in the case of a monopoly in chapter 3?
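
The two ranges of generalized cost can be illustrated with a short sketch. It assumes a segment of length 1 with firms at 0 and 1, linear transportation cost t, the standard uniform-pricing equilibrium price c + t for this model, and, under discrimination, that the nearer firm wins each location by matching the farther firm's delivered cost. The function names are illustrative.

```python
def uniform_cost(x, c, t):
    # Both firms charge c + t; the consumer at x buys from the nearer firm
    # and pays the transportation cost over the distance min(x, 1 - x).
    return c + t + t * min(x, 1 - x)


def discriminatory_cost(x, c, t):
    # Location-by-location Bertrand competition: the farther firm is willing
    # to deliver at its cost c + t * (its distance), and the nearer firm
    # just matches that delivered price.
    return c + t * max(x, 1 - x)
```

Evaluating both functions at the end points and at the midpoint reproduces the bounds in questions i and ii, and at every location the delivered price under discrimination is no higher than the generalized cost under uniform pricing.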


Exercise 18 [3, 7]**


Tim Bresnahan, in his paper “Competition and Collusion
in the American Automobile Industry” (Journal of Econ-
ometrics 35 [1981]: 457-482), estimates a noncoopera-
tive model of vertical product differentiation for the
automobile industry. A car’s quality is taken to be one-
dimensional and is measured by a “hedonic function”
of its characteristics (length, weight, horsepower, etc.).
Bresnahan finds that larger vehicles have larger price-cost
margins and that larger vehicles are farther apart in the
product-quality space. Can you think of models (or varia-
tions thereof) that would explain either or both of these
findings?


Exercise 19 [4, 7, 8]** 


(This exercise follows Rey and Stiglitz’s 1986 mimeo- 
graphed paper “The Role of Exclusive Territories in 
Producers’ Competition” [Princeton University].) Two 
manufacturers (i = 1, 2) produce imperfect substitutes.
The demand for good i is q_i = D_i(p_i, p_j), and demands are
symmetric: D_1(·,·) = D_2(·,·). Let

ε(p_1, p_2) ≡ −(∂D_1/∂p_1)/(D_1/p_1)

denote the own-price elasticity of demand, and let

ε̃(p_1, p_2) ≡ (∂D_1/∂p_2)/(D_1/p_2)

denote the cross-price elasticity. Let ε(p) ≡ ε(p, p). The
unit manufacturing cost is c.

(i) Suppose first that the manufacturers distribute the
good themselves (at no cost). Show that in Nash equilib-
rium the two firms charge price p such that


(p − c)/p = 1/ε(p).


(ii) Suppose now that each manufacturer i distributes
his good through two retailers. Each of these two retailers
carries only brand i and pays A_i + p_{w_i} q to manufacturer i
(where p_{w_i} is the wholesale price). The retailers of a given
brand are undifferentiated and face no distribution cost.
All four retailers choose prices simultaneously in the
second stage of the game. In the first stage, producers
choose retail contracts simultaneously. Argue that A_i
must be equal to zero, and that the outcome is the same
as in question i.

(iii) Suppose now that each manufacturer uses a single
(exclusive) retailer. (Equivalently, each could give exclu-
sive territories to its two retailers.) Retailer i thus maxi-
mizes


(p_i − p_{w_i}) D_i(p_i, p_j) − A_i

over p_i. Let p_i^*(p_{w_i}, p_{w_j}) denote the Nash-equilibrium retail
prices (obtained by solving the competition between re-
tailers for given wholesale prices). Let

m(p_{w_i}, p_{w_j}) ≡ (∂p_i^*/∂p_{w_i})/(p_i^*/p_{w_i})

and

m̃(p_{w_i}, p_{w_j}) ≡ (∂p_j^*/∂p_{w_i})/(p_j^*/p_{w_i})

denote the own- and cross-elasticities of retail-price re-
sponses. Assume that m and m̃ are positive. Comment on
this assumption. Argue that


A_i = [p_i^*(p_{w_i}, p_{w_j}) − p_{w_i}] D_i(p_i^*(p_{w_i}, p_{w_j}), p_j^*(p_{w_j}, p_{w_i})).


Show that the first-period equilibrium in contracts be-
tween the manufacturers implies a symmetric retail price
satisfying


What does this suggest concerning the private desir-
ability of retail competition in this model? What strategy
in the taxonomy of chapter 8 does the choice of monop-
oly pale resemble?


Exercise 20 [5]*


Consider a duopoly with concave inverse demand func-
tion P(q_1 + q_2) and convex costs C_i(q_i). Consider the
standard static Cournot game. Prove that firm i’s best-
response function R_i(q_j) is monotonic in q_j and that the
slope of R_i(q_j) is between −1 and 0.
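
One way to organize the proof is to differentiate the first-order condition. The sketch below assumes, beyond the statement of the exercise, that P and C_i are twice differentiable, that P' < 0, and that the best response is interior.

```latex
\begin{align*}
&P(q_i + q_j) + q_i\,P'(q_i + q_j) - C_i'(q_i) = 0
  \quad \text{at } q_i = R_i(q_j), \\
&R_i'(q_j) = -\,\frac{P' + q_i P''}{2P' + q_i P'' - C_i''}.
\end{align*}
```

With P' < 0, P'' ≤ 0, and C_i'' ≥ 0, the numerator N = P' + q_i P'' is negative, and the denominator equals N + (P' − C_i''), which is negative and larger than N in absolute value. The ratio N over the denominator therefore lies strictly between 0 and 1, so the slope lies strictly between −1 and 0.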


Exercise 21 [5]*


Discuss various rationing rules, their relevance, and why
the choice of the rule may matter when one is formalizing
oligopoly pricing.


Exercise 22 [5]*


(i) Compute the Cournot equilibrium with n firms when
the firms face the inverse demand function p = P(Q)
= Q^(−1/ε) (with ε > 1) and have identical constant marginal
cost c.

(ii) Show that each firm's profit goes down when c
goes up.

(iii) Take n = 2. Show that q_2 is a strategic comple-
ment for q_1 if 0 < q_2 < (1/ε)q_1.
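
Questions i and ii can be explored numerically. The sketch assumes the inverse demand P(Q) = Q^(−1/ε) with ε > 1, constant marginal cost c, and a symmetric interior equilibrium; the function names are illustrative.

```python
def cournot_price(n, eps, c):
    # Symmetric first-order condition: P(Q) + (Q/n) P'(Q) = c.
    # With P(Q) = Q**(-1/eps), P'(Q) * Q = -P(Q)/eps, so
    # P * (1 - 1/(n*eps)) = c.
    return c / (1 - 1 / (n * eps))


def cournot_profit_per_firm(n, eps, c):
    p = cournot_price(n, eps, c)
    total_output = p ** (-eps)  # invert p = Q**(-1/eps)
    return (p - c) * total_output / n
```

For instance, with n = 2 and ε = 2, raising c from 0.75 to 1 raises the equilibrium price but lowers each firm's profit, as question ii asserts for ε > 1.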


Exercise 23 [5, 6]*


Consider an industry that produces a homogeneous
good. Any firm can produce as much as it wants at a


constant marginal cost of c per unit. Assume that all firms 
have price as the strategic variable. 

(i) If the firms choose prices simultaneously and non- 
cooperatively and the market lasts for only one period, 
what price will prevail? Why? 

(ii) Suppose that the firms meet repeatedly. Can the
firms “tacitly collude”? (Describe various theories of
repeated interaction and the stylized facts they are meant
to explain.)


Exercise 24 [5, 7]** 


In chapter 7 we saw that a free-entry equilibrium in gen- 
eral may involve too much or too little entry in an in- 
dustry. This exercise follows von Weizsacker 1980 and 
Mankiw and Whinston 1986 (see chapter 7 for refer- 
ences) to study the level of entry in the special case of a 
homogeneous-good industry. We start with a Cournot 
example, and then sketch Mankiw and Whinston’s more 
general approach. 

Suppose that firms face an entry cost f, and that entry
is free. The industry inverse demand function is p = P(Q),
where Q denotes aggregate output. A firm that enters has
cost function C(q), where q is individual output (C(0)
= 0, C'(q) > 0, C"(q) > 0). In a symmetric equilibrium in
which n firms enter, the output per firm is q(n) (so that
aggregate output is Q(n) = nq(n)).

Compare the free-entry equilibrium not with the first- 
best optimum (which, because of product homogeneity, 
would have a single firm, pricing at marginal cost if 
C” = 0), but with the socially optimal number of firms 
subject to the constraint that the social planner is unable 
to control the behavior of a firm once it is in the market. 
Let n^e and n* denote the number of firms in the free-entry
equilibrium and the constrained welfare optimum. In ques-
tions i and ii, assume that these are real numbers (i.e.,
ignore the integer constraint).

(i) Assume that P(Q) = a − bQ and C(q) = cq. Show
that if entering firms compete in a Cournot fashion, then


(n^e + 1)^2 = (a − c)^2/(bf)

and

(n* + 1)^3 = (a − c)^2/(bf).

In the free-entry equilibrium, is there socially too little
entry, or too much?

(ii) Following Mankiw and Whinston, make the follow-
ing more general assumptions concerning post-entry mar-
ket behavior:

A1: Q(n) increases with n.

A2: q(n) decreases with n (business-stealing effect).

A3: P(Q(n)) − C'(q(n)) > 0 (market power).

Show that the profit per firm decreases with the number
of entering firms (n^e is such that this profit is equal to
zero). Next, show that the social welfare,


∫_0^{Q(n)} P(x) dx − n C(q(n)) − nf,

is decreasing at n = n^e. Then show that n^e > n*.

(iii) Reintroducing the integer constraint, Mankiw and
Whinston show that n^e > n* − 1. Can you think of a
homogeneous-good industry in which n^e < n*? (Hint:
Recall chapter 2.)
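
For the linear Cournot case of question i, the two entry levels can be compared numerically. The sketch assumes P(Q) = a − bQ, C(q) = cq, entry cost f, the symmetric Cournot output per firm (a − c)/(b(n + 1)), and n treated as a real number; the function names are illustrative.

```python
def free_entry_n(a, b, c, f):
    # Zero-profit condition: (a - c)^2 / (b (n + 1)^2) = f.
    return (a - c) / (b * f) ** 0.5 - 1


def welfare_optimal_n(a, b, c, f):
    # dW/dn = (a - c)^2 / (b (n + 1)^3) - f = 0.
    return ((a - c) ** 2 / (b * f)) ** (1 / 3) - 1


def welfare(n, a, b, c, f):
    # Gross surplus minus production and entry costs at the Cournot output.
    total_output = n * (a - c) / (b * (n + 1))
    return (a - c) * total_output - b * total_output ** 2 / 2 - n * f
```

With a = 10, b = 1, c = 2, and f = 1, the free-entry number of firms is 7 while the constrained optimum is 3, an instance of excessive entry under these assumptions.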


Exercise 25 [5]***


Go through the Kreps-Scheinkman proof that, under effi-
cient rationing, the outcome of the two-stage capacity-
and-then-price game is the Cournot outcome for the
following demand function:


D(p) = 1 for p ≤ 1,
D(p) = 0 for p > 1.


Compute the reduced-form profit functions and the equi-
librium (pure or mixed) strategies.


Exercise 26 [5, 7, 8]*


A great deal of empirical work in industrial organization
has focused on the estimation of relationships such as


(Profit rate)_i
= a + b·(Concentration ratio)_i
+ c·(Minimum efficient scale)_i
+ d·(Advertising/sales ratio)_i
+ e·(Capital/output ratio)_i,


where each variable is measured at the industry (i) level
and where observations for n industries are used to esti-
mate the relationship.


(i) Define the concentration-ratio and minimum-efficient- 
scale variables. 

(ii) Why are economists interested in estimating rela- 
tionships such as this? 

(iii) Is industry concentration a good proxy for the 
degree of competition in an industry? 

(iv) What are the other right-hand-side variables sup- 
posed to pick up in this relationship? 

(v) Can you interpret such a relationship as causal? 


Exercise 27 [7, 8]** 


Michael Whinston, in “Tying, Foreclosure, and Exclusion”
(mimeo, Harvard University, 1987), argues that bundling 
may deter entry in a market. 

(i) Consider the following model: There are two com-
pletely unrelated markets and two firms. Market A is
monopolized by firm 1. Consumers have unit demands. A
fraction x (respectively 1 − x) of consumers has valuation
v̄ (respectively v) for good A, with v̄ > v > c_A (where c_A
is firm 1's unit cost of production for good A). Assume
that the firm cannot perfectly price-discriminate and that
x(v̄ − c_A) < v − c_A. Market B is served by firm 1 and
firm 2. It is a differentiated market on the line. Transporta-
tion costs are linear (t per unit of distance). The density of
consumers is uniform and equal to 1 along the segment of
length 1. Firms 1 and 2 are located at the two extremes of
the city. Their marginal cost for good B is c_B. Consumers
have unit demands in market B as well, and the demands
in the two markets are independent. (A consumer's prob-
ability of having valuation v̄ for good A is independent of
his location in market B.) Let p_A be firm 1's price in market
A, and let p_1 and p_2 be firm 1's and firm 2's prices in the
differentiated market. Suppose that firm 1 sells the two
goods separately. What is p_A? Compute the reaction
curves p_i = R_i(p_j) in market B. What is the Nash equi-
librium in market B?

(ii) Consider the following timing: Firm 1 can “bundle”
its two goods or not; firm 2 enters (at its extremity of the
city) and pays an entry cost F or stays out; the firms
choose their prices simultaneously. Bundling is a techno-
logical decision that forces firm 1 to sell the two goods
together (it then charges a single price π). If firm 1 does
not bundle, it sells the two goods separately, and the
equilibrium is as before. (For simplicity, we will not con-
sider mixed bundling.) Suppose that firm 1 bundles and
firm 2 enters. Show that there exist two cutoff locations
on the line. Show that firm 1's demand is


[p_2 − (π − v^e) + t]/2t,


where v^e ≡ x v̄ + (1 − x)v. Show that firm 1's new reac-
tion curve satisfies


R̃_1(p_2) < R_1(p_2) + v.


Informally determine whether bundling is good for entry
deterrence and for entry accommodation.


Exercise 28 [8]*


Consider an industry consisting of three firms. Each firm
has the same cost structure, given by


C(q_i) = 5 + 2q_i


for firm i. Market demand is given by the inverse de-
mand function,


P(Q) = 18 − Q,


where Q = q_1 + q_2 + q_3. The production timing is as
follows: Firm 1 produces its output first. Knowing firm 1's
output, firm 2 produces its output. Knowing both firm 1's
and firm 2’s output, firm 3 then produces its output. Each
firm knows that this is the timing of production; thus,
firm 1 (for example) knows when it makes its production
choice that it will be followed by firms 2 and 3. The
industry demand and cost functions are known to each
firm.
What will the equilibrium values of q_1, q_2, and q_3 be?
What standard duopoly model is this an extension of?


Exercise 29 [8]** 


(i) Describe the use of various strategic effects to deter
entry or accommodate a rival in two-period settings.
(Think of puppy dogs and other cute animals.)

(ii) Answer the following (in words) and explain:


• A foreign firm that competes in prices with a domestic
firm in the domestic market suffers from facing a quota.
True or false?

• Two firms are in the market in period 1. Firm 2 does
not know firm 1's marginal cost. Firms compete in prices.
Is cutting price a good strategy for firm 1 if it wants to
drive firm 2 out of the market? If it wants to accommodate
firm 2?

• “Advertising is like capital. An incumbent should over-
invest to deter entry.” True or false?

• “The principle of differentiation (say, in spatial compe-
tition) is an instance of a puppy dog effect.” True or false?


Exercise 30 [8]** 


There are two firms, with demand functions D_i = 1 − 2p_i
+ p_j. Firm 2's (the entrant's) marginal cost is 0. Firm 1's
(the incumbent's) marginal cost is initially 1/2. By invest-
ing I = 0.205, the incumbent can buy the new technol-
ogy and reduce its marginal cost to 0.

(i) Consider the timing: The incumbent chooses whether 
to invest; then the entrant observes the incumbent's in- 
vestment decision; then the firms compete in prices. Show 
that in perfect equilibrium the incumbent does not invest. 

(ii) Show that if the investment decision is not observed 
by the entrant, the incumbent's investing is an equilib- 
rium. Comment. 

(iii) Explain why the conclusion in question i may be 
affected if the entrant faces a fixed entry cost. Does 
it matter whether the potential entrant makes its entry 
decision before or after the incumbent's investment de- 
cision? (Just give the argument; do not compute anything.) 
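
The claims in questions i and ii can be checked with a few lines of arithmetic. The sketch assumes demands D_i = 1 − 2p_i + p_j, an entrant with zero marginal cost, an incumbent whose initial marginal cost is taken to be 1/2, and an investment cost of 0.205. The function and constant names are illustrative, not from the text.

```python
def best_response(p_rival, cost):
    # A firm maximizes (p - cost) * (1 - 2p + p_rival).
    # First-order condition: 1 - 4p + p_rival + 2*cost = 0.
    return (1 + p_rival + 2 * cost) / 4


def profit(p, p_rival, cost):
    return (p - cost) * (1 - 2 * p + p_rival)


def nash_prices(c1, c2):
    # Intersection of the two linear best responses
    # 4*p1 - p2 = 1 + 2*c1 and 4*p2 - p1 = 1 + 2*c2.
    p1 = (4 * (1 + 2 * c1) + (1 + 2 * c2)) / 15
    p2 = (4 * (1 + 2 * c2) + (1 + 2 * c1)) / 15
    return p1, p2


INVESTMENT = 0.205  # cost of the new technology
OLD_COST = 0.5      # assumed initial marginal cost of the incumbent


def observed_case():
    # The entrant sees the decision, so prices adjust to the realized cost.
    p1, p2 = nash_prices(OLD_COST, 0.0)
    stay = profit(p1, p2, OLD_COST)
    p1, p2 = nash_prices(0.0, 0.0)
    invest = profit(p1, p2, 0.0) - INVESTMENT
    return stay, invest


def unobserved_case():
    # Candidate equilibrium: the incumbent invests and the entrant expects it.
    p1, p2 = nash_prices(0.0, 0.0)
    invest = profit(p1, p2, 0.0) - INVESTMENT
    # Deviation: keep the old cost and best-respond to the unchanged p2.
    p_deviation = best_response(p2, OLD_COST)
    deviate = profit(p_deviation, p2, OLD_COST)
    return invest, deviate
```

Under these assumptions, not investing pays more when the decision is observed (the entrant prices less aggressively against a high-cost incumbent), whereas investing survives the deviation check when the decision is unobserved, because the entrant's price cannot react.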


Exercise 31 [7,8]** 


In a single-period framework, offering a most-favored-
nation clause is equivalent to charging a uniform price (as
long as customers have perfect information about trading
prices). Consider the following model: There are two
markets and three firms. Each market is one of differentia-
tion on the line, with two producers located at each ex-
tremity. The transportation cost is t per unit of distance
(that is, we assume linear transportation costs). Con-
sumers are uniformly distributed along the segment, and
the numbers of consumers in each market are equal (and
are normalized to 1). The only difference between the
markets is their length. Market i has length l_i, with l_1 ≥ l_2
(so the products are, in a sense, more differentiated in
market 1). Firm 0 (the “chain store” or “national firm”)
serves both markets. The other extremities of each market
are occupied by firm 1 (market 1) and firm 2 (market 2)
(“local or regional firms”). The production cost in each
market for existing firms is c.

(i) Suppose that firm 0 charges different prices in the two
markets (i.e., it price-discriminates). Solve for the Nash
equilibrium in prices. Show that market 1 is more profitable
than market 2.

(ii) Suppose from now on that firm 0 can commit to a
most-favored-nation clause (that is, it uses uniform pricing
across markets). Show that nothing is changed if $l_1 = l_2$.
So assume that $l_1 > l_2$. Show that uniform pricing lowers
prices in market 1 and raises prices in market 2. Show that
firm 0 loses from imposing the price-protection policy. Derive
the intuition. (Hint: How does the uniform price change when
$l_1 + l_2$ is kept constant and $l_1 - l_2$ increases?) Show
that a social planner's preventing firm 0 from
price-discriminating would lower welfare. (Hint: Welfare can be
identified with transportation costs in this simple model.) Why
isn't the Robinson-Schmalensee result concerning the
desirability of uniform pricing with linear demand curves (see
chapter 3) applicable? (After all, the residual demand curves
for firm 0 are linear.) The model used in this exercise is
considered by Patrick DeGraba in “The Effects of Price
Restrictions on Competition between National and Local Firms”
(Rand Journal of Economics 18 [1987]: 333-347). DeGraba assumes
quadratic transportation costs.

An interesting aspect of DeGraba's model is that the
locations of the local firms are not exogenously fixed.
These firms can choose to locate near or far from the
national firm in their respective product spaces. With
endogenous location choices, uniform pricing changes the
nature of competition even when $l_1 = l_2$. The reason for
this is that a local firm's moving closer to the center of the
segment (and thus closer to the national rival) triggers
a smaller price cut by the national firm under uniform
pricing, as the latter has no reason to cut its price in its
other market. And indeed the local firms do not choose
maximal differentiation under uniform pricing, whereas
they do under discriminatory pricing (see the result of
d'Aspremont et al., discussed in chapter 7). For his
specification, DeGraba finds that transportation costs are lower
and welfare higher under uniform pricing.
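
A numerical check can help in verifying one's algebra for exercise 31. The following is only a sketch, written under assumptions that are not part of the exercise: both markets are fully covered, every firm keeps a positive share, and the parameter values ($l_1 = 2$, $l_2 = 1$, $t = 1$, $c = 0$) and all function names are purely illustrative. It iterates best responses derived from the first-order conditions of the linear-cost model.

```python
def discriminatory_price(length, t, c):
    """Symmetric Nash price in one market when firm 0 prices market by market."""
    # With a unit mass of consumers on a segment of the given length,
    # firm 0's share is 1/2 + (p_local - p0) / (2 * t * length).
    # Mutual best responses p = (p_rival + t * length + c) / 2 give c + t * length.
    return c + t * length


def uniform_equilibrium(l1, l2, t, c, rounds=200):
    """Iterate best responses when firm 0 charges one price in both markets."""
    p0 = p1 = p2 = c
    k = 1.0 / (2 * t * l1) + 1.0 / (2 * t * l2)
    for _ in range(rounds):
        # Each local firm best-responds to firm 0 in its own market.
        p1 = (p0 + t * l1 + c) / 2
        p2 = (p0 + t * l2 + c) / 2
        # Firm 0's first-order condition on the sum of its two demands.
        p0 = (1 + p1 / (2 * t * l1) + p2 / (2 * t * l2) + c * k) / (2 * k)
    return p0, p1, p2


def firm0_profit(p0, p_local, length, t, c):
    """Firm 0's profit in one market (interior shares assumed)."""
    share = 0.5 + (p_local - p0) / (2 * t * length)
    return (p0 - c) * share


# Illustrative parameter values (assumptions, not taken from the text).
l1, l2, t, c = 2.0, 1.0, 1.0, 0.0
pd1 = discriminatory_price(l1, t, c)
pd2 = discriminatory_price(l2, t, c)
profit_disc = firm0_profit(pd1, pd1, l1, t, c) + firm0_profit(pd2, pd2, l2, t, c)
p0, p1, p2 = uniform_equilibrium(l1, l2, t, c)
profit_unif = firm0_profit(p0, p1, l1, t, c) + firm0_profit(p0, p2, l2, t, c)
print(pd1, pd2, p0, profit_disc, profit_unif)
```

For these illustrative values the uniform price lies strictly between the two discriminatory prices and firm 0's total profit is lower under uniform pricing, which is the pattern the exercise asks the reader to establish in general.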


Exercise 32 [8]* 


Consider a new industry. Firm 1 can choose one of two
technologies. Technology A requires a fixed cost $f_A$ and
can produce goods at a cost of $c_A$ per unit. The costs with
technology B are $f_B$ and $c_B$. After firm 1 has chosen its
technology, firm 2 can consider entering. It has only one
technology choice, with costs $f_2$ and $c_2$.

(i) Why might the magnitude of $f_2$ determine which
technology firm 1 chooses?

(ii) Might firm 1 adopt a different technology if it were 
not facing entry? Why or why not? 

(iii) Relate this model to the ideas of Spence and Dixit 
discussed in chapter 8. 


Exercise 33 [8, 9]** 


When a near-monopolist sets its price so low that a small 
rival goes out of business, the smaller firm is likely to 
charge the larger one with predatory pricing. 

(i) Is the conduct described above always welfare- 
reducing? If not, indicate when welfare might be en- 
hanced. (No formal analysis is necessary.) 

(ii) Areeda and Turner (Harvard Law Review 88 [1975]:
697-733) argued that the larger firm's price should be
held to be predatory if and only if it is below that firm's 
short-run marginal cost, and they further contended that 
average variable cost could be used as a proxy for 
short-run marginal cost. Evaluate both components of 
this argument. 

(iii) Should established firms be found guilty of a pred- 
atory response to entry if and only if they increase their 
own output when entry occurs? 

(iv) Describe what standards you think the courts 
should use to evaluate charges of predatory pricing. 
Should evidence of intent be considered? Market struc- 
ture? Costs? Comment on the welfare properties and the 
administrative feasibility of your proposal. 


Exercise 34 [8, 10]** 


Can you find a rationale for a law specifying the com- 
pulsory licensing of patents in cases of nonuse or insuf- 
ficient use without “legitimate excuse” within n years 
from the date of the grant of the patent? Discuss potential 
problems of enforcing such a law. 


Exercise 35 [9]** 
Two firms compete in prices. Firm $i$'s demand is

$$q_i = a - b p_i + d p_j,$$

where $b > 0$ and $-b < d < b$. Each firm's unit cost can
be low ($c_L$) or high ($c_H$) with equal probabilities. Each
firm knows its unit cost but not its rival's.

(i) Solve for the price equilibrium in a one-period
framework.

(ii) Suppose that firms compete in price in each of
two periods. Does the previous one-period equilibrium
yield the first-period equilibrium prices? If not, explain
what happens (if you are courageous, solve for the dynamic
equilibrium). Does the sign of $d$ make a qualitative
difference?
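
For question i of exercise 35, a small numerical sketch may be useful for checking a candidate answer. It assumes an interior Bayesian equilibrium in which each cost type maximizes expected profit against the rival's expected price, so that a type with cost $c_k$ sets $p_k = (a + d\,E[p_j] + b c_k)/2b$. The function names and parameter values are illustrative assumptions, not part of the exercise.

```python
def one_period_prices(a, b, d, c_low, c_high):
    """Candidate equilibrium prices of the low- and high-cost types."""
    c_mean = (c_low + c_high) / 2
    # Averaging the two first-order conditions pins down the expected price.
    p_mean = (a + b * c_mean) / (2 * b - d)
    p_low = (a + d * p_mean + b * c_low) / (2 * b)
    p_high = (a + d * p_mean + b * c_high) / (2 * b)
    return p_low, p_high


def expected_profit(p, cost, a, b, d, rival_prices):
    """Expected profit against a rival whose two types are equally likely."""
    expected_rival_price = sum(rival_prices) / len(rival_prices)
    return (p - cost) * (a - b * p + d * expected_rival_price)


# Illustrative parameter values (assumptions, not taken from the text).
a, b, d, c_low, c_high = 10.0, 2.0, 1.0, 1.0, 3.0
p_low, p_high = one_period_prices(a, b, d, c_low, c_high)
print(p_low, p_high)
```

Evaluating `expected_profit` at prices slightly above or below `p_low` (with the rival's prices held at `(p_low, p_high)`) gives a quick check that neither type gains from a small deviation.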


Exercise 36 [9]*


Review the various theories linking predation and asymmetric
information, and compare them.


Exercise 37 [9]** 


(i) Consider the model of product differentiation on a
line. The line has length 1. The consumers are uniformly
distributed along it; they face transportation cost $t$ per
unit of distance and have unit demand, with valuation
$v$ for the good. Suppose that $3t/2 > v > t$. Firm 1 is
located at the left end of the segment. Firm 2 may or may
not be located at the right end of the segment. (Firms
have at most one product each.) Production costs are 0.
Show that if firm 2 does not enter, firm 1's monopoly
profit is $v^2/(4t)$. Show that if firm 2 enters, each firm's
duopoly profit is $\tfrac{1}{2}(v - t/2)$ and the duopoly price is
$v - t/2$.

(ii) Consider the Milgrom-Roberts model for the setup
outlined in question i. At date 1, firm 1 is a monopolist.
Firm 2 observes firm 1's first-period price and decides
whether to enter. Firm 1 has private information not
about the production cost (which is 0), but about the
common demand parameter $v$, which can take as a value
$\underline{v}$ or $\bar{v}$, where $3t/2 > \bar{v} > \underline{v} > t$.
Suppose for simplicity that firm 2 learns $v$ before
second-period product-market competition if it decides to
enter. Suppose that the entry cost lies between
$\tfrac{1}{2}(\underline{v} - t/2)$ and $\tfrac{1}{2}(\bar{v} - t/2)$. The
discount factor is $\delta$. Look for a separating equilibrium. What
is the price charged by firm 1 when $v = \bar{v}$? Write the
condition(s) to be satisfied by the price $p_1$ charged by firm
1 when $v = \underline{v}$ (how does $p_1$ compare with
$\underline{v}/2$?). Show that for $\delta = 1$ and $\bar{v} = 3t/2$,
the least-cost separating price is $p_1 = t/2 < \underline{v}/2$.
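
The two claims in question i of exercise 37 can be checked numerically before attempting a proof. The sketch below uses a grid search over prices. The values $v = 1.2$ and $t = 1$, the grid size, and all function names are illustrative assumptions. The duopoly part checks only that $v - t/2$ is a best response to itself, which is necessary but does not by itself rule out other equilibria.

```python
def monopoly_share(p, v, t):
    """Fraction of consumers on [0, 1] who buy from firm 1 at price p."""
    return min(1.0, max(0.0, (v - p) / t))


def duopoly_share(p1, p2, v, t):
    """Firm 1's share when firm 2 sits at the right end and charges p2."""
    willing = (v - p1) / t                  # consumers with nonnegative surplus
    prefers_firm1 = (p2 - p1 + t) / (2 * t)  # consumers who prefer firm 1 to firm 2
    return min(1.0, max(0.0, min(willing, prefers_firm1)))


def argmax_on_grid(profit, upper, steps=100_000):
    """Return the grid price in [0, upper] with the highest profit."""
    best_p, best_val = 0.0, profit(0.0)
    for k in range(1, steps + 1):
        p = upper * k / steps
        val = profit(p)
        if val > best_val:
            best_p, best_val = p, val
    return best_p, best_val


# Illustrative values satisfying 3t/2 > v > t (assumptions, not from the text).
v, t = 1.2, 1.0
p_m, profit_m = argmax_on_grid(lambda p: p * monopoly_share(p, v, t), v)
p_star = v - t / 2
p_d, profit_d = argmax_on_grid(lambda p: p * duopoly_share(p, p_star, v, t), v)
print(p_m, profit_m, p_d, profit_d)
```

The monopoly search can be compared with $v/2$ and $v^2/(4t)$, and the duopoly search with $v - t/2$ and $\tfrac{1}{2}(v - t/2)$, up to the grid resolution.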


Exercise 38 [9, 11]**


Consider the following game between an incumbent firm
and an entrant: First the entrant decides whether to enter
(E) or not (N). Then, if the entrant enters, the incumbent
decides whether to prey (P) or acquiesce (A). The incumbent
and the entrant get 4 and 0 if N, 1 and -1 if E and P, and
2 and 1 if E and A.

(i) Compute the perfect equilibrium of this game. 

(ii) Suppose that the game is repeated twice (the discount
factor is $\delta = 1$, say). What is the equilibrium?

(iii) Suppose that the game is repeated twice. With
probability $x$, the incumbent has payoff as described
above. With probability $1 - x$, he has payoff 3 when E
and P (that is, he enjoys preying) and otherwise he has the
same payoff as before. Only the incumbent knows his
payoff. Does the incumbent with payoffs as before play
the same strategy as in question ii? Does the entrant learn
the incumbent's “type” in period 1?

(iv) Solve the game described in question iii (consider
two cases depending on whether $x$ exceeds $\tfrac{1}{2}$ or not).
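
The backward-induction logic of question i of exercise 38, and the entrant's last-period entry rule that underlies the case distinction in question iv, can be sketched in a few lines. The dictionary layout and function names are illustrative assumptions. The second function assumes that, in the last period, the incumbent with the original payoffs acquiesces and the incumbent who enjoys preying preys.

```python
# Payoffs are (incumbent, entrant), as given in the exercise.
PAYOFFS = {
    ("N",): (4, 0),
    ("E", "P"): (1, -1),
    ("E", "A"): (2, 1),
}


def incumbent_reply(payoffs):
    """Incumbent's optimal reply to entry in the one-shot game."""
    return max(("P", "A"), key=lambda reply: payoffs[("E", reply)][0])


def perfect_equilibrium(payoffs):
    """Backward induction: solve the incumbent's choice, then the entrant's."""
    reply = incumbent_reply(payoffs)
    enter_value = payoffs[("E", reply)][1]
    stay_out_value = payoffs[("N",)][1]
    entry = "E" if enter_value > stay_out_value else "N"
    return entry, reply


def entrant_enters_last_period(prob_original_type, payoffs):
    """Entry rule in the final period given the entrant's current belief."""
    value = (prob_original_type * payoffs[("E", "A")][1]
             + (1 - prob_original_type) * payoffs[("E", "P")][1])
    return value > payoffs[("N",)][1]


print(perfect_equilibrium(PAYOFFS))
print(entrant_enters_last_period(0.6, PAYOFFS))
print(entrant_enters_last_period(0.4, PAYOFFS))
```

With the payoffs of the exercise, the entrant's expected last-period payoff from entering changes sign when the belief that the incumbent has the original payoffs equals one half, which is why question iv separates two cases.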


Exercise 39 [11]*** 


One interesting issue in a situation of repeated interaction 
between two asymmetrically informed parties is whether 
these parties increase or decrease the “stakes” of their 
relationship over time. Consider two examples. In the 
first, a lender makes a loan to a borrower, who can reim- 
burse or default. The lender has incomplete information 
about the borrower's honesty. It is shown that the size of 
the loan (the stake) increases over time, conditional on 
the previous loans having been reimbursed. The second 
example is the predation model of chapters 9 and 11. The 
entrant can enter two markets held by the incumbent 
sequentially, one big and one small. He does not know 
whether the incumbent enjoys preying. In equilibrium, he 
enters the big market first (decreasing stakes). Meditate 
on the difference between these examples. The first 
example was inspired by J. Sobel's paper “A Theory of
Credibility” (Review of Economic Studies 52 [1985]: 557-574);
the second was suggested by Drew Fudenberg.


Increasing Stakes 


In a given period, a lender can give $C(A)$ to a borrower
(with $C(0) = 0$, $C'(0) = 0$, $C'(A) > 0$ for $A > 0$, $C''(A) > 0$).
This investment yields $A$. The borrower can default
(keep $A$) or reimburse (give $A$ to the lender). The
lender has incomplete information about the borrower's
honesty. An honest borrower (probability $x_1$) always
reimburses. A “dishonest” borrower (probability $1 - x_1$)
maximizes his expected payoff. Let $A^*$ and $A^{**}$ be defined
by $C'(A^*) = 1$ and $C'(A^{**}) = x_1$.

(i) Interpret $A^*$ and $A^{**}$.

(ii) Suppose that there are two periods (1 and 2). The
lender can loan $A_1 \geq 0$ and then $A_2 \geq 0$. Whether the
borrower has defaulted in the first period is observed
before the lender makes the second-period loan. The
discount factor is $\delta < 1$. Show that, in equilibrium,
$A_2 = A_1/\delta$ if the first loan was reimbursed and $A_2 = 0$
otherwise. (Hint: Show that for $A_1 < \delta A^{**}$, $A_1$ is always
reimbursed and that for $A_1 > \delta A^*$ it is not reimbursed by
the dishonest type. What happens for
$\delta A^{**} < A_1 < \delta A^*$?)


Decreasing Stakes 


Consider the chain-store paradox (exercise 11.12). There
are two periods and two markets. With prior probability
$1 - x_1$, the incumbent is “sane,” and the payoff structure
is as in figure 11.12 in the “small market”; in the “big
market,” all the payoffs in figure 11.12 are multiplied by
2. With prior probability $x_1$, the incumbent always preys
(and the entrant's payoff is the payoff or twice the payoff
indicated in figure 11.12). The discount factor is $\delta = 1$,
and x, + 4. The entrant can enter only one market per 
period and can choose which market to enter first.

(iii) Show that the entrant enters the big market in the
first period, and (possibly) the small market in the second
period.


Exercise 40***


Think of a private rationale for, and potential social
consequences of, the following practices relating to patents
and licensing:

(i) “package licensing” (licensing of all patents owned
in a particular field and refusal to grant licenses for less
than the entire group of patents)

(ii) “grant-back licensing” (requiring the licensee to
assign or grant back to his licensor an innovation or
improvement discovered in using the patent)

(iii) “patent pool and cross-licensing” (in which two or
more members of an industry make their patents mutually
available)


Exercise 41**


True, false, or uncertain? 
(i) “A durable-good monopolist prefers leasing to
selling.”

(ii) “Price discrimination may destabilize a cartel
(understand: tacit collusion).”

(iii) “Entry promotes competition and increases social 
welfare.” 

(iv) “The dominant firm always preempts other firms in 
the choice of investments, in product positioning, or in 
the adoption of new technologies.” 

(v) “Price adjustments in oligopoly are more sluggish 
for an upward shock in demand than for a downward 
shock.” 


Exercise 42* 


Discuss the following assertion: “In a monopoly situation, 
the monopoly profit is part of the dead-weight loss.” 
(Think of various situations where it is true, false, or in 
between.) 


Exercise 43** 


Schmalensee, in his chapter on inter-industry studies of 
structure and performance in the forthcoming Handbook of 
Industrial Organization, offers a useful synthesis of cross- 
sectional evidence. Go through his list of stylized facts, 
and think of various theories that are consistent with each 
of them. Here are a few examples to work through: 

(i) The price-cost margin and the accounting rates of 
return are weakly correlated (stylized fact 3.1). 

(ii) In cross-section comparisons involving markets 
in the same industry, seller concentration is positively 
related to the level of price (stylized fact 4.1). 

(iii) Legal restrictions on local advertising in the United 
States are associated with higher retail prices (stylized fact 
4.2). 

(iv) Seller concentration is positively related to esti- 
mates of the minimum market share of an efficient plant, 
and to capital intensity (stylized facts 5.2 and 5.3).